guiaderioclaro.com.br
robots.txt

Robots Exclusion Standard data for guiaderioclaro.com.br

Resource Scan

Scan Details

Site Domain guiaderioclaro.com.br
Base Domain guiaderioclaro.com.br
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-11-17T05:38:45+00:00
Next Scan 2026-02-15T05:38:45+00:00

Last Successful Scan

Scanned2024-04-27T01:38:48+00:00
URL https://guiaderioclaro.com.br/robots.txt
Domain IPs 187.1.136.75, 2804:10:8015::136:75
Response IP 187.1.136.75
Found Yes
Hash 35442364590a8612ce00d8f8ddfe1096a1dd2ff528c6f45a0a833be5bec489e8
SimHash 2d5151e5e550

Groups

bingbot
slurp
googlebot
googlebot-images
adsbot-google
mediapartners-google
feedfetcher-google
facebot
facebookexternalhit
twitterbot
ia_archiver

Product Comment
bingbot Bing
slurp Yahoo
googlebot Google
googlebot-images Google Imagens
adsbot-google Google Adwords
mediapartners-google Google Partners
feedfetcher-google Google Feed
facebot facebook
facebookexternalhit facebook hit
twitterbot twitterbot
ia_archiver ia
Rule Path
Disallow /app
Disallow /api
Disallow /assinantes
Disallow /cgi-bin
Disallow /classificado
Disallow /curriculo
Disallow /download
Disallow /emprego
Disallow /english
Disallow /espanol
Disallow /evento
Disallow /financeiro
Disallow /gerencia
Disallow /import
Disallow /js
Disallow /lib
Disallow /vendas

*

Rule Path
Disallow /