infoguia.com
robots.txt

Robots Exclusion Standard data for infoguia.com

Resource Scan

Scan Details

Site Domain infoguia.com
Base Domain infoguia.com
Scan Status Ok
Last Scan 2024-11-14T13:27:30+00:00
Next Scan 2024-11-21T13:27:30+00:00

Last Scan

Scanned 2024-11-14T13:27:30+00:00
URL https://infoguia.com/robots.txt
Domain IPs 52.22.79.83
Response IP 52.22.79.83
Found Yes
Hash f4f37642bc9652e19182a4f724ea627140913a0c028c4f692ca0d2daedfda591
SimHash f41cd950a2a0

Groups

applebot-extended
facebookbot
amazonbot
gptbot
ccbot
chatgpt-user
google-extended
anthropic-ai
claudebot
claude-web
omgili
omgilibot
imagesiftbot
bytespider
awariorssbot
awariosmartbot
cohere-ai
perplexitybot
dataforseobot
diffbot
youbot
magpie-crawler
peer39_crawler
peer39_crawler/1.0

Rule Path
Disallow /
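
A minimal sketch of how the group above could be checked programmatically, using Python's standard urllib.robotparser. The robots.txt text here is rebuilt from the scan data (the listed user agents sharing one "Disallow: /" rule); the exact layout of the live file is an assumption, and build_robots_txt and is_allowed are hypothetical helper names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# User agents listed under "Groups" in the scan above.
AGENTS = [
    "applebot-extended", "facebookbot", "amazonbot", "gptbot", "ccbot",
    "chatgpt-user", "google-extended", "anthropic-ai", "claudebot",
    "claude-web", "omgili", "omgilibot", "imagesiftbot", "bytespider",
    "awariorssbot", "awariosmartbot", "cohere-ai", "perplexitybot",
    "dataforseobot", "diffbot", "youbot", "magpie-crawler",
    "peer39_crawler", "peer39_crawler/1.0",
]


def build_robots_txt(agents, rules):
    """Rebuild one robots.txt group: User-agent lines, then the shared rules.

    Assumption: the live file groups all agents over a single rule set,
    as the scan summary suggests.
    """
    lines = [f"User-agent: {agent}" for agent in agents]
    lines += [f"{directive}: {path}" for directive, path in rules]
    return "\n".join(lines) + "\n"


ROBOTS_TXT = build_robots_txt(AGENTS, [("Disallow", "/")])

# Parse the reconstructed text locally; no network request is made.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent, url="https://infoguia.com/"):
    """Return True if the reconstructed rules let `agent` fetch `url`."""
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "ClaudeBot", "Googlebot"):
        print(agent, "allowed" if is_allowed(agent) else "blocked")
```

The scan lists only this one group, so under the reconstructed file any agent not named in it falls through to the parser's default of allowing access. The live file may contain directives that the scan summary does not show.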