infoguia.com.do
robots.txt

Robots Exclusion Standard data for infoguia.com.do

Resource Scan

Scan Details

Site Domain infoguia.com.do
Base Domain infoguia.com.do
Scan Status Ok
Last Scan 2026-02-09T01:58:39+00:00
Next Scan 2026-02-16T01:58:39+00:00

Last Scan

Scanned 2026-02-09T01:58:39+00:00
URL https://infoguia.com.do/robots.txt
Domain IPs 52.22.79.83
Response IP 52.22.79.83
Found Yes
Hash f9c69efef7dbc0899e69b4dca762b595966049703968537231c5fffd748367fb
SimHash fc1c495083a0
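
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. A minimal sketch, assuming SHA-256 over the raw response body, of how such a fingerprint could be computed to detect changes between scans. The function name `body_digest` and the sample body are hypothetical and are not taken from the scanned file.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a raw robots.txt body.

    Assumption: the report's Hash field is SHA-256; the source does not say so.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, NOT the file actually served by infoguia.com.do.
sample = b"User-agent: *\nDisallow: /tienda/\n"
digest = body_digest(sample)

# A SHA-256 hex digest always has 64 characters, like the Hash field above.
print(len(digest))
```

Comparing the digest from one scan with the digest from the next is a cheap way to tell whether the file changed at all, without diffing its contents.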

Groups

*

Rule Path
Disallow /webempresa.asp?cod=*
Disallow /linkstats.asp?cod=*

applebot-extended
ccbot
anthropic-ai
omgili
omgilibot
imagesiftbot
bytespider
awariorssbot
awariosmartbot
cohere-ai
dataforseobot
diffbot
magpie-crawler
peer39_crawler
peer39_crawler/1.0

Rule Path
Disallow /

*

Rule Path
Disallow /tienda/
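
The groups listed above can be read as a small rule table. The following is a minimal sketch, not a full Robots Exclusion Protocol parser, of how a crawler might evaluate them. It assumes that groups sharing the same user agent are merged (so both `*` groups apply together), that a named group takes precedence over `*`, and that `*` in a path is a wildcard. The names `GROUPS`, `rules_for`, `pattern_to_regex`, and `is_allowed` are hypothetical, and user-agent matching is simplified to an exact, case-insensitive comparison.

```python
import re

# Groups as listed in the scan: (user agents, disallowed path patterns).
GROUPS = [
    (["*"], ["/webempresa.asp?cod=*", "/linkstats.asp?cod=*"]),
    (
        [
            "applebot-extended", "ccbot", "anthropic-ai", "omgili",
            "omgilibot", "imagesiftbot", "bytespider", "awariorssbot",
            "awariosmartbot", "cohere-ai", "dataforseobot", "diffbot",
            "magpie-crawler", "peer39_crawler", "peer39_crawler/1.0",
        ],
        ["/"],
    ),
    (["*"], ["/tienda/"]),
]


def rules_for(agent: str) -> list[str]:
    """Merge the rules of every group naming the agent; otherwise use '*'."""
    agent = agent.lower()
    named = [p for agents, paths in GROUPS if agent in agents for p in paths]
    if named:
        return named
    return [p for agents, paths in GROUPS if "*" in agents for p in paths]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a path pattern: '*' is a wildcard, a trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    core = pattern[:-1] if anchored else pattern
    body = ".*".join(re.escape(part) for part in core.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(agent: str, path: str) -> bool:
    """Only Disallow rules exist here, so any match from the path start blocks it."""
    return not any(pattern_to_regex(p).match(path) for p in rules_for(agent))


print(is_allowed("ccbot", "/"))                             # False
print(is_allowed("googlebot", "/"))                         # True
print(is_allowed("googlebot", "/tienda/item"))              # False
print(is_allowed("googlebot", "/webempresa.asp?cod=123"))   # False
```

Under these assumptions, the listed crawlers are blocked from the whole site, while every other agent is blocked only from `/tienda/` and from the two `.asp` URLs that carry a `cod=` query. Parsers that keep only the first `*` group would not apply the `/tienda/` rule.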