curioctopus.guru
robots.txt

Robots Exclusion Standard data for curioctopus.guru

Resource Scan

Scan Details

Site Domain curioctopus.guru
Base Domain curioctopus.guru
Scan Status Ok
Last Scan2024-11-15T09:32:56+00:00
Next Scan 2024-11-22T09:32:56+00:00

Last Scan

Scanned2024-11-15T09:32:56+00:00
URL http://curioctopus.guru/robots.txt
Redirect https://www.curioctopus.it/robots.txt
Redirect Domain www.curioctopus.it
Redirect Base curioctopus.it
Domain IPs 52.214.198.101, 52.214.95.34
Redirect IPs 13.33.88.116, 13.33.88.47, 13.33.88.69, 13.33.88.8, 2600:9000:223b:2600:17:b92f:8180:93a1, 2600:9000:223b:3400:17:b92f:8180:93a1, 2600:9000:223b:6400:17:b92f:8180:93a1, 2600:9000:223b:7c00:17:b92f:8180:93a1, 2600:9000:223b:8c00:17:b92f:8180:93a1, 2600:9000:223b:9800:17:b92f:8180:93a1, 2600:9000:223b:c600:17:b92f:8180:93a1, 2600:9000:223b:e200:17:b92f:8180:93a1
Response IP 13.33.88.8
Found Yes
Hash a37e95981697d5f05191c59ae4e155a20053bf963cb2b4b8a0ffa4f27bc894b4
SimHash ef0d1c50e6f0

Groups

*

Rule Path
Disallow /search

mauibot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

domaincrawler

Rule Path
Disallow /

domaincrawler/3.0

Rule Path
Disallow /

linguee

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

megaindex.ru

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.curioctopus.it/sitemap.xml