curioctopus.guru
robots.txt

Robots Exclusion Standard data for curioctopus.guru

Resource Scan

Scan Details

Site Domain curioctopus.guru
Base Domain curioctopus.guru
Scan Status Ok
Last Scan2024-09-26T04:01:07+00:00
Next Scan 2024-10-03T04:01:07+00:00

Last Scan

Scanned2024-09-26T04:01:07+00:00
URL http://curioctopus.guru/robots.txt
Redirect https://www.curioctopus.it/robots.txt
Redirect Domain www.curioctopus.it
Redirect Base curioctopus.it
Domain IPs 34.248.5.78, 99.80.238.80
Redirect IPs 108.156.133.102, 108.156.133.15, 108.156.133.5, 108.156.133.50, 2600:9000:21d1:0:17:b92f:8180:93a1, 2600:9000:21d1:600:17:b92f:8180:93a1, 2600:9000:21d1:6200:17:b92f:8180:93a1, 2600:9000:21d1:6a00:17:b92f:8180:93a1, 2600:9000:21d1:8000:17:b92f:8180:93a1, 2600:9000:21d1:be00:17:b92f:8180:93a1, 2600:9000:21d1:c800:17:b92f:8180:93a1, 2600:9000:21d1:e600:17:b92f:8180:93a1
Response IP 108.156.133.15
Found Yes
Hash a37e95981697d5f05191c59ae4e155a20053bf963cb2b4b8a0ffa4f27bc894b4
SimHash ef0d1c50e6f0

Groups

*

Rule Path
Disallow /search

mauibot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

domaincrawler

Rule Path
Disallow /

domaincrawler/3.0

Rule Path
Disallow /

linguee

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

megaindex.ru

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.curioctopus.it/sitemap.xml