curioctopus.it
robots.txt

Robots Exclusion Standard data for curioctopus.it

Resource Scan

Scan Details

Site Domain curioctopus.it
Base Domain curioctopus.it
Scan Status Ok
Last Scan2024-11-09T04:20:28+00:00
Next Scan 2024-11-16T04:20:28+00:00

Last Scan

Scanned2024-11-09T04:20:28+00:00
URL https://curioctopus.it/robots.txt
Redirect https://www.curioctopus.it/robots.txt
Redirect Domain www.curioctopus.it
Redirect Base curioctopus.it
Domain IPs 13.33.88.116, 13.33.88.47, 13.33.88.69, 13.33.88.8
Redirect IPs 13.33.88.116, 13.33.88.47, 13.33.88.69, 13.33.88.8, 2600:9000:223b:1e00:17:b92f:8180:93a1, 2600:9000:223b:2400:17:b92f:8180:93a1, 2600:9000:223b:2e00:17:b92f:8180:93a1, 2600:9000:223b:4e00:17:b92f:8180:93a1, 2600:9000:223b:5e00:17:b92f:8180:93a1, 2600:9000:223b:8c00:17:b92f:8180:93a1, 2600:9000:223b:c200:17:b92f:8180:93a1, 2600:9000:223b:cc00:17:b92f:8180:93a1
Response IP 13.33.88.47
Found Yes
Hash a37e95981697d5f05191c59ae4e155a20053bf963cb2b4b8a0ffa4f27bc894b4
SimHash ef0d1c50e6f0

Groups

*

Rule Path
Disallow /search

mauibot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

domaincrawler

Rule Path
Disallow /

domaincrawler/3.0

Rule Path
Disallow /

linguee

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

megaindex.ru

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.curioctopus.it/sitemap.xml