it.surf-forecast.com
robots.txt

Robots Exclusion Standard data for it.surf-forecast.com

Resource Scan

Scan Details

Site Domain it.surf-forecast.com
Base Domain surf-forecast.com
Scan Status Ok
Last Scan2024-09-26T20:53:11+00:00
Next Scan 2024-10-10T20:53:11+00:00

Last Scan

Scanned2024-09-26T20:53:11+00:00
URL https://it.surf-forecast.com/robots.txt
Domain IPs 104.22.26.216, 104.22.27.216, 172.67.10.178, 2606:4700:10::6816:1ad8, 2606:4700:10::6816:1bd8, 2606:4700:10::ac43:ab2
Response IP 104.22.27.216
Found Yes
Hash 77c636033c3435592cbd2b3cd05b4badc1d00367175a5a753819b07d467e8bf6
SimHash 69058849a133

Groups

*

Rule Path
Disallow *page%3D*tweet%3D*
Disallow /breaks/*/photos/new
Disallow /pages/terms
Disallow /pages/privacy
Disallow /pages/cookie_policy

gptbot
chatgpt-user
ccbot
anthropic-ai

Rule Path
Disallow /breaks/*
Allow /breaks/*/forecasts/*

Other Records

Field Value
sitemap https://it.surf-forecast.com/sitemap_index.xml.gz