de.weatherspark.com
robots.txt

Robots Exclusion Standard data for de.weatherspark.com

Resource Scan

Scan Details

Site Domain de.weatherspark.com
Base Domain weatherspark.com
Scan Status Ok
Last Scan2024-05-25T10:45:35+00:00
Next Scan 2024-06-24T10:45:35+00:00

Last Scan

Scanned2024-05-25T10:45:35+00:00
URL https://de.weatherspark.com/robots.txt
Domain IPs 18.155.68.107, 18.155.68.80, 18.155.68.85, 18.155.68.94
Response IP 18.155.68.80
Found Yes
Hash 50b4f6f4605f2b87457dffa2a7fa8ec7e0d4b8fed4363dc1dbbe3ae240ef5046
SimHash 6b84be60ebd7

Groups

*

Rule Path
Disallow /d/
Disallow /td/
Disallow /h/d/
Disallow /h/td/
Disallow /countries/d/
Disallow /compare/s/
Disallow /compare/m/
Disallow /compare/d/
Disallow /map
Disallow /search
Disallow /license

Other Records

Field Value
crawl-delay 1

mauibot
ahrefsbot
semrushbot
petalbot
liebaofast
mqqbrowser
mb2345browser
gptbot
claudebot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://de.weatherspark.com/sitemap.xml