weatherbase.com
robots.txt

Robots Exclusion Standard data for weatherbase.com

Resource Scan

Scan Details

Site Domain weatherbase.com
Base Domain weatherbase.com
Scan Status Ok
Last Scan 2024-09-21T10:02:11+00:00
Next Scan 2024-09-28T10:02:11+00:00

Last Scan

Scanned 2024-09-21T10:02:11+00:00
URL https://weatherbase.com/robots.txt
Domain IPs 104.21.18.41, 172.67.180.92, 2606:4700:3033::6815:1229, 2606:4700:3033::ac43:b45c
Response IP 104.21.18.41
Found Yes
Hash 269350d0c5391c620ada789b5f7c7f2dfb5ecc3b1d1a86b6b22e28782a657271
SimHash 69285344649a

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /ads/
Disallow /api/
Disallow /cgi-bin/
Disallow /includes/
Disallow /olddeleteme/
Disallow /test/
Disallow /monthly/
Disallow /pepsi/
Disallow /partners/
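
The two groups above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text is reconstructed from the rules listed in this report, so its exact layout (ordering, capitalisation, blank lines) is an assumption, and "ExampleBot" and the tested URLs are hypothetical. Note that an empty Disallow path, as in the mediapartners-google group, disallows nothing.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups in this report; the original file's exact
# text (ordering, comments, whitespace) is an assumption.
ROBOTS_TXT = """\
User-agent: Mediapartners-Google
Disallow:

User-agent: *
Disallow: /ads/
Disallow: /api/
Disallow: /cgi-bin/
Disallow: /includes/
Disallow: /olddeleteme/
Disallow: /test/
Disallow: /monthly/
Disallow: /pepsi/
Disallow: /partners/
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt content from a string instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical URLs and crawler name, used only to exercise the rules.
ads_url = "https://weatherbase.com/ads/banner"
page_url = "https://weatherbase.com/weather/"

# The empty Disallow in the first group leaves everything open to that agent.
adsense_allowed = parser.can_fetch("Mediapartners-Google", ads_url)
# Any other crawler falls through to the "*" group.
other_ads_allowed = parser.can_fetch("ExampleBot", ads_url)
other_page_allowed = parser.can_fetch("ExampleBot", page_url)

print(adsense_allowed, other_ads_allowed, other_page_allowed)
```

Parsing from a string keeps the check offline and reproducible. To test against the live file instead, RobotFileParser also provides set_url() and read().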

Other Records

Field Value
sitemap http://www.weatherbase.com/sitemaps/sitemap.xml
sitemap http://www.weatherbase.com/sitemaps/sitemap-dailyaverage.xml
sitemap http://www.weatherbase.com/sitemaps/sitemap-hourly.xml.gz
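
The sitemap records can be read with the same standard-library parser. This is a small sketch, assuming Python 3.8 or later (where RobotFileParser.site_maps() is available) and assuming the records appear in the file as ordinary "Sitemap:" lines. The rule groups are left out here because they do not affect sitemap extraction.

```python
from urllib.robotparser import RobotFileParser

# Only the sitemap records from this report; writing them as "Sitemap:"
# lines is an assumption about the original file's formatting.
SITEMAP_LINES = """\
Sitemap: http://www.weatherbase.com/sitemaps/sitemap.xml
Sitemap: http://www.weatherbase.com/sitemaps/sitemap-dailyaverage.xml
Sitemap: http://www.weatherbase.com/sitemaps/sitemap-hourly.xml.gz
"""

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES.splitlines())

# site_maps() returns a list of URLs, or None when no sitemap record exists.
sitemaps = sitemap_parser.site_maps() or []

for url in sitemaps:
    # Gzip-compressed sitemaps need decompressing before XML parsing.
    kind = "gzip" if url.endswith(".gz") else "plain XML"
    print(f"{url} ({kind})")
```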