weatherhq.com
robots.txt
Robots Exclusion Standard data for weatherhq.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | weatherhq.com |
Base Domain | weatherhq.com |
Scan Status | Ok |
Last Scan | 2024-05-31T06:01:38+00:00 |
Next Scan | 2024-06-07T06:01:38+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-31T06:01:38+00:00 |
URL | https://weatherhq.com/robots.txt |
Domain IPs | 104.21.2.229, 172.67.129.199, 2606:4700:3034::ac43:81c7, 2606:4700:3037::6815:2e5 |
Response IP | 172.67.129.199 |
Found | Yes |
Hash | 121cd486a519b7a2a4dca7fbfdcdde635a451f68c3115f2caf3fea4247c91e67 |
SimHash | c01cd5c0ee92 |
Groups
*
Rule | Path |
---|---|
Disallow | /authentication |
Disallow | |
Disallow | /weather/widgetv2 |
Disallow | /desktop |
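
As a rough illustration of how a crawler might interpret the `*` group above, the following is a minimal sketch that checks a URL path against the four listed rules. It assumes the longest-match convention described in RFC 9309 and plain prefix matching only (no `*` or `$` patterns), and it treats the empty `Disallow` value as restricting nothing. The names `RULES` and `is_allowed` are hypothetical and not part of the scan data; real crawlers may resolve these rules differently.

```python
# Rules copied from the "*" group in the table above.
RULES = [
    ("Disallow", "/authentication"),
    ("Disallow", ""),  # empty value: assumed to restrict nothing
    ("Disallow", "/weather/widgetv2"),
    ("Disallow", "/desktop"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under plain prefix matching.

    Simplified sketch: no wildcard (*) or end-anchor ($) support.
    """
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for rule, prefix in rules:
        # Skip empty rules and rules whose prefix does not match.
        if not prefix or not path.startswith(prefix):
            continue
        allow = rule.lower() == "allow"
        # The longest matching prefix wins; on a tie, Allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len = len(prefix)
            allowed = allow
    return allowed


if __name__ == "__main__":
    for p in ["/", "/authentication/login", "/weather/widgetv2", "/desktop"]:
        print(p, "allowed" if is_allowed(p) else "disallowed")
```

Under these assumptions, only paths that begin with `/authentication`, `/weather/widgetv2`, or `/desktop` are blocked for all user agents; everything else on the site remains crawlable.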