theweather.net
robots.txt
Robots Exclusion Standard data for theweather.net
Resource Scan
Scan Details
Site Domain | theweather.net |
Base Domain | theweather.net |
Scan Status | Ok |
Last Scan | 2024-09-20T04:14:26+00:00 |
Next Scan | 2024-09-27T04:14:26+00:00 |
Last Scan
Scanned | 2024-09-20T04:14:26+00:00 |
URL | https://theweather.net/robots.txt |
Redirect | https://www.theweather.net/robots/ca.txt |
Redirect Domain | www.theweather.net |
Redirect Base | theweather.net |
Domain IPs | 104.16.62.112, 104.16.63.112, 2606:4700::6810:3e70, 2606:4700::6810:3f70 |
Redirect IPs | 104.16.62.112, 104.16.63.112, 2606:4700::6810:3e70, 2606:4700::6810:3f70 |
Response IP | 104.16.63.112 |
Found | Yes |
Hash | 9819ac912ffc828f3c9a51740efeb6676cd495fabbe6c53262df0c8a42f98477 |
SimHash | 695551204573 |
Groups
*
Rule | Path |
---|---|
Disallow | /pruebas-publi/* |
Disallow | /s%3D* |
Disallow | /cdn-cgi/rum* |
Allow | / |