theherald.co.za
robots.txt
Robots Exclusion Standard data for theherald.co.za
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | theherald.co.za |
Base Domain | theherald.co.za |
Scan Status | Ok |
Last Scan | 2024-11-16T10:04:27+00:00 |
Next Scan | 2024-11-23T10:04:27+00:00 |
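The two timestamps above are exactly seven days apart, which suggests a weekly rescan. A minimal sketch of that arithmetic, assuming a fixed 7-day interval (the name `SCAN_INTERVAL` is hypothetical and the interval is inferred from the timestamps, not stated by the scanner):

```python
from datetime import datetime, timedelta

# Timestamp copied from the "Last Scan" row above.
last_scan = datetime.fromisoformat("2024-11-16T10:04:27+00:00")

# Assumption: the scanner reschedules on a fixed weekly interval.
SCAN_INTERVAL = timedelta(days=7)

next_scan = last_scan + SCAN_INTERVAL
print(next_scan.isoformat())
```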
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-16T10:04:27+00:00 |
URL | https://theherald.co.za/robots.txt |
Redirect | https://www.heraldlive.co.za/robots.txt |
Redirect Domain | www.heraldlive.co.za |
Redirect Base | heraldlive.co.za |
Domain IPs | 178.79.178.218, 2a01:7e00:e000:3f7:: |
Redirect IPs | 2404:6800:4003:c01::79, 74.125.130.121 |
Response IP | 172.253.118.121 |
Found | Yes |
Hash | d2e368b124130e5fbfb06ec3ba5440b4318b5b1b365b1d4c22b4ad02e6103166 |
SimHash | 61541960c311 |
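The Hash value is 64 hexadecimal digits, which is the length of a SHA-256 digest, although the report does not name the algorithm. As a sketch under that assumption, a content hash like this can be used to tell whether a robots.txt body changed between scans. The helper names `body_digest` and `has_changed` are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a fetched robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_digest: str) -> bool:
    """Compare a freshly fetched body against the digest from an earlier scan."""
    return body_digest(body) != previous_digest.lower()
```

Whether a recomputed digest reproduces the value in the table depends on the exact bytes the scanner hashed, so a mismatch would not by itself prove the file changed.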
Groups
User-agent: *
Rule | Path |
---|---|
Allow | / |
Disallow | /welcome/* |
Disallow | /u/* |
Disallow | /buy/* |
Disallow | /static/* |
Disallow | /build/bundles/* |
Disallow | /herald_cosmos_images/* |
Disallow | /share/* |
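The group above combines a blanket `Allow: /` with several more specific `Disallow` patterns. Under the longest-match rule described in RFC 9309, the most specific matching pattern wins and ties go to `Allow`. The following is a minimal sketch of that evaluation for the rules listed here; the names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and this is not the scanner's or any particular crawler's implementation.

```python
import re

# Rules transcribed from the "*" group in the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/welcome/*"),
    ("disallow", "/u/*"),
    ("disallow", "/buy/*"),
    ("disallow", "/static/*"),
    ("disallow", "/build/bundles/*"),
    ("disallow", "/herald_cosmos_images/*"),
    ("disallow", "/share/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under longest-match semantics."""
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            # Longer pattern wins; on equal length, prefer allow.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

For example, `is_allowed("/news/")` is true because only `Allow: /` matches, while `is_allowed("/static/app.js")` is false because `/static/*` is the longer match.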
Other Records
Field | Value |
---|---|
sitemap | https://www.heraldlive.co.za/sitemap/ |
sitemap | https://www.heraldlive.co.za/sitemap/google-news/ |
sitemap | https://www.heraldlive.co.za/sitemap/video/ |
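Sitemap records sit outside any user-agent group, which is why the report lists them separately. A short sketch of pulling them out of a robots.txt body follows; `extract_sitemaps` is a hypothetical helper, and `SAMPLE` is a reconstruction from the tables in this report, not the verbatim file.

```python
from typing import List

# Reconstructed from the report above; the real file may differ in layout.
SAMPLE = """\
User-agent: *
Allow: /
Disallow: /share/*

Sitemap: https://www.heraldlive.co.za/sitemap/
Sitemap: https://www.heraldlive.co.za/sitemap/google-news/
Sitemap: https://www.heraldlive.co.za/sitemap/video/
"""


def extract_sitemaps(robots_txt: str) -> List[str]:
    """Collect the values of all sitemap records, in file order."""
    sitemaps = []
    for line in robots_txt.splitlines():
        # Drop trailing comments, then surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        # Split on the first colon only, so the URL's own colons survive.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            sitemaps.append(value.strip())
    return sitemaps


print(extract_sitemaps(SAMPLE))
```

The field name is compared case-insensitively, so `Sitemap:` and `sitemap:` are treated the same.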