mashable.com
robots.txt
Robots Exclusion Standard data for mashable.com
Resource Scan
Scan Details
Site Domain | mashable.com |
Base Domain | mashable.com |
Scan Status | Ok |
Last Scan | 2024-05-01T00:12:49+00:00 |
Next Scan | 2024-05-08T00:12:49+00:00 |
Last Scan
Scanned | 2024-05-01T00:12:49+00:00 |
URL | https://mashable.com/robots.txt |
Domain IPs | 104.18.33.218, 172.64.154.38, 2606:4700:4400::6812:21da, 2606:4700:4400::ac40:9a26 |
Response IP | 172.64.154.38 |
Found | Yes |
Hash | e7d82e774b2ee0809c6b2cbee0e3d77185e046ab49b2f3ca58814c9207416b0c |
SimHash | 6904db50e711 |
Groups
*
Rule | Path |
---|---|
Disallow | /search |
Disallow | /archive/ |
Disallow | /cdn-cgi/ |
Other Records
Field | Value |
---|---|
sitemap | https://mashable.com/sitemap-index.xml |
sitemap | https://mashable.com/sitemap-news-0.xml |