mashable.com
robots.txt

Robots Exclusion Standard data for mashable.com

Resource Scan

Scan Details

Site Domain mashable.com
Base Domain mashable.com
Scan Status Ok
Last Scan2024-05-01T00:12:49+00:00
Next Scan 2024-05-08T00:12:49+00:00

Last Scan

Scanned2024-05-01T00:12:49+00:00
URL https://mashable.com/robots.txt
Domain IPs 104.18.33.218, 172.64.154.38, 2606:4700:4400::6812:21da, 2606:4700:4400::ac40:9a26
Response IP 172.64.154.38
Found Yes
Hash e7d82e774b2ee0809c6b2cbee0e3d77185e046ab49b2f3ca58814c9207416b0c
SimHash 6904db50e711

Groups

*

Rule Path
Disallow /search
Disallow /archive/
Disallow /cdn-cgi/

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /
Disallow /*?page=%5B0-9%5D%5B0-9%5D
Disallow /*?page=%5B0-9%5D%5B0-9%5D%5B0-9%5D
Disallow /*?page=%5B0-9%5D%5B0-9%5D%5B0-9%5D%5B0-9%5D
Allow /*?page=%5B0-9%5D

Other Records

Field Value
sitemap https://mashable.com/sitemap-index.xml
sitemap https://mashable.com/sitemap-news-0.xml