sciencealert.com
robots.txt

Robots Exclusion Standard data for sciencealert.com

Resource Scan

Scan Details

Site Domain sciencealert.com
Base Domain sciencealert.com
Scan Status Ok
Last Scan2024-06-12T13:57:46+00:00
Next Scan 2024-06-19T13:57:46+00:00

Last Scan

Scanned2024-06-12T13:57:46+00:00
URL https://sciencealert.com/robots.txt
Domain IPs 104.18.18.94, 104.18.19.94, 2606:4700::6812:125e, 2606:4700::6812:135e
Response IP 104.18.19.94
Found Yes
Hash 3d07e394503c25f5f976786e37ae7501f8e5e1c08b691fa7ab5800acf5932b8c
SimHash 052e4c69c590

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /*?s=
Disallow /*?ignore_amp
Disallow /*?perpetual=
Disallow /*%26limitstart%3D
Disallow /*?limitstart=
Disallow /*?utm_campaign=
Disallow /*?utm_content=
Disallow /*?utm_medium=
Disallow /*?utm_source=
Disallow /*?ref=
Disallow /*?trk=
Disallow /*?fbclid=
Disallow /*?curator=
Disallow /*?alm_mvr=
Disallow /*?utm_=

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.sciencealert.com/30-day-sitemap.xml
sitemap https://www.sciencealert.com/news-sitemap.xml