forum.newsweek.com
robots.txt

Robots Exclusion Standard data for forum.newsweek.com

Resource Scan

Scan Details

Site Domain forum.newsweek.com
Base Domain newsweek.com
Scan Status Ok
Last Scan2024-05-09T08:33:39+00:00
Next Scan 2024-05-23T08:33:39+00:00

Last Scan

Scanned2024-05-09T08:33:39+00:00
URL https://forum.newsweek.com/robots.txt
Domain IPs 199.232.194.217, 199.232.198.217
Response IP 151.101.42.217
Found Yes
Hash 1dd60c923f79923be22d29795c60657141b8f946191091ff88ac5a2999463526
SimHash 7855de70cd93

Groups

googlebot

Rule Path
Disallow /sample-*
Disallow /blog/sample-*
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/

*

Rule Path
Allow /
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/

Other Records

Field Value
sitemap https://forum.newsweek.com/sitemap.xml

Warnings

  • 1 invalid line.