blog.pressreader.com
robots.txt

Robots Exclusion Standard data for blog.pressreader.com

Resource Scan

Scan Details

Site Domain blog.pressreader.com
Base Domain pressreader.com
Scan Status Ok
Last Scan2025-05-26T11:18:57+00:00
Next Scan 2025-06-25T11:18:57+00:00

Last Scan

Scanned2025-05-26T11:18:57+00:00
URL https://blog.pressreader.com/robots.txt
Domain IPs 199.60.103.228, 199.60.103.28, 2606:2c40::c73c:671c, 2606:2c40::c73c:67e4
Response IP 199.60.103.228
Found Yes
Hash 277009d0d96a48f19bcfd698984f45dc24cae6d9f38d4271bd6aa5e9c7c2ccc5
SimHash 3af5ce28ccb3

Groups

*

Rule Path
Disallow /sample-*
Disallow /blog/sample-*
Disallow /author/*
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*