readwhere.com
robots.txt
Robots Exclusion Standard data for readwhere.com
Resource Scan
Scan Details
Site Domain | readwhere.com |
Base Domain | readwhere.com |
Scan Status | Ok |
Last Scan | 2024-06-15T19:45:34+00:00 |
Next Scan | 2024-06-22T19:45:34+00:00 |
Last Scan
Scanned | 2024-06-15T19:45:34+00:00 |
URL | https://readwhere.com/robots.txt |
Redirect | https://www.readwhere.com/robots.txt |
Redirect Domain | www.readwhere.com |
Redirect Base | readwhere.com |
Domain IPs | 34.117.35.89 |
Redirect IPs | 34.117.35.89 |
Response IP | 34.117.35.89 |
Found | Yes |
Hash | e2590b0daed14f9d411e1fad76651d0e4b66d0167c12c17b77b8721cf890ec00 |
SimHash | 10320452a6d3 |
Groups
*
Rule | Path |
---|---|
Disallow | /tagapiv1/ |
Disallow | /api/ |
Disallow | /read/api/ |
Disallow | /impression/ |
Disallow | /ajax/ |
Disallow | /searchv2/ |
Disallow | /publicajax/ |
Disallow | /read/cartcheckout/ |
Disallow | /speedynews/ |
Disallow | /lite/ |
Disallow | /mashup/ |
Disallow | /publication/ |
Disallow | /m/search/ |
Disallow | /search/ |
Disallow | /m/logout/ |
Disallow | /user/logout |
Disallow | /1009127/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.readwhere.com/sitemap/sitemapindex.xml |
sitemap | https://global.readwhere.com/sitemap/globalsitemapindex.xml |
Comments