readwhere.com
robots.txt

Robots Exclusion Standard data for readwhere.com

Resource Scan

Scan Details

Site Domain readwhere.com
Base Domain readwhere.com
Scan Status Ok
Last Scan2024-06-15T19:45:34+00:00
Next Scan 2024-06-22T19:45:34+00:00

Last Scan

Scanned2024-06-15T19:45:34+00:00
URL https://readwhere.com/robots.txt
Redirect https://www.readwhere.com/robots.txt
Redirect Domain www.readwhere.com
Redirect Base readwhere.com
Domain IPs 34.117.35.89
Redirect IPs 34.117.35.89
Response IP 34.117.35.89
Found Yes
Hash e2590b0daed14f9d411e1fad76651d0e4b66d0167c12c17b77b8721cf890ec00
SimHash 10320452a6d3

Groups

*

Rule Path
Disallow /tagapiv1/
Disallow /api/
Disallow /read/api/
Disallow /impression/
Disallow /ajax/
Disallow /searchv2/
Disallow /publicajax/
Disallow /read/cartcheckout/
Disallow /speedynews/
Disallow /lite/
Disallow /mashup/
Disallow /publication/
Disallow /m/search/
Disallow /search/
Disallow /m/logout/
Disallow /user/logout
Disallow /1009127/

yandex

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.readwhere.com/sitemap/sitemapindex.xml
sitemap https://global.readwhere.com/sitemap/globalsitemapindex.xml

Comments

  • block search page crawl
  • block logout page crawl
  • block access to DFP codes
  • blocks access to whole site for yandex
  • Sitemap Reference