Robots Exclusion Standard data for bookshop.org

Resource Scan

Scan Details

Site Domain bookshop.org
Base Domain bookshop.org
Scan Status Ok
Last Scan 2024-11-08T16:05:43+00:00
Next Scan 2024-11-15T16:05:43+00:00

Last Scan

Scanned 2024-11-08T16:05:43+00:00
URL https://bookshop.org/robots.txt
Domain IPs 104.18.22.66, 104.18.23.66, 2606:4700::6812:1642, 2606:4700::6812:1742
Response IP 104.18.23.66
Found Yes
Hash ba33744838142ddc74b6d27b1a2503cebb6a178b67ecf92e3838c0981775ab01
SimHash e9110f555f77

Groups

*

Rule Path
Disallow /checkout
Disallow /cart
Disallow /orders
Disallow /user
Disallow /account
Disallow /api
Disallow /password
Disallow /affiliates/profile
Allow /affiliates/profile/introduction
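
The Allow line only has an effect if a crawler resolves conflicts by the most specific (longest) matching path, as RFC 9309 describes. The following is a minimal sketch of that evaluation for the rules listed above. The names `RULES` and `is_allowed` are hypothetical and introduced here for illustration; the sketch assumes plain prefix matching and does not handle the `*` and `$` wildcards, which none of these rules use.

```python
# Rules from the "*" group above, as (kind, path prefix) pairs.
RULES = [
    ("disallow", "/checkout"),
    ("disallow", "/cart"),
    ("disallow", "/orders"),
    ("disallow", "/user"),
    ("disallow", "/account"),
    ("disallow", "/api"),
    ("disallow", "/password"),
    ("disallow", "/affiliates/profile"),
    ("allow", "/affiliates/profile/introduction"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    best_allow = True  # a path that matches no rule is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        allow = kind == "allow"
        # The longest matching prefix wins; on a tie, allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len = len(prefix)
            best_allow = allow
    return best_allow


if __name__ == "__main__":
    for p in ["/books", "/cart", "/affiliates/profile",
              "/affiliates/profile/introduction"]:
        print(p, "allowed" if is_allowed(p) else "disallowed")
```

Because matching is by prefix, `Disallow /cart` also covers any other path that begins with `/cart`. A parser that applies the first matching rule in file order, rather than the longest match, could treat `/affiliates/profile/introduction` as disallowed if the Disallow line comes first in the file.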

Other Records

Field Value
sitemap https://bookshop.org/sitemap.xml.gz

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file