bookshop.org
robots.txt
Robots Exclusion Standard data for bookshop.org
Resource Scan
Scan Details
Site Domain | bookshop.org |
Base Domain | bookshop.org |
Scan Status | Ok |
Last Scan | 2024-11-08T16:05:43+00:00 |
Next Scan | 2024-11-15T16:05:43+00:00 |
Last Scan
Scanned | 2024-11-08T16:05:43+00:00 |
URL | https://bookshop.org/robots.txt |
Domain IPs | 104.18.22.66, 104.18.23.66, 2606:4700::6812:1642, 2606:4700::6812:1742 |
Response IP | 104.18.23.66 |
Found | Yes |
Hash | ba33744838142ddc74b6d27b1a2503cebb6a178b67ecf92e3838c0981775ab01 |
SimHash | e9110f555f77 |
Groups
*
Rule | Path |
---|---|
Disallow | /checkout |
Disallow | /cart |
Disallow | /orders |
Disallow | /user |
Disallow | /account |
Disallow | /api |
Disallow | /password |
Disallow | /affiliates/profile |
Allow | /affiliates/profile/introduction |
Other Records
Field | Value |
---|---|
sitemap | https://bookshop.org/sitemap.xml.gz |
Comments