bookharbour.com
robots.txt

Robots Exclusion Standard data for bookharbour.com

Resource Scan

Scan Details

Site Domain bookharbour.com
Base Domain bookharbour.com
Scan Status Ok
Last Scan2024-10-16T22:17:04+00:00
Next Scan 2024-11-15T22:17:04+00:00

Last Scan

Scanned2024-10-16T22:17:04+00:00
URL https://bookharbour.com/robots.txt
Redirect https://www.bookharbour.com/robots.txt
Redirect Domain www.bookharbour.com
Redirect Base bookharbour.com
Domain IPs 104.26.0.172, 104.26.1.172, 172.67.75.42, 2606:4700:20::681a:1ac, 2606:4700:20::681a:ac, 2606:4700:20::ac43:4b2a
Redirect IPs 104.26.0.172, 104.26.1.172, 172.67.75.42, 2606:4700:20::681a:1ac, 2606:4700:20::681a:ac, 2606:4700:20::ac43:4b2a
Response IP 104.26.1.172
Found Yes
Hash 2702abf920064f05e28bde629bcac20739740f08ea6fa3964c7f921a10cd5ef1
SimHash bb25716288d8

Groups

*

Rule Path
Disallow /index.php/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalog/product/view/
Disallow /catalogsearch/
Disallow /checkout/
Disallow /customer/
Disallow /review/
Disallow /wishlist/
Disallow /sendfriend/
Disallow /*?*product_list_mode=
Disallow /*?*product_list_order=
Disallow /*?*product_list_limit=
Disallow /*?*product_list_dir=
Disallow /*?SID=
Disallow /*?*cat=*
Disallow /*?*price=*
Disallow /*?*format=*
Disallow /*?*edition=*
Disallow /*?*author=*
Disallow /*?*publisher=*

Other Records

Field Value
sitemap https://www.bookharbour.com/sitemaps/sitemap.xml

Comments

  • Paths (clean URLs)
  • Do not index pages that are sorted or filtered.
  • Do not index session ID
  • Filters