uscbookstore.com
robots.txt

Robots Exclusion Standard data for uscbookstore.com

Resource Scan

Scan Details

Site Domain uscbookstore.com
Base Domain uscbookstore.com
Scan Status Ok
Last Scan2024-09-22T06:43:07+00:00
Next Scan 2024-10-22T06:43:07+00:00

Last Scan

Scanned2024-09-22T06:43:07+00:00
URL https://www.uscbookstore.com/robots.txt
Domain IPs 125.56.219.17, 23.32.29.89
Response IP 125.56.219.17
Found Yes
Hash 92b0c4ba5bba97561112e5bf91c9d5857804dd7b4157e380f8d99e3c32c6bf62
SimHash b9514d140d57

Groups

*

Rule Path
Disallow /signin.aspx
Disallow /booklist.aspx
Disallow /wishlist.aspx
Disallow /custitem_usc_ewr_item_flag
Disallow /custitem_ef_gw_is_giftwrap
Disallow /product/undefined
Disallow /error
Disallow /cart
Disallow /California-Electronic-Waste-Recycling-*

Other Records

Field Value
sitemap https://www.uscbookstore.com/sitemap_www.uscbookstore.com_Index.xml