gettextbooks.co.uk
robots.txt

Robots Exclusion Standard data for gettextbooks.co.uk

Resource Scan

Scan Details

Site Domain gettextbooks.co.uk
Base Domain gettextbooks.co.uk
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-05-30T07:33:43+00:00
Next Scan 2025-08-28T07:33:43+00:00

Last Successful Scan

Scanned 2025-01-08T07:22:25+00:00
URL https://gettextbooks.co.uk/robots.txt
Redirect https://www.gettextbooks.co.uk/robots.txt
Redirect Domain www.gettextbooks.co.uk
Redirect Base gettextbooks.co.uk
Domain IPs 13.35.238.112, 13.35.238.27, 13.35.238.33, 13.35.238.40
Redirect IPs 13.35.238.112, 13.35.238.27, 13.35.238.33, 13.35.238.40
Response IP 13.35.238.27
Found Yes
Hash 18ecb3ce5e4d7baa4fb0b33b969a6e19b554c2e52be91fde7ee2ef527ef814b7
SimHash 3705501ef192

Groups

*

Rule Path
Disallow /user/
Disallow /bookbag/add/
Disallow /wishlist/add/
Disallow /ibundle/
Disallow /ibundle/add
Disallow /mybundle/
Disallow /mybundle/add/
Disallow /pricealert/
Disallow /ean/
Disallow /asin/
Disallow /pi/
Disallow /jbd/
Disallow /go.aspx

ia_archiver

Rule Path
Disallow /go.aspx
Disallow /isbn/
Disallow /browse/
Disallow /search/
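
The two groups above can be checked programmatically. The following is a minimal sketch, assuming the listed rules are re-serialized into robots.txt syntax (the report does not reproduce the original file verbatim) and evaluated with Python's standard urllib.robotparser. The helper names build_robots_txt and is_allowed, the agent name ExampleBot, and the sample paths are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Rules copied from the "Groups" tables in this report.
GROUPS = {
    "*": [
        "/user/", "/bookbag/add/", "/wishlist/add/", "/ibundle/",
        "/ibundle/add", "/mybundle/", "/mybundle/add/", "/pricealert/",
        "/ean/", "/asin/", "/pi/", "/jbd/", "/go.aspx",
    ],
    "ia_archiver": ["/go.aspx", "/isbn/", "/browse/", "/search/"],
}


def build_robots_txt(groups):
    """Serialize the groups into robots.txt syntax (a reconstruction, not the original file)."""
    blocks = []
    for agent, paths in groups.items():
        lines = [f"User-agent: {agent}"]
        lines += [f"Disallow: {path}" for path in paths]
        blocks.append("\n".join(lines))
    # A blank line separates groups.
    return "\n\n".join(blocks) + "\n"


# Parse the reconstructed rules once and reuse the parser.
_parser = RobotFileParser()
_parser.parse(build_robots_txt(GROUPS).splitlines())


def is_allowed(agent, path):
    """Return True if `agent` may fetch `path` under the reconstructed rules."""
    return _parser.can_fetch(agent, path)


if __name__ == "__main__":
    # Hypothetical agent and paths, chosen only to exercise each group.
    for agent, path in [
        ("ExampleBot", "/user/profile"),
        ("ExampleBot", "/isbn/9780000000000"),
        ("ia_archiver", "/isbn/9780000000000"),
        ("ia_archiver", "/user/profile"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Note that a crawler matching the ia_archiver group is governed by that group alone, so under this parser it is not bound by the rules listed under the * group, such as /user/.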