discovery.nationalarchives.gov.uk
robots.txt

Robots Exclusion Standard data for discovery.nationalarchives.gov.uk

Resource Scan

Scan Details

Site Domain discovery.nationalarchives.gov.uk
Base Domain nationalarchives.gov.uk
Scan Status Ok
Last Scan2024-05-14T15:28:19+00:00
Next Scan 2024-06-13T15:28:19+00:00

Last Scan

Scanned2024-05-14T15:28:19+00:00
URL https://discovery.nationalarchives.gov.uk/robots.txt
Domain IPs 52.222.144.38, 52.222.144.55, 52.222.144.72, 52.222.144.8
Response IP 18.165.171.46
Found Yes
Hash 9916f71533c2eb11b2d509fef113a2aa07c1d15f547d58be85c8173de1edcbfe
SimHash 71489c849f12

Groups

*

Rule Path
Disallow /browse/
Disallow /Details/AddtoBasket
Disallow /details/addtobasket
Disallow /Details/AssetMain
Disallow /details/AssetMain
Disallow /Details/FindRelatedIA
Disallow /details/FindRelatedIA
Disallow /Details/FlagTag
Disallow /details/FlagTag
Disallow /hbrowse
Disallow /home/redirect
Disallow /image/
Disallow /mdr
Disallow /redirect/notfound
Disallow /register
Disallow /results
Disallow /tag
Disallow /pagecheck

Comments

  • Tells Scanning Robots where they are and are not welcome