discovery.nationalarchives.gov.uk
robots.txt
Robots Exclusion Standard data for discovery.nationalarchives.gov.uk
Resource Scan
Scan Details
Site Domain | discovery.nationalarchives.gov.uk |
Base Domain | nationalarchives.gov.uk |
Scan Status | Ok |
Last Scan | 2024-05-14T15:28:19+00:00 |
Next Scan | 2024-06-13T15:28:19+00:00 |
Last Scan
Scanned | 2024-05-14T15:28:19+00:00 |
URL | https://discovery.nationalarchives.gov.uk/robots.txt |
Domain IPs | 52.222.144.38, 52.222.144.55, 52.222.144.72, 52.222.144.8 |
Response IP | 18.165.171.46 |
Found | Yes |
Hash | 9916f71533c2eb11b2d509fef113a2aa07c1d15f547d58be85c8173de1edcbfe |
SimHash | 71489c849f12 |
Groups
*
Rule | Path |
---|---|
Disallow | /browse/ |
Disallow | /Details/AddtoBasket |
Disallow | /details/addtobasket |
Disallow | /Details/AssetMain |
Disallow | /details/AssetMain |
Disallow | /Details/FindRelatedIA |
Disallow | /details/FindRelatedIA |
Disallow | /Details/FlagTag |
Disallow | /details/FlagTag |
Disallow | /hbrowse |
Disallow | /home/redirect |
Disallow | /image/ |
Disallow | /mdr |
Disallow | /redirect/notfound |
Disallow | /register |
Disallow | /results |
Disallow | /tag |
Disallow | /pagecheck |
Comments