discovery.nationalarchives.gov.uk
robots.txt

Robots Exclusion Standard data for discovery.nationalarchives.gov.uk

Archived Snapshots

Resource Scan

Scan Details

Site Domain	discovery.nationalarchives.gov.uk
Base Domain	nationalarchives.gov.uk
Scan Status	Ok
Last Scan	2024-05-14T15:28:19+00:00
Next Scan	2024-06-13T15:28:19+00:00

Last Scan

Scanned	2024-05-14T15:28:19+00:00
URL	https://discovery.nationalarchives.gov.uk/robots.txt
Domain IPs	52.222.144.38, 52.222.144.55, 52.222.144.72, 52.222.144.8
Response IP	18.165.171.46
Found	Yes
Hash	9916f71533c2eb11b2d509fef113a2aa07c1d15f547d58be85c8173de1edcbfe
SimHash	71489c849f12

Groups

*

Rule	Path
Disallow	/browse/
Disallow	/Details/AddtoBasket
Disallow	/details/addtobasket
Disallow	/Details/AssetMain
Disallow	/details/AssetMain
Disallow	/Details/FindRelatedIA
Disallow	/details/FindRelatedIA
Disallow	/Details/FlagTag
Disallow	/details/FlagTag
Disallow	/hbrowse
Disallow	/home/redirect
Disallow	/image/
Disallow	/mdr
Disallow	/redirect/notfound
Disallow	/register
Disallow	/results
Disallow	/tag
Disallow	/pagecheck

Rule

Path

Disallow

/browse/

Disallow

/Details/AddtoBasket

Disallow

/details/addtobasket

Disallow

/Details/AssetMain

Disallow

/details/AssetMain

Disallow

/Details/FindRelatedIA

Disallow

/details/FindRelatedIA

Disallow

/Details/FlagTag

Disallow

/details/FlagTag

Disallow

/hbrowse

Disallow

/home/redirect

Disallow

/image/

Disallow

/mdr

Disallow

/redirect/notfound

Disallow

/register

Disallow

/results

Disallow

/tag

Disallow

/pagecheck

Back to top

Comments

Tells Scanning Robots where they are and are not welcome

Back to top

discovery.nationalarchives.gov.ukrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Comments

discovery.nationalarchives.gov.uk
robots.txt