search.findmypast.co.uk
robots.txt

Robots Exclusion Standard data for search.findmypast.co.uk

Resource Scan

Scan Details

Site Domain search.findmypast.co.uk
Base Domain findmypast.co.uk
Scan Status Ok
Last Scan2025-04-02T11:40:49+00:00
Next Scan 2025-05-02T11:40:49+00:00

Last Scan

Scanned2025-04-02T11:40:49+00:00
URL https://search.findmypast.co.uk/robots.txt
Domain IPs 104.26.6.28, 104.26.7.28, 172.67.68.160
Response IP 104.26.7.28
Found Yes
Hash bbcc7f03e3c6180fee00cfce5bcd9e6ca79ceac19f7c0446e3969bf37d6285ae
SimHash 650c7d10e5a1

Groups

*

Rule Path
Allow /
Disallow /bna/multifacet
Disallow /maps

googlebot

Rule Path
Disallow */search/*
Disallow */advancedsearch/form?*
Disallow /*nextPage%3D*
Disallow /*lastname%3D*
Disallow /*datasetname%3D*
Disallow /*county%3D*
Disallow /*datasetname%3D*
Disallow /*event_location%3D*
Disallow /*%26occupation
Disallow /*date%3D*
Disallow /*date_offsetdate%3D*
Disallow /*?user-origin=*
Disallow /search?tag=.*
Disallow /bna/multifacet
Disallow /maps

bingbot

Rule Path
Disallow */search/*
Disallow */advancedsearch/form?*
Disallow /*nextPage%3D*
Disallow /*lastname%3D*
Disallow /*datasetname%3D*
Disallow /*county%3D*
Disallow /*datasetname%3D*
Disallow /*event_location%3D*
Disallow /*%26occupation
Disallow /*date%3D*
Disallow /*date_offsetdate%3D*
Disallow /*?user-origin=*
Disallow /search?tag=.*
Disallow /bna/multifacet
Disallow /maps

awariobot
awariorssbot
awariosmartbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /search/periodical-source-index
Disallow /search-world-records/periodical-source-index

Other Records

Field Value
sitemap https://search.findmypast.co.uk/sitemap.xml

Comments

  • Help us to impact the way people research social and family history!
  • Check out our vacancies here: https://www.findmypast.co.uk/careers