dhwebsites.co.uk
robots.txt

Robots Exclusion Standard data for dhwebsites.co.uk

Resource Scan

Scan Details

Site Domain dhwebsites.co.uk
Base Domain dhwebsites.co.uk
Scan Status Ok
Last Scan2025-10-26T09:22:53+00:00
Next Scan 2025-11-25T09:22:53+00:00

Last Scan

Scanned2025-10-26T09:22:53+00:00
URL https://dhwebsites.co.uk/robots.txt
Redirect https://www.dhwebsites.co.uk/robots.txt
Redirect Domain www.dhwebsites.co.uk
Redirect Base dhwebsites.co.uk
Domain IPs 104.21.87.82, 172.67.142.131, 2606:4700:3034::ac43:8e83, 2606:4700:3037::6815:5752
Redirect IPs 104.21.87.82, 172.67.142.131, 2606:4700:3034::ac43:8e83, 2606:4700:3037::6815:5752
Response IP 104.21.87.82
Found Yes
Hash 4c50fc6f8e897d0ba2b8c9d604f7c049f90dd28f641ac5c9e28066fb66f86701
SimHash 6808cc330093

Groups

scrapy

Rule Path
Allow /

*

Rule Path
Allow /

Other Records

Field Value
crawl-delay 3

Other Records

Field Value
sitemap https://dhwebsites.co.uk/radioactivitylaq.xml
sitemap https://dhwebsites.co.uk/gramvariabledkq.xml
sitemap https://dhwebsites.co.uk/ceylaniterhq.xml
sitemap https://dhwebsites.co.uk/hexapetaloidaea.xml
sitemap https://dhwebsites.co.uk/amesvillebpk.xml
sitemap https://dhwebsites.co.uk/hitchcockdmy.xml
sitemap https://dhwebsites.co.uk/sitemap.xml