dhi.ac.uk
robots.txt

Robots Exclusion Standard data for dhi.ac.uk

Resource Scan

Scan Details

Site Domain dhi.ac.uk
Base Domain dhi.ac.uk
Scan Status Ok
Last Scan 2025-09-25T17:37:12+00:00
Next Scan 2025-10-25T17:37:12+00:00

Last Scan

Scanned 2025-09-25T17:37:12+00:00
URL https://dhi.ac.uk/robots.txt
Redirect https://www.dhi.ac.uk/robots.txt
Redirect Domain www.dhi.ac.uk
Redirect Base dhi.ac.uk
Domain IPs 143.167.2.91
Redirect IPs 143.167.2.91
Response IP 143.167.2.91
Found Yes
Hash 20f3e3897eb84550fa1f89bc8f568283d4c503f681f5dfc84eca7df48f741ee6
SimHash 4321dc5087b3
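
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the scan record does not name the algorithm or say exactly which bytes were hashed. As a minimal sketch under that assumption, the following compares a robots.txt body against the recorded value. The function name `matches_recorded_hash` is hypothetical and not part of any scanner API.

```python
import hashlib

# Hash value copied from the scan record above.
RECORDED_HASH = "20f3e3897eb84550fa1f89bc8f568283d4c503f681f5dfc84eca7df48f741ee6"


def matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Return True if the SHA-256 hex digest of `body` equals `recorded`.

    Assumption: the scanner hashes the raw response body with SHA-256.
    The record itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest() == recorded.lower()
```

A mismatch would mean either that the file has changed since the scan or that the assumption about the algorithm or hashed bytes is wrong.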

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
crawl-delay 2
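
The crawl-delay record is a non-standard extension that some crawlers read as a minimum number of seconds between requests. The sketch below shows one way a client might pace its requests to respect the value of 2 listed above. The helper `paced` and its `fetch`, `sleep`, and `clock` parameters are hypothetical names chosen for illustration, and treating the value as seconds is an assumption.

```python
import time
from typing import Callable, Iterable, List, Optional

CRAWL_DELAY_SECONDS = 2.0  # value of the crawl-delay record above, assumed to be seconds


def paced(
    urls: Iterable[str],
    fetch: Callable[[str], str],
    delay: float = CRAWL_DELAY_SECONDS,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> List[str]:
    """Call `fetch` on each URL, waiting so that consecutive requests
    start at least `delay` seconds apart."""
    results: List[str] = []
    last_start: Optional[float] = None
    for url in urls:
        if last_start is not None:
            remaining = delay - (clock() - last_start)
            if remaining > 0:
                sleep(remaining)
        last_start = clock()
        results.append(fetch(url))
    return results
```

Passing `sleep` and `clock` as parameters keeps the pacing logic testable without real waiting.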

baiduspider

Rule Path
Disallow /intoxicants/

yandex

Rule Path
Disallow /intoxicants/
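
To illustrate how the three groups above could be evaluated, here is a minimal sketch that follows the longest-match precedence described in RFC 9309, restricted to plain path prefixes (no `*` or `$` patterns, which these rules do not use). The `GROUPS` table is transcribed from the listing above. The function `is_allowed` is a hypothetical helper, and it expects a bare product token such as `Baiduspider` rather than a full User-Agent header.

```python
from typing import Dict, List, Tuple

# Rules per user-agent group, transcribed from the scan listing above.
GROUPS: Dict[str, List[Tuple[str, str]]] = {
    "*": [("disallow", "/wp-admin/"), ("allow", "/wp-admin/admin-ajax.php")],
    "baiduspider": [("disallow", "/intoxicants/")],
    "yandex": [("disallow", "/intoxicants/")],
}


def is_allowed(product_token: str, path: str) -> bool:
    """Evaluate `path` against the group for `product_token`.

    A crawler with its own group uses only that group; every other
    crawler falls back to "*". Among matching rules the longest prefix
    wins, and an allow rule wins a tie. No matching rule means allowed.
    """
    rules = GROUPS.get(product_token.lower(), GROUPS["*"])
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed
```

Under these rules a generic crawler may fetch `/wp-admin/admin-ajax.php` but nothing else under `/wp-admin/`, while Baiduspider and Yandex are bound only by their own `/intoxicants/` rule. Parsers that apply the first matching rule instead of the longest one can give a different answer for `admin-ajax.php`.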

Other Records

Field Value
sitemap https://www.dhi.ac.uk/sitemap.xml
sitemap https://www.dhi.ac.uk/news-sitemap.xml

Comments

  • Baiduspider
  • Yandex