durhamsu.com
robots.txt

Robots Exclusion Standard data for durhamsu.com

Resource Scan

Scan Details

Site Domain durhamsu.com
Base Domain durhamsu.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-09-01T03:26:58+00:00
Next Scan 2024-11-30T03:26:58+00:00

Last Successful Scan

Scanned 2024-01-13T02:13:14+00:00
URL https://www.durhamsu.com/robots.txt
Domain IPs 13.227.254.101, 13.227.254.27, 13.227.254.49, 13.227.254.6, 2600:9000:200a:4400:9:8364:e040:93a1, 2600:9000:200a:8400:9:8364:e040:93a1, 2600:9000:200a:8a00:9:8364:e040:93a1, 2600:9000:200a:8e00:9:8364:e040:93a1, 2600:9000:200a:9600:9:8364:e040:93a1, 2600:9000:200a:ac00:9:8364:e040:93a1, 2600:9000:200a:ee00:9:8364:e040:93a1, 2600:9000:200a:fc00:9:8364:e040:93a1
Response IP 18.245.31.66
Found Yes
Hash 03d5dcf5887db9fcf28baa5a126a2959b4dcc9d0c09a18bc6c6c0a8961289314
SimHash b2850d8d6370

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5
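
The records above (a single `*` group with no rules, plus a crawl-delay of 5) can be illustrated with a minimal sketch using Python's standard `urllib.robotparser`. The file text below is an assumed reconstruction from the scan data, not the verbatim file, and the `ExampleBot` user agent and the `/any/path` URL are hypothetical names chosen for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the scanned file: one "*" group with no
# Allow/Disallow rules and a crawl-delay of 5. The real file may differ.
ROBOTS_TXT = """\
# To ban all spiders from the entire site uncomment the next two lines:
# User-agent: *
# Disallow: /
User-agent: *
Crawl-delay: 5
"""


def load_rules(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(ROBOTS_TXT)

# "ExampleBot" and the path are hypothetical; any agent falls back to "*".
allowed = rules.can_fetch("ExampleBot", "https://www.durhamsu.com/any/path")
delay = rules.crawl_delay("ExampleBot")

print(allowed, delay)  # True 5
```

With no rules in the group, every path is reported as fetchable, and the crawl-delay value is returned to any agent that does not have a group of its own.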

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /
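
The effect of uncommenting those two lines can be checked with a short sketch, again using Python's standard `urllib.robotparser`. This is an illustration under assumptions, not something observed in the scan: the `ExampleBot` user agent is hypothetical, and the two rule lines are taken from the comments above with the leading `#` removed.

```python
from urllib.robotparser import RobotFileParser

# The two commented-out lines from the file, uncommented for illustration.
BAN_ALL = [
    "User-agent: *",
    "Disallow: /",
]

ban_parser = RobotFileParser()
ban_parser.parse(BAN_ALL)

# "ExampleBot" is a hypothetical crawler name; "*" applies to every agent.
root_allowed = ban_parser.can_fetch("ExampleBot", "https://www.durhamsu.com/")
deep_allowed = ban_parser.can_fetch("ExampleBot", "https://www.durhamsu.com/a/b")

print(root_allowed, deep_allowed)  # False False
```

Because `Disallow: /` matches every path by prefix, a compliant crawler would skip the whole site once those lines are active.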