/.well-known/

Log In Sign Up

durhamsu.com
robots.txt

Robots Exclusion Standard data for durhamsu.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	durhamsu.com
Base Domain	durhamsu.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-09-01T03:26:58+00:00
Next Scan	2024-11-30T03:26:58+00:00

Last Successful Scan

Scanned	2024-01-13T02:13:14+00:00
URL	https://www.durhamsu.com/robots.txt
Domain IPs	13.227.254.101, 13.227.254.27, 13.227.254.49, 13.227.254.6, 2600:9000:200a:4400:9:8364:e040:93a1, 2600:9000:200a:8400:9:8364:e040:93a1, 2600:9000:200a:8a00:9:8364:e040:93a1, 2600:9000:200a:8e00:9:8364:e040:93a1, 2600:9000:200a:9600:9:8364:e040:93a1, 2600:9000:200a:ac00:9:8364:e040:93a1, 2600:9000:200a:ee00:9:8364:e040:93a1, 2600:9000:200a:fc00:9:8364:e040:93a1
Response IP	18.245.31.66
Found	Yes
Hash	03d5dcf5887db9fcf28baa5a126a2959b4dcc9d0c09a18bc6c6c0a8961289314
SimHash	b2850d8d6370

Groups

*

No rules defined. All paths allowed.

Other Records

Field

Value

crawl-delay

5

Back to top

Comments

See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:
User-agent: *
Disallow: /

Back to top