/.well-known/

Log In Sign Up

nus.org.uk
robots.txt

Robots Exclusion Standard data for nus.org.uk

Archived Snapshots

Resource Scan

Scan Details

Site Domain	nus.org.uk
Base Domain	nus.org.uk
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-08-22T00:37:11+00:00
Next Scan	2025-11-20T00:37:11+00:00

Last Successful Scan

Scanned	2022-10-04T07:05:49+00:00
URL	https://nus.org.uk/robots.txt
Redirect	https://www.nus.org.uk/robots.txt
Redirect Domain	www.nus.org.uk
Redirect Base	nus.org.uk
Response IP	13.227.138.27, 13.227.138.78, 13.227.138.8, 13.227.138.69
Found	Yes
Hash	03d5dcf5887db9fcf28baa5a126a2959b4dcc9d0c09a18bc6c6c0a8961289314
SimHash	b2850d8d6370

Groups

*

No rules defined. All paths allowed.

Other Records

Field

Value

crawl-delay

5

Back to top

Comments

See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:
User-agent: *
Disallow: /

Back to top