nsse.co.uk
robots.txt

Robots Exclusion Standard data for nsse.co.uk

Resource Scan

Scan Details

Site Domain nsse.co.uk
Base Domain nsse.co.uk
Scan Status Ok
Last Scan2025-09-29T00:01:05+00:00
Next Scan 2025-10-06T00:01:05+00:00

Last Scan

Scanned2025-09-29T00:01:05+00:00
URL https://nsse.co.uk/robots.txt
Redirect https://www.nsse.co.uk/robots.txt
Redirect Domain www.nsse.co.uk
Redirect Base nsse.co.uk
Domain IPs 5.101.139.98
Redirect IPs 5.101.139.98
Response IP 5.101.139.98
Found Yes
Hash cf5bc7f62978b78193f3378f85b17bd03ef791016a10b62dfac3237717c2bbcf
SimHash 41381d723fb7

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://www.nsse.co.uk/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://www.nsse.co.uk/
  • live - don't allow web crawlers to index cpresources/ or vendor/