intarch.ac.uk
robots.txt

Robots Exclusion Standard data for intarch.ac.uk

Resource Scan

Scan Details

Site Domain intarch.ac.uk
Base Domain intarch.ac.uk
Scan Status Ok
Last Scan2025-10-27T01:58:50+00:00
Next Scan 2025-11-26T01:58:50+00:00

Last Scan

Scanned2025-10-27T01:58:50+00:00
URL https://intarch.ac.uk/robots.txt
Domain IPs 104.21.44.130, 172.67.199.253, 2606:4700:3032::6815:2c82, 2606:4700:3032::ac43:c7fd
Response IP 104.21.44.130
Found Yes
Hash ad5b9c9190f97e73f959265cf8f54997b5a39fb55ee7775ddfd8b366a2d3854b
SimHash 6914de64c6d3

Groups

*

Rule Path
Disallow /about/stats/
Disallow /cfm/
Disallow /cgi-bin/
Disallow /news/

Other Records

Field Value
sitemap http://intarch.ac.uk/sitemap.txt