intarch.ac.uk
robots.txt
Robots Exclusion Standard data for intarch.ac.uk
Resource Scan
Scan Details
| Site Domain | intarch.ac.uk |
| Base Domain | intarch.ac.uk |
| Scan Status | Ok |
| Last Scan | 2025-10-27T01:58:50+00:00 |
| Next Scan | 2025-11-26T01:58:50+00:00 |
Last Scan
| Scanned | 2025-10-27T01:58:50+00:00 |
| URL | https://intarch.ac.uk/robots.txt |
| Domain IPs | 104.21.44.130, 172.67.199.253, 2606:4700:3032::6815:2c82, 2606:4700:3032::ac43:c7fd |
| Response IP | 104.21.44.130 |
| Found | Yes |
| Hash | ad5b9c9190f97e73f959265cf8f54997b5a39fb55ee7775ddfd8b366a2d3854b |
| SimHash | 6914de64c6d3 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /about/stats/ |
| Disallow | /cfm/ |
| Disallow | /cgi-bin/ |
| Disallow | /news/ |
Other Records
| Field | Value |
|---|---|
| sitemap | http://intarch.ac.uk/sitemap.txt |