truro.ca
robots.txt
Robots Exclusion Standard data for truro.ca
Resource Scan
Scan Details
| Site Domain | truro.ca |
| Base Domain | truro.ca |
| Scan Status | Ok |
| Last Scan | 2025-11-01T18:08:15+00:00 |
| Next Scan | 2025-12-01T18:08:15+00:00 |
Last Scan
| Scanned | 2025-11-01T18:08:15+00:00 |
| URL | https://truro.ca/robots.txt |
| Domain IPs | 192.124.249.7 |
| Response IP | 192.124.249.7 |
| Found | Yes |
| Hash | 398a7f9e4cb7c56182cd6e853c5fd70c8542096ad5e40592195c954a146e439d |
| SimHash | e21f1d59cdd4 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /administrator/ |
| Disallow | /api/ |
| Disallow | /bin/ |
| Disallow | /cache/ |
| Disallow | /cli/ |
| Disallow | /components/ |
| Disallow | /includes/ |
| Disallow | /installation/ |
| Disallow | /language/ |
| Disallow | /layouts/ |
| Disallow | /libraries/ |
| Disallow | /logs/ |
| Disallow | /modules/ |
| Disallow | /plugins/ |
| Disallow | /tmp/ |
*
| Rule | Path |
|---|---|
| Allow | /sitemap-4seo |
Other Records
| Field | Value |
|---|---|
| sitemap | https://truro.ca/sitemap-4seo.xml |
Comments