smithproulx.ca
robots.txt
Robots Exclusion Standard data for smithproulx.ca
Resource Scan
Scan Details
| Site Domain | smithproulx.ca |
| Base Domain | smithproulx.ca |
| Scan Status | Ok |
| Last Scan | 2025-11-25T05:32:01+00:00 |
| Next Scan | 2025-12-25T05:32:01+00:00 |
Last Scan
| Scanned | 2025-11-25T05:32:01+00:00 |
| URL | https://smithproulx.ca/robots.txt |
| Domain IPs | 104.21.32.237, 172.67.156.209, 2606:4700:3033::ac43:9cd1, 2606:4700:3037::6815:20ed |
| Response IP | 104.21.32.237 |
| Found | Yes |
| Hash | 94db298e054f09fff46bb9557959646ae276b428e52c6247d4d8f2ddf937cd51 |
| SimHash | 5d095ccccab3 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /wp-login.php |
| Disallow | /wp-admin/ |
| Disallow | /? |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 600 |
Other Records
| Field | Value |
|---|---|
| sitemap | https://smithproulx.ca/sitemap_index.xml |