spacelist.ca
robots.txt
Robots Exclusion Standard data for spacelist.ca
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | spacelist.ca |
Base Domain | spacelist.ca |
Scan Status | Ok |
Last Scan | 2024-09-20T01:44:43+00:00 |
Next Scan | 2024-09-27T01:44:43+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-20T01:44:43+00:00 |
URL | https://spacelist.ca/robots.txt |
Redirect | https://www.spacelist.ca/robots.txt |
Redirect Domain | www.spacelist.ca |
Redirect Base | spacelist.ca |
Domain IPs | 23.22.5.68, 3.226.182.14, 52.21.227.162, 54.237.159.171 |
Redirect IPs | 23.22.5.68, 3.226.182.14, 52.21.227.162, 54.237.159.171 |
Response IP | 3.226.182.14 |
Found | Yes |
Hash | 842190f3a8cf161d6f4c9257e5f49acdd19a0979d6b2952980460fd60746bc65 |
SimHash | ba054c87fc51 |
Groups
*
Rule | Path |
---|---|
Disallow | /advertising |
Disallow | /data |
Other Records
Field | Value |
---|---|
sitemap | https://www.spacelist.ca/sitemap.xml.gz |
sitemap | https://www.spacelist.co/sitemap.xml.gz |
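
The group and sitemap records above can be checked against sample URLs with Python's standard urllib.robotparser module. This is a minimal sketch, not the scanner's own method: the ROBOTS_TXT text is reconstructed from the tables on this page and is assumed, not confirmed, to match the live file byte for byte, and the /listings path and the build_parser helper are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "Groups" and "Other Records" tables above.
# Assumption: the live file may differ in ordering, whitespace, or comments.
ROBOTS_TXT = """\
User-agent: *
Disallow: /advertising
Disallow: /data

Sitemap: https://www.spacelist.ca/sitemap.xml.gz
Sitemap: https://www.spacelist.co/sitemap.xml.gz
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network request is made)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Both disallowed paths are blocked for every user agent.
blocked_data = parser.can_fetch("*", "https://www.spacelist.ca/data")
blocked_ads = parser.can_fetch("*", "https://www.spacelist.ca/advertising")

# "/listings" is a hypothetical path; no rule in the group matches it.
allowed_other = parser.can_fetch("*", "https://www.spacelist.ca/listings")

# Sitemap records are exposed separately from the rule groups (Python 3.8+).
sitemaps = parser.site_maps()

print(blocked_data, blocked_ads, allowed_other)
print(sitemaps)
```

Note that urllib.robotparser matches Disallow values as path prefixes, so under this parser a path such as /database would also be blocked by the /data rule.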