cal-waste.ca
robots.txt

Robots Exclusion Standard data for cal-waste.ca

Resource Scan

Scan Details

Site Domain cal-waste.ca
Base Domain cal-waste.ca
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-09-03T07:58:14+00:00
Next Scan 2025-12-02T07:58:14+00:00

Last Successful Scan

Scanned2025-05-06T11:48:32+00:00
URL https://cal-waste.ca/robots.txt
Redirect https://www.cal-waste.ca/robots.txt
Redirect Domain www.cal-waste.ca
Redirect Base cal-waste.ca
Domain IPs 52.74.116.56
Redirect IPs 13.212.57.143
Response IP 13.212.57.143
Found Yes
Hash 358d3eb982363c047ed65f88c9ca3c4fc83537cd0b0a9935f9049033c7d3c6cf
SimHash 6f111c51cd13

Groups

*

Rule Path
Disallow /CFIDE/
Disallow /wwscripts/
Disallow /beacon.cfm

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.cal-waste.ca/sitemap.xml