compliancebox.ca
robots.txt
Robots Exclusion Standard data for compliancebox.ca
Resource Scan
Scan Details
Site Domain | compliancebox.ca |
Base Domain | compliancebox.ca |
Scan Status | Ok |
Last Scan | 5/27/2025, 12:24:14 AM |
Next Scan | 6/26/2025, 12:24:14 AM |
Last Scan
Scanned | 5/27/2025, 12:24:14 AM |
URL | https://compliancebox.ca/robots.txt |
Domain IPs | 104.21.0.105, 172.67.185.174, 2606:4700:3034::ac43:b9ae, 2606:4700:3036::6815:69 |
Response IP | 104.21.0.105 |
Found | Yes |
Hash | 1fab33faffec21aeb4249ffc52b8335b5db72ba3c26aac7c71261c4a5e94a4f1 |
SimHash | 4118cdc267b5 |
Groups
*
Rule | Path |
---|---|
Disallow | /calendar/action* |
Disallow | /events/action* |
Disallow | /cdn-cgi* |
Allow | /*.css |
Allow | /*.js |
Disallow | /*? |
Other Records
Field | Value |
---|---|
crawl-delay | 3 |
Comments