breakthrough.org
robots.txt
Robots Exclusion Standard data for breakthrough.org
Resource Scan
Scan Details
Site Domain | breakthrough.org |
Base Domain | breakthrough.org |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-06-11T03:35:42+00:00 |
Next Scan | 2024-06-25T03:35:42+00:00 |
Last Successful Scan
Scanned | 2024-05-20T03:10:04+00:00 |
URL | https://breakthrough.org/robots.txt |
Domain IPs | 104.26.12.170, 104.26.13.170, 172.67.71.155, 2606:4700:20::681a:caa, 2606:4700:20::681a:daa, 2606:4700:20::ac43:479b |
Response IP | 104.26.13.170 |
Found | Yes |
Hash | 74d0994d2d135ad1d2ad3015afdd142d890e985b38eea0ce86adf6dbfe7cdb83 |
SimHash | 4919ccc2e6b5 |
Groups
*
Rule | Path |
---|---|
Disallow | /calendar/action* |
Disallow | /events/action* |
Allow | /*.css |
Allow | /*.js |
Disallow | /*? |
Other Records
Field | Value |
---|---|
crawl-delay | 3 |