uswheat.org
robots.txt
Robots Exclusion Standard data for uswheat.org
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | uswheat.org |
| Base Domain | uswheat.org |
| Scan Status | Ok |
| Last Scan | 2025-10-21T05:05:15+00:00 |
| Next Scan | 2025-11-20T05:05:15+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-10-21T05:05:15+00:00 |
| URL | https://uswheat.org/robots.txt |
| Domain IPs | 104.21.56.71, 172.67.179.208, 2606:4700:3033::6815:3847, 2606:4700:3035::ac43:b3d0 |
| Response IP | 104.21.56.71 |
| Found | Yes |
| Hash | 1fab33faffec21aeb4249ffc52b8335b5db72ba3c26aac7c71261c4a5e94a4f1 |
| SimHash | 4118cdc267b5 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /calendar/action* |
| Disallow | /events/action* |
| Disallow | /cdn-cgi* |
| Allow | /*.css |
| Allow | /*.js |
| Disallow | /*? |
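The rules above mix prefix paths with `*` wildcards, so which rule wins for a given URL is not obvious at a glance. The following is a minimal Python sketch of how a crawler might evaluate them, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers for illustration, not part of any library, and real crawlers may differ in the details.

```python
import re

# Rules copied from the "*" group in the table above, in listed order.
RULES = [
    ("disallow", "/calendar/action*"),
    ("disallow", "/events/action*"),
    ("disallow", "/cdn-cgi*"),
    ("allow", "/*.css"),
    ("allow", "/*.js"),
    ("disallow", "/*?"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the path's first character.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be fetched.

    Assumption: the longest matching pattern wins, Allow beats Disallow
    on equal length, and a path that matches no rule is allowed.
    """
    best_len, verdict = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, verdict = len(pattern), allow
    return verdict
```

Under these assumptions, `is_allowed("/news?page=2")` is false because only `/*?` matches, while `is_allowed("/style.css?ver=1")` is true because `/*.css` is a longer pattern than `/*?`.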
Other Records
| Field | Value |
|---|---|
| crawl-delay | 3 |
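The crawl-delay record is not part of RFC 9309, and crawlers differ in whether and how they honor it. One common reading is a minimum of 3 seconds between successive requests to the site. A minimal sketch of that reading follows; `PoliteThrottle` and `CRAWL_DELAY_SECONDS` are hypothetical names, and the injectable `clock` and `sleep` parameters exist only so the behavior can be checked without real waiting.

```python
import time

# Value taken from the crawl-delay record above; interpreting it as
# "seconds between requests" is an assumption.
CRAWL_DELAY_SECONDS = 3


class PoliteThrottle:
    """Enforce a minimum gap between consecutive requests."""

    def __init__(self, delay, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous request, if any

    def wait(self):
        """Block until `delay` seconds have passed since the last call.

        Returns the number of seconds actually slept (0.0 if none).
        """
        waited = 0.0
        if self._last is not None:
            remaining = self.delay - (self._clock() - self._last)
            if remaining > 0:
                self._sleep(remaining)
                waited = remaining
        self._last = self._clock()
        return waited
```

A crawler would call `PoliteThrottle(CRAWL_DELAY_SECONDS).wait()` immediately before each fetch, reusing one throttle instance per site.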