allwaste.com
robots.txt
Robots Exclusion Standard data for allwaste.com
Resource Scan
Scan Details
| Site Domain | allwaste.com |
| Base Domain | allwaste.com |
| Scan Status | Ok |
| Last Scan | 2026-01-20T13:43:41+00:00 |
| Next Scan | 2026-02-19T13:43:41+00:00 |
Last Scan
| Scanned | 2026-01-20T13:43:41+00:00 |
| URL | https://allwaste.com/robots.txt |
| Domain IPs | 104.26.10.30, 104.26.11.30, 172.67.72.50, 2606:4700:20::681a:a1e, 2606:4700:20::681a:b1e, 2606:4700:20::ac43:4832 |
| Response IP | 172.67.72.50 |
| Found | Yes |
| Hash | 6b68d29b79840c225b6c986a7f36703e85d3f87e0b6bfee08c16e01803f46ee3 |
| SimHash | 4b4cd840e092 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /wp-content/uploads/wpforms/ |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 10 |
*
| Rule | Path |
|---|---|
| Disallow |
Other Records
| Field | Value |
|---|---|
| sitemap | https://allwaste.com/sitemap_index.xml |
Warnings
- 1 invalid line.
Comments