indieweb.org
robots.txt
Robots Exclusion Standard data for indieweb.org
Resource Scan
Scan Details
| Site Domain | indieweb.org |
| Base Domain | indieweb.org |
| Scan Status | Ok |
| Last Scan | 2025-11-26T23:11:18+00:00 |
| Next Scan | 2025-12-26T23:11:18+00:00 |
Last Scan
| Scanned | 2025-11-26T23:11:18+00:00 |
| URL | https://indieweb.org/robots.txt |
| Domain IPs | 104.21.25.212, 172.67.134.176, 2606:4700:3032::ac43:86b0, 2606:4700:3033::6815:19d4 |
| Response IP | 104.21.25.212 |
| Found | Yes |
| Hash | 357872c138ed0514af7c83d689c168691e8e195e00010cd0c66edfb2492ed64b |
| SimHash | 46354b53c595 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
*
| Rule | Path |
|---|---|
| Disallow | /wiki/ |
| Disallow | /Special%3A |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 4 |
Warnings
- `content-signal` is not a known field.
Comments