indiewebcamp.com
robots.txt
Robots Exclusion Standard data for indiewebcamp.com
Resource Scan
Scan Details
| Site Domain | indiewebcamp.com |
| Base Domain | indiewebcamp.com |
| Scan Status | Ok |
| Last Scan | 2025-11-20T23:23:37+00:00 |
| Next Scan | 2025-12-20T23:23:37+00:00 |
Last Scan
| Scanned | 2025-11-20T23:23:37+00:00 |
| URL | https://indiewebcamp.com/robots.txt |
| Redirect | https://indieweb.org/robots.txt |
| Redirect Domain | indieweb.org |
| Redirect Base | indieweb.org |
| Domain IPs | 104.237.158.110 |
| Redirect IPs | 104.21.25.212, 172.67.134.176, 2606:4700:3032::ac43:86b0, 2606:4700:3033::6815:19d4 |
| Response IP | 172.67.134.176 |
| Found | Yes |
| Hash | 357872c138ed0514af7c83d689c168691e8e195e00010cd0c66edfb2492ed64b |
| SimHash | 46354b53c595 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
*
| Rule | Path |
|---|---|
| Disallow | /wiki/ |
| Disallow | /Special%3A |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 4 |
Warnings
- `content-signal` is not a known field.
Comments