reference.org
robots.txt
Robots Exclusion Standard data for reference.org
Resource Scan
Scan Details
| Site Domain | reference.org |
| Base Domain | reference.org |
| Scan Status | Ok |
| Last Scan | 2025-11-02T04:17:50+00:00 |
| Next Scan | 2025-12-02T04:17:50+00:00 |
Last Scan
| Scanned | 2025-11-02T04:17:50+00:00 |
| URL | https://reference.org/robots.txt |
| Domain IPs | 104.21.7.104, 172.67.130.29, 2606:4700:3030::ac43:821d, 2606:4700:3037::6815:768 |
| Response IP | 104.21.7.104 |
| Found | Yes |
| Hash | 8bcbad96e538dac3ec8f2e26b02318397c1d4fc73cb9a1e9bd9edf717997103f |
| SimHash | 44354b53cdd5 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
*
| Rule | Path |
|---|---|
| Disallow | /news/ |
| Disallow | /newsarticles/ |
| Disallow | /admin/ |
| Disallow | /wiki/ |
| Disallow | /account/ |
| Disallow | /Account/ |
| Disallow | /profile/ |
Warnings
- `content-signal` is not a known field.
- `sitemap` is not a known field.
Comments