h2g2.com
robots.txt
Robots Exclusion Standard data for h2g2.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | h2g2.com |
| Base Domain | h2g2.com |
| Scan Status | Ok |
| Last Scan | 2026-02-19T18:24:52+00:00 |
| Next Scan | 2026-02-26T18:24:52+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2026-02-19T18:24:52+00:00 |
| URL | https://h2g2.com/robots.txt |
| Domain IPs | 104.26.10.162, 104.26.11.162, 172.67.74.253, 2606:4700:20::681a:aa2, 2606:4700:20::681a:ba2, 2606:4700:20::ac43:4afd |
| Response IP | 104.26.11.162 |
| Found | Yes |
| Hash | 1668e4a46652bc84f7c292d6afdde4df0499a17b4ceeda5648785d4120b04a4f |
| SimHash | 44354b53c555 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
*
| Rule | Path |
|---|---|
| Disallow | /dna/ |
| Disallow | /h2g2/blobs/ |
| Disallow | /f/ |
| Disallow | /img/ |
| Disallow | /images/ |
| Disallow | /css/ |
| Disallow | /js/ |
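
Both groups above target the same user agent (`*`), so a crawler following RFC 9309 would be expected to combine their rules and apply the longest matching path, with Allow winning a tie. The sketch below illustrates that evaluation for the rules listed in this scan. It is a minimal illustration, not the scanner's own logic: the `RULES` list and the `is_allowed` helper are hypothetical names, the test paths are made up, and wildcard (`*`, `$`) handling is omitted because none of the listed paths use it.

```python
# Rules from both "*" groups in the scan, merged into one list.
RULES = [
    ("allow", "/"),
    ("disallow", "/dna/"),
    ("disallow", "/h2g2/blobs/"),
    ("disallow", "/f/"),
    ("disallow", "/img/"),
    ("disallow", "/images/"),
    ("disallow", "/css/"),
    ("disallow", "/js/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for sample in ("/", "/dna/h2g2", "/img", "/js/app.js"):
        print(sample, is_allowed(sample))
```

Under these assumptions, `Allow: /` only decides paths that no Disallow rule matches, since every Disallow path here is longer than `/`. Parsers that apply first-match ordering, or that honor only the first `*` group, may reach a different result.
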
Warnings
- `content-signal` is not a known field.
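
A warning like the one above typically arises when a parser meets a field name outside the set it recognizes. The following sketch shows one way such a check might work. The `KNOWN_FIELDS` set, the `unknown_fields` helper, and the `SAMPLE` text (including the placeholder field value) are assumptions for illustration, not the actual contents of the scanned file or the scanner's real field list.

```python
# Assumed set of recognized field names; a real scanner may accept more.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def unknown_fields(text):
    """Return unrecognized field names in order of first appearance."""
    seen = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue  # blank or malformed line
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.append(field)
    return seen


# Hypothetical robots.txt fragment; the field value is a placeholder.
SAMPLE = """\
User-agent: *
Content-Signal: example-value
Allow: /
"""

if __name__ == "__main__":
    for name in unknown_fields(SAMPLE):
        print(f"`{name}` is not a known field.")
```

Field names are compared case-insensitively here, which matches how the warning reports the name in lowercase.
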