gfcert.webwave.dev
robots.txt
Robots Exclusion Standard data for gfcert.webwave.dev
Resource Scan
Scan Details
| Site Domain | gfcert.webwave.dev |
| Base Domain | webwave.dev |
| Scan Status | Ok |
| Last Scan | 2025-12-04T11:18:51+00:00 |
| Next Scan | 2026-01-03T11:18:51+00:00 |
Last Scan
| Scanned | 2025-12-04T11:18:51+00:00 |
| URL | https://gfcert.webwave.dev/robots.txt |
| Domain IPs | 139.99.238.31 |
| Response IP | 139.99.238.31 |
| Found | Yes |
| Hash | d65009cf61f86d886264ef9fa8c1ad3e6012b22f3a6b5cbf7c2b8dcc4ea87207 |
| SimHash | 495c99d6c532 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
| Disallow | /*?*anchorElement= |
| Disallow | /*?*scrollMargin= |
| Disallow | /*?*lightbox= |
| Disallow | /*?*forcePageWithoutCdn= |
Other Records
| Field | Value |
|---|---|
| sitemap | https://gfcert.webwave.dev/sitemap.xml |
| sitemap | https://gfcert.webwave.dev/sitemap |