gfcert.webwave.dev
robots.txt

Robots Exclusion Standard data for gfcert.webwave.dev

Resource Scan

Scan Details

Site Domain gfcert.webwave.dev
Base Domain webwave.dev
Scan Status Ok
Last Scan 2025-12-04T11:18:51+00:00
Next Scan 2026-01-03T11:18:51+00:00
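
The two timestamps above imply a rescan interval, which can be checked with a little date arithmetic. This is only a sketch: the variable names are illustrative, and the report itself does not state how the scanner schedules its next run.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details fields above.
last_scan = datetime.fromisoformat("2025-12-04T11:18:51+00:00")
next_scan = datetime.fromisoformat("2026-01-03T11:18:51+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 30
```

For this record the gap is exactly 30 days. Whether that is a fixed policy or an adaptive schedule cannot be told from a single record.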

Last Scan

Scanned 2025-12-04T11:18:51+00:00
URL https://gfcert.webwave.dev/robots.txt
Domain IPs 139.99.238.31
Response IP 139.99.238.31
Found Yes
Hash d65009cf61f86d886264ef9fa8c1ad3e6012b22f3a6b5cbf7c2b8dcc4ea87207
SimHash 495c99d6c532
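
The Hash field is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not name the algorithm or say whether the body is normalized before hashing, so the sketch below is an assumption: it hashes the raw response bytes with SHA-256. The function name `content_fingerprint` is hypothetical.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256. The report
    only shows a 64-character hex value and does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Any change in the body, even a single byte, yields a different digest,
# which is what makes such a value useful for detecting changes between scans.
print(content_fingerprint(b"User-agent: *\nAllow: /\n"))
```

An exact hash like this only signals that something changed. A similarity hash such as the SimHash field is normally used to judge how much changed, though this report does not describe how its SimHash is computed.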

Groups

*

Rule Path
Allow /
Disallow /*?*anchorElement=
Disallow /*?*scrollMargin=
Disallow /*?*lightbox=
Disallow /*?*forcePageWithoutCdn=
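
The group above allows everything except URLs whose query string contains one of four parameters. The sketch below shows how a crawler might evaluate these rules. It follows the matching convention in RFC 9309: `*` matches any run of characters, the longest matching pattern wins, and Allow wins a tie. Python's standard `urllib.robotparser` does not handle `*` wildcards, so the matcher is written by hand. The names `pattern_to_regex` and `is_allowed` are illustrative, and real crawlers may differ in detail.

```python
import re

# Rules copied from the "*" group in the scan above.
RULES = [
    ("allow", "/"),
    ("disallow", "/*?*anchorElement="),
    ("disallow", "/*?*scrollMargin="),
    ("disallow", "/*?*lightbox="),
    ("disallow", "/*?*forcePageWithoutCdn="),
]


def pattern_to_regex(path_pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the beginning of the path.
    """
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    body = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path_and_query, rules=RULES):
    """Return True if the most specific matching rule is an Allow rule."""
    best = None  # (pattern length, is_allow); longer wins, Allow wins ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path_and_query):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # With no matching rule at all, crawling is permitted by default.
    return True if best is None else best[1]


print(is_allowed("/"))                          # True
print(is_allowed("/page?x=1"))                  # True
print(is_allowed("/page?anchorElement=intro"))  # False
print(is_allowed("/page?x=1&lightbox=img"))     # False
```

Because each Disallow pattern contains a literal `?`, a path such as `/lightbox=abc` with no query string stays crawlable. Only the parameterized variants of a page are excluded.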

Other Records

Field Value
sitemap https://gfcert.webwave.dev/sitemap.xml
sitemap https://gfcert.webwave.dev/sitemap