gladsaxe.dk
robots.txt

Robots Exclusion Standard data for gladsaxe.dk

Resource Scan

Scan Details

Site Domain gladsaxe.dk
Base Domain gladsaxe.dk
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-12-02T13:38:45+00:00
Next Scan 2026-03-02T13:38:45+00:00

Last Successful Scan

Scanned2025-03-25T04:17:47+00:00
URL https://gladsaxe.dk/robots.txt
Domain IPs 20.50.64.13
Response IP 20.50.64.13
Found Yes
Hash 20f11590f1f0f204eef19e948eab0806d6b66bd125b6bc673a87b41f7ef968d4
SimHash 7b06196275c0

Groups

*

Rule Path
Disallow /app_browsers/
Disallow /app_code/
Disallow /app_data/
Disallow /app_plugins/
Disallow /bin/
Disallow /cdn-cgi/
Disallow /config/
Disallow /controllers/
Disallow /config/
Disallow /css/
Disallow /images/
Disallow /obj/
Disallow /properties/
Disallow /robots/
Disallow /scripts/
Disallow /umbraco/
Disallow /umbraco_client/
Disallow /views/

sogou web spider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://gladsaxe.dk/sitemap.xml