Robots Exclusion Standard data for whitehawks.dk

Resource Scan

Scan Details

Site Domain whitehawks.dk
Base Domain whitehawks.dk
Scan Status Ok
Last Scan 2024-11-09T16:43:35+00:00
Next Scan 2024-11-16T16:43:35+00:00

Last Scan

Scanned 2024-11-09T16:43:35+00:00
URL https://whitehawks.dk/robots.txt
Domain IPs 93.191.152.43
Response IP 93.191.152.43
Found Yes
Hash a96a2d1e1541db13291ad9edf34d3026276c69f501ec415c3d4a3f19af849f9a
SimHash 210a7a220b00
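
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest. The report does not say how it is computed, so the following is only a sketch that assumes the digest is taken over the raw bytes of the robots.txt response body. The function name `fingerprint` is hypothetical.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw, unmodified response bytes.
    The report itself does not state the algorithm or the input.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Hypothetical usage: hash a locally saved copy of the file.
    sample = b"User-agent: *\nDisallow: /umbraco/\n"
    print(fingerprint(sample))
```

Comparing such a digest between two scans is a cheap way to tell whether the file changed at all. A similarity hash such as the SimHash field would be needed to judge how much it changed.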

Groups

*

Rule Path
Disallow /aspnet_client/
Disallow /bin/
Disallow /config/
Disallow /data/
Disallow /install/
Disallow /macroScripts/
Disallow /masterpages/
Disallow /umbraco/
Disallow /umbraco_client/
Disallow /usercontrols/
Disallow /xslt/

libwww-perl

Rule Path
Disallow /
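
To see how the two groups above apply to a crawler, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt text is reconstructed from the rules listed in this report, so its exact layout and line order are an assumption. The user agent `ExampleBot` and the helper `allowed` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this report; the original
# file's exact formatting and ordering are assumed, not known.
ROBOTS_TXT = """\
# robots.txt for Umbraco
User-agent: *
Disallow: /aspnet_client/
Disallow: /bin/
Disallow: /config/
Disallow: /data/
Disallow: /install/
Disallow: /macroScripts/
Disallow: /masterpages/
Disallow: /umbraco/
Disallow: /umbraco_client/
Disallow: /usercontrols/
Disallow: /xslt/

User-agent: libwww-perl
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    for agent, path in [
        ("ExampleBot", "/"),
        ("ExampleBot", "/umbraco/login"),
        ("libwww-perl/6.05", "/"),
    ]:
        print(agent, path, allowed(agent, path))
```

A generic crawler falls under the `*` group and is blocked only from the listed Umbraco and ASP.NET directories, while any agent identifying as libwww-perl is blocked from the whole site.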

Other Records

Field Value
sitemap http://{HTTP_HOST}/sitemap

Comments

  • robots.txt for Umbraco