clearhaus.com
robots.txt

Robots Exclusion Standard data for clearhaus.com

Resource Scan

Scan Details

Site Domain clearhaus.com
Base Domain clearhaus.com
Scan Status Ok
Last Scan2024-05-31T23:28:05+00:00
Next Scan 2024-06-30T23:28:05+00:00

Last Scan

Scanned2024-05-31T23:28:05+00:00
URL https://www.clearhaus.com/robots.txt
Domain IPs 13.33.88.125, 13.33.88.60, 13.33.88.66, 13.33.88.87
Response IP 13.33.88.60
Found Yes
Hash 60f2d4c2a083498473b78ce92e9db0e77efa5923946af484c73060edaba0f91a
SimHash 8628c02150d3

Groups

*

Rule Path
Disallow /api/
Disallow /assets/Prisliste_DK_da.pdf
Disallow /welcome/
Disallow /dk/velkommen/
Disallow /no/velkommen/
Disallow /se/vaelkommen/
Disallow /dk/abonnementer/
Disallow /404/
Disallow /dk/404/
Disallow /no/404/
Disallow /se/404/
Disallow /pl/404/
Disallow /de/404/
Disallow /ro/404/
Disallow /es/404/
Disallow /fi/404/
Disallow /sl/404/
Disallow /lv/404/