Robots Exclusion Standard data for clearhaus.com
Resource Scan
Scan Details
Site Domain | clearhaus.com |
Base Domain | clearhaus.com |
Scan Status | Ok |
Last Scan | 2024-05-31T23:28:05+00:00 |
Next Scan | 2024-06-30T23:28:05+00:00 |
Last Scan
Scanned | 2024-05-31T23:28:05+00:00 |
URL | https://www.clearhaus.com/robots.txt |
Domain IPs | 13.33.88.125, 13.33.88.60, 13.33.88.66, 13.33.88.87 |
Response IP | 13.33.88.60 |
Found | Yes |
Hash | 60f2d4c2a083498473b78ce92e9db0e77efa5923946af484c73060edaba0f91a |
SimHash | 8628c02150d3 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /api/ |
Disallow | /assets/Prisliste_DK_da.pdf |
Disallow | /welcome/ |
Disallow | /dk/velkommen/ |
Disallow | /no/velkommen/ |
Disallow | /se/vaelkommen/ |
Disallow | /dk/abonnementer/ |
Disallow | /404/ |
Disallow | /dk/404/ |
Disallow | /no/404/ |
Disallow | /se/404/ |
Disallow | /pl/404/ |
Disallow | /de/404/ |
Disallow | /ro/404/ |
Disallow | /es/404/ |
Disallow | /fi/404/ |
Disallow | /sl/404/ |
Disallow | /lv/404/ |
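
The rules above can be checked programmatically. The following is a minimal sketch that rebuilds the `*` group from the table and evaluates it with Python's standard `urllib.robotparser`. The crawler name `ExampleBot`, the helper `is_allowed`, and the sample paths are hypothetical and used only for illustration. The standard-library parser applies simple prefix matching, which may differ in edge cases from how a particular crawler interprets the file.

```python
from urllib.robotparser import RobotFileParser

# Disallow paths copied from the "*" group in the table above.
DISALLOWED_PATHS = [
    "/api/", "/assets/Prisliste_DK_da.pdf", "/welcome/",
    "/dk/velkommen/", "/no/velkommen/", "/se/vaelkommen/",
    "/dk/abonnementer/", "/404/", "/dk/404/", "/no/404/",
    "/se/404/", "/pl/404/", "/de/404/", "/ro/404/",
    "/es/404/", "/fi/404/", "/sl/404/", "/lv/404/",
]

# Rebuild the robots.txt text for the wildcard group.
robots_lines = ["User-agent: *"] + [f"Disallow: {p}" for p in DISALLOWED_PATHS]

parser = RobotFileParser()
parser.parse(robots_lines)  # parse in-memory lines; no network request is made


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if `agent` may fetch `path` under the rules above.

    `ExampleBot` is a hypothetical crawler name; any agent falls into
    the "*" group because no other group is defined.
    """
    return parser.can_fetch(agent, "https://www.clearhaus.com" + path)


if __name__ == "__main__":
    for sample in ["/dk/", "/api/v1/", "/de/404/", "/api"]:
        print(sample, is_allowed(sample))
```

Because each rule ends in a trailing slash (except the PDF), only URLs under that directory prefix are matched; a path such as `/api` without the slash is not covered by `Disallow: /api/` in this parser.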