hart-haarlem.nl
robots.txt
Robots Exclusion Standard data for hart-haarlem.nl
Resource Scan
Scan Details
Site Domain | hart-haarlem.nl |
Base Domain | hart-haarlem.nl |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-09-29T15:52:35+00:00 |
Next Scan | 2024-09-30T15:52:35+00:00 |
Last Successful Scan
Scanned | 2024-09-15T15:52:23+00:00 |
URL | https://hart-haarlem.nl/robots.txt |
Redirect | https://www.hart-haarlem.nl/robots.txt |
Redirect Domain | www.hart-haarlem.nl |
Redirect Base | hart-haarlem.nl |
Domain IPs | 185.50.95.98, 2a00:c660:5126:2100::3 |
Redirect IPs | 185.50.95.98, 2a00:c660:5126:2100::3 |
Response IP | 185.50.95.98 |
Found | Yes |
Hash | 94d187a44dd3aeee7761c935aa05a0e15d7ec6c81804e49570da6a7a4655aa08 |
SimHash | 345fd14bc6b5 |
Groups
*
Rule | Path |
---|---|
Disallow | /ajax/ |
Disallow | /api/ |
Disallow | /CFIDE/ |
Disallow | /includes/ |
Disallow | /spanz/ |
Disallow | /vacatures/direct-solliciteren/ |
Disallow | /readme.html |
Other Records
Field | Value |
---|---|
crawl-delay | 20 |
Other Records
Field | Value |
---|---|
sitemap | https://www.hart-haarlem.nl/sitemap.xml |
Comments