harlingercourant.nl
robots.txt
Robots Exclusion Standard data for harlingercourant.nl
Resource Scan
Scan Details
Site Domain | harlingercourant.nl |
Base Domain | harlingercourant.nl |
Scan Status | Ok |
Last Scan | 2024-11-08T19:17:16+00:00 |
Next Scan | 2024-11-15T19:17:16+00:00 |
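The two timestamps above are exactly seven days apart, which suggests (but does not confirm) a weekly rescan schedule. A minimal sketch of that arithmetic, using only the values shown in the table; the helper name `following_scan` is hypothetical and the fixed interval is an assumption inferred from these two values:

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2024-11-08T19:17:16+00:00")
next_scan = datetime.fromisoformat("2024-11-15T19:17:16+00:00")

# Difference between the two scans.
interval = next_scan - last_scan
print(interval)  # 7 days, 0:00:00


def following_scan(scan_time: datetime, step: timedelta) -> datetime:
    """Hypothetical helper: project the scan after `scan_time`,
    assuming the scanner keeps a fixed interval between scans."""
    return scan_time + step


# Reproduces the listed Next Scan value from the Last Scan value.
projected = following_scan(last_scan, interval)
print(projected.isoformat())
```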
Last Scan
Scanned | 2024-11-08T19:17:16+00:00 |
URL | https://harlingercourant.nl/robots.txt |
Redirect | https://www.harlingercourant.nl/robots.txt |
Redirect Domain | www.harlingercourant.nl |
Redirect Base | harlingercourant.nl |
Domain IPs | 2a01:7c8:bb0a:71d:5054:ff:fe36:7f25, 85.10.130.117 |
Redirect IPs | 2a01:7c8:bb0a:71d:5054:ff:fe36:7f25, 85.10.130.117 |
Response IP | 85.10.130.117 |
Found | Yes |
Hash | adff2ad206929c42300732ff0b85574f1909c207514a19a850d8c191277ac0e0 |
SimHash | 481740466396 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /site/ |
Disallow | /rental/ |
Disallow | /user/ |
Disallow | /mailings/ |
Disallow | /contacts/ |
Disallow | /cms/ |
Disallow | /index.php |
Other Records
Field | Value |
---|---|
crawl-delay | 3 |
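The records above can be checked against Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt body from the listed rules; the exact text of the live file is not shown in this report, so the reconstruction is an assumption, as is placing `Crawl-delay` inside the `*` group (the scan lists it separately under Other Records). The user agent `ExampleBot` and the tested paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the scan's Groups and
# Other Records tables; the real file may differ in order or comments.
ROBOTS_TXT = """\
User-agent: *
Disallow: /site/
Disallow: /rental/
Disallow: /user/
Disallow: /mailings/
Disallow: /contacts/
Disallow: /cms/
Disallow: /index.php
Crawl-delay: 3
"""

BASE = "https://www.harlingercourant.nl"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Hypothetical paths: the first two fall under Disallow prefixes,
# the third matches no rule and is therefore allowed.
blocked_cms = parser.can_fetch("ExampleBot", BASE + "/cms/login")
blocked_index = parser.can_fetch("ExampleBot", BASE + "/index.php?id=1")
allowed_other = parser.can_fetch("ExampleBot", BASE + "/nieuws/")

# Seconds a polite crawler should wait between requests.
delay = parser.crawl_delay("ExampleBot")

print(blocked_cms, blocked_index, allowed_other, delay)
```

Matching here is by simple path prefix, so `/index.php?id=1` is blocked by the `/index.php` rule, while the site root and any path outside the seven listed prefixes remain fetchable.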