blijhuis.nl
robots.txt
Robots Exclusion Standard data for blijhuis.nl
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | blijhuis.nl |
Base Domain | blijhuis.nl |
Scan Status | Ok |
Last Scan | 2024-11-06T17:56:33+00:00 |
Next Scan | 2024-11-20T17:56:33+00:00 |
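
The two timestamps above imply a rescan interval. A minimal sketch of that arithmetic, assuming both values are ISO 8601 timestamps in UTC as printed (the variable names are illustrative only):

```python
from datetime import datetime

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2024-11-06T17:56:33+00:00")
next_scan = datetime.fromisoformat("2024-11-20T17:56:33+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)  # 14
```

On these values the scans are scheduled 14 days apart. Whether that interval is fixed or adaptive is not stated in the record.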
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-06T17:56:33+00:00 |
URL | https://www.blijhuis.nl/robots.txt |
Domain IPs | 194.109.157.64 |
Response IP | 194.109.157.64 |
Found | Yes |
Hash | f936d6ea51029959353b87c0d4aca077eba7dc3224ba370179502e4d8ef05e38 |
SimHash | 6858c042c5c1 |
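
A content hash like the one above is typically used to detect whether the file changed between scans. The record does not name the algorithm; the 64 hexadecimal characters are consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. The helper names `sha256_hex` and `has_changed` are hypothetical, not part of any scanner API.

```python
import hashlib

# Hash value copied from the Last Scan table.
RECORDED_HASH = "f936d6ea51029959353b87c0d4aca077eba7dc3224ba370179502e4d8ef05e38"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body's digest differs from the recorded hash.

    Assumes the recorded hash is SHA-256 of the unmodified response body.
    """
    return sha256_hex(body) != recorded
```

Passing the bytes of a freshly fetched robots.txt to `has_changed` would indicate a change only if the assumption about the algorithm and input holds.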
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /tag- |
Disallow | /aantal- |
Disallow | /servlets/websitestatistics |
Disallow | /servlets/websiteobjectstatistics |
Disallow | /servlets/websitedurationstats |
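
To see how these rules apply to concrete URLs, here is a minimal sketch using Python's standard `urllib.robotparser`. The rule lines are reconstructed from the table above, which is an assumption: the scan lists parsed rules, not the raw file. The agent name `ExampleBot` and the helper `is_allowed` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table; the original file's exact text is not shown.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /tag-",
    "Disallow: /aantal-",
    "Disallow: /servlets/websitestatistics",
    "Disallow: /servlets/websiteobjectstatistics",
    "Disallow: /servlets/websitedurationstats",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Check a site-relative path against the reconstructed rules."""
    return parser.can_fetch(agent, "https://www.blijhuis.nl" + path)


print(is_allowed("/"))            # True
print(is_allowed("/tag-nieuws"))  # False
print(is_allowed("/tags"))        # True
```

The rules are prefix matches, so `/tag-nieuws` is blocked while `/tags` is not, because it does not start with `/tag-`.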
Other Records
Field | Value |
---|---|
crawl-delay | 30 |
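
`crawl-delay` is not defined in RFC 9309, and crawlers differ in whether and how they honor it; it is commonly read as the number of seconds to wait between requests. The sketch below assumes that reading and assumes the directive belongs to the `*` group, which the scan does not confirm since it lists the field separately. The agent name `ExampleBot` and the helper `schedule` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed placement: crawl-delay inside the "*" group.
ROBOTS_LINES = [
    "User-agent: *",
    "Crawl-delay: 30",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Delay that applies to an arbitrary agent under the assumption above.
delay = parser.crawl_delay("ExampleBot")


def schedule(paths, delay_seconds, start=0.0):
    """Return (offset_seconds, path) pairs spaced delay_seconds apart."""
    return [(start + i * delay_seconds, p) for i, p in enumerate(paths)]


print(delay)                                # 30
print(schedule(["/a", "/b", "/c"], delay))  # [(0.0, '/a'), (30.0, '/b'), (60.0, '/c')]
```

At this rate a crawler that respects the value would make at most two requests per minute to the site.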