huizenaanhetjanskerkhof.nl
robots.txt
Robots Exclusion Standard data for huizenaanhetjanskerkhof.nl
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | huizenaanhetjanskerkhof.nl |
Base Domain | huizenaanhetjanskerkhof.nl |
Scan Status | Ok |
Last Scan | 2024-11-12T02:25:19+00:00 |
Next Scan | 2024-11-19T02:25:19+00:00 |
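The two timestamps above are exactly seven days apart. A minimal sketch of that arithmetic follows, assuming a fixed weekly rescan interval; the interval is inferred only from these two values, and the name `RESCAN_INTERVAL` is hypothetical.

```python
from datetime import datetime, timedelta

# Timestamp copied from the "Last Scan" row above.
last_scan = datetime.fromisoformat("2024-11-12T02:25:19+00:00")

# Assumption: a fixed seven-day rescan interval, inferred only from the
# two timestamps in this report (not a documented scanner setting).
RESCAN_INTERVAL = timedelta(days=7)

next_scan = last_scan + RESCAN_INTERVAL
print(next_scan.isoformat())  # 2024-11-19T02:25:19+00:00
```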
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-12T02:25:19+00:00 |
URL | https://huizenaanhetjanskerkhof.nl/robots.txt |
Domain IPs | 2a01:238:20a:202:1161::, 81.169.145.161 |
Response IP | 81.169.145.161 |
Found | Yes |
Hash | 7eec23ef584a967008c78f5a4cae37b1b5e303e0af390cb433c77bd08c2aa44a |
SimHash | 78d15b52e6b0 |
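The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The sketch below shows how such a digest could be computed from a fetched robots.txt body. Both the choice of SHA-256 and the assumption that the raw response bytes are hashed are guesses; the report does not state the algorithm or its input. The function name `body_digest` is hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body.

    Assumption: the scanner hashes the raw bytes of robots.txt.
    The report itself does not confirm the algorithm or the input.
    """
    return hashlib.sha256(body).hexdigest()


# Comparing digests from two scans is a cheap way to detect a change.
previous = body_digest(b"User-agent: *\nDisallow: /cgi-bin\n")
current = body_digest(b"User-agent: *\nDisallow: /cgi-bin\nDisallow: /tmp\n")
changed = previous != current
```

An exact digest only answers whether the file changed at all. Judging how much it changed would need a similarity measure, which is presumably the role of the SimHash field.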
Groups
mozilla/5.0 (compatible; vagabondo/kliksafe; webcrawler at wise-guys dot nl; http://webagent.wise-guys.nl/)
Rule | Path |
---|---|
Disallow | / |
mozilla/4.0 (compatible; vagabondo/4.0; webcrawler at wise-guys dot nl; http://webagent.wise-guys.nl/; http://www.wise-guys.nl/)
Rule | Path |
---|---|
Disallow | / |
*
Rule | Path |
---|---|
Disallow | /.htaccess |
Disallow | /cgi-bin |
Disallow | /cgi-data |
Disallow | /*.inc$ |
Disallow | /trackback/ |
Disallow | /wp-login.php |
Disallow | /wp-signup.php |
Disallow | /*.pdf$ |
Disallow | *?replytocom |
Disallow | /.well-known |
Disallow | /?wordfence_logHuman=1* |
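Several paths in the `*` group use the `*` wildcard and the `$` end anchor. The sketch below shows one way to test a URL path against these patterns. It is a simplified illustration, not a complete robots.txt evaluator: it ignores Allow rules and longest-match precedence (the group contains only Disallow rules) and skips percent-encoding normalisation. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical.

```python
import re

# Disallow patterns copied from the "*" group above.
DISALLOW = [
    "/.htaccess",
    "/cgi-bin",
    "/cgi-data",
    "/*.inc$",
    "/trackback/",
    "/wp-login.php",
    "/wp-signup.php",
    "/*.pdf$",
    "*?replytocom",
    "/.well-known",
    "/?wordfence_logHuman=1*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regular expression.

    '*' matches any run of characters and a trailing '$' anchors the
    end of the URL. Everything else is literal and matched as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches the path plus query."""
    # re.match anchors at the start, which gives prefix semantics.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


print(is_disallowed("/wp-login.php"))               # True
print(is_disallowed("/docs/report.pdf"))            # True
print(is_disallowed("/docs/report.pdf?x=1"))        # False: '$' requires the URL to end in .pdf
print(is_disallowed("/blog/post/?replytocom=12"))   # True
print(is_disallowed("/about/"))                     # False
```

Python's standard `urllib.robotparser` does simple prefix matching and does not interpret `*` or `$` inside paths, which is why the patterns are translated by hand here.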
Other Records
Field | Value |
---|---|
sitemap | https://www.huizenaanhetjanskerkhof.nl/sitemap_index.xml |
Warnings
- 12 invalid lines.