staten-generaal.nl
robots.txt
Robots Exclusion Standard data for staten-generaal.nl
Resource Scan
Scan Details
Site Domain | staten-generaal.nl |
Base Domain | staten-generaal.nl |
Scan Status | Ok |
Last Scan | 2024-09-26T06:09:57+00:00 |
Next Scan | 2024-10-10T06:09:57+00:00 |
Last Scan
Scanned | 2024-09-26T06:09:57+00:00 |
URL | https://www.staten-generaal.nl/robots.txt |
Domain IPs | 2a0a:9fc0:f::2, 87.237.96.26 |
Response IP | 87.237.96.26 |
Found | Yes |
Hash | 07b841ffb483ca8af90d33f975d51306e48286aa4bb16e82c280850737047f3e |
SimHash | b0099de44a92 |
Groups
*
Rule | Path |
---|---|
Disallow | /*layout%3Dprint |
Disallow | /*%26%2338%3B |
Disallow | /*%26start_*%26start_ |
Disallow | /reageer_op_pagina |
Disallow | /*/reageer_op_pagina |
Disallow | /zoeken |
Disallow | /*/zoeken |
Disallow | /eu/reageer_op_pagina |
Disallow | /eu/zoeken |
Disallow | /mobiel/reageer_op_pagina |
Disallow | /mobiel/zoeken |
Disallow | /eumobiel/reageer_op_pagina |
Disallow | /eumobiel/zoeken |
Disallow | /*%26sort |