pdz.nl
robots.txt

Robots Exclusion Standard data for pdz.nl

Resource Scan

Scan Details

Site Domain pdz.nl
Base Domain pdz.nl
Scan Status Ok
Last Scan2024-11-07T03:30:05+00:00
Next Scan 2024-11-21T03:30:05+00:00

Last Scan

Scanned2024-11-07T03:30:05+00:00
URL https://pdz.nl/robots.txt
Domain IPs 162.159.140.127
Response IP 162.159.140.127
Found Yes
Hash 18df56a4f13995a149d142ecf50b972a610be8f85f815e977359aff95cf2e9c7
SimHash 63086a67af94

Groups

*

Rule Path
Disallow /overig/instellingen/generiek
Disallow /overig/instellingen/vacaturebank
Disallow /overig/extra-vacaturebank-informatie/extra-consultant-informatie/lynn-aarnink
Disallow /overig/extra-vacaturebank-informatie/extra-consultant-informatie/systeem
Disallow /aspnet_client/
Disallow /bin/
Disallow /config/
Disallow /data/
Disallow /install/
Disallow /macroScripts/
Disallow /masterpages/
Disallow /umbraco/
Disallow /umbraco_client/
Disallow /usercontrols/
Disallow /xslt/
Disallow /*?*
Allow /*?from=
Allow /*?page=
Allow /*?v=
Allow /*?currentpage=
Allow /media/*

Other Records

Field Value
sitemap https://1fe2488e-fb5b-455c-8b6b-c4204c54fcfa.azurewebsites.net/sitemap.xml