xerox.nl
robots.txt
Robots Exclusion Standard data for xerox.nl
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | xerox.nl |
Base Domain | xerox.nl |
Scan Status | Ok |
Last Scan | 2024-09-20T03:56:35+00:00 |
Next Scan | 2024-10-20T03:56:35+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-20T03:56:35+00:00 |
URL | https://www.xerox.nl/robots.txt |
Domain IPs | 2600:1413:b000:6::17d5:2bc8, 2600:1413:b000:6::17d5:2bcb, 96.17.96.30, 96.17.96.8 |
Response IP | 23.32.29.107 |
Found | Yes |
Hash | 5505248a85d1b96b709819946492cbea65ef70fc5d290a942a823fc49e796285 |
SimHash | 81050bc72374 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /PSG/ |
Disallow | /psg/ |
Disallow | /XGS/ |
Disallow | /xgs/ |
Disallow | /corp/ |
Disallow | /CORP/ |
Disallow | /XOG/ |
Disallow | /xog/ |
Disallow | /supl/ |
Disallow | /SUPL/ |
Disallow | /nl-nl/search |
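
The rules above can be checked against sample paths with Python's standard `urllib.robotparser`. This is a minimal sketch: the `ROBOTS_TXT` text is reconstructed from the table rather than copied from the live file, and the `ExampleBot` agent name, the `is_allowed` helper, and the sample paths are hypothetical. It illustrates that matching is a case-sensitive prefix test, which is presumably why each directory is listed in both upper and lower case.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table; the exact layout of the original
# robots.txt file is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /PSG/
Disallow: /psg/
Disallow: /XGS/
Disallow: /xgs/
Disallow: /corp/
Disallow: /CORP/
Disallow: /XOG/
Disallow: /xog/
Disallow: /supl/
Disallow: /SUPL/
Disallow: /nl-nl/search
"""

# Parse the rules from text instead of fetching them over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if `agent` may fetch `path` on www.xerox.nl (hypothetical helper)."""
    return parser.can_fetch(agent, "https://www.xerox.nl" + path)


if __name__ == "__main__":
    # Sample paths are illustrative only, not taken from the site.
    for sample in ("/nl-nl/", "/psg/doc", "/Psg/doc", "/nl-nl/search?q=toner"):
        print(sample, is_allowed(sample))
```

Because only the all-uppercase and all-lowercase spellings are listed, a mixed-case path such as `/Psg/doc` is not matched by any rule under this parser.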