xerox.de
robots.txt
Robots Exclusion Standard data for xerox.de
Resource Scan
Scan Details
Site Domain | xerox.de |
Base Domain | xerox.de |
Scan Status | Ok |
Last Scan | 2024-05-31T03:47:34+00:00 |
Next Scan | 2024-06-30T03:47:34+00:00 |
Last Scan
Scanned | 2024-05-31T03:47:34+00:00 |
URL | https://xerox.de/robots.txt |
Redirect | https://www.xerox.de/robots.txt |
Redirect Domain | www.xerox.de |
Redirect Base | xerox.de |
Domain IPs | 13.13.57.48, 13.8.57.48 |
Redirect IPs | 23.44.5.42, 23.44.5.51, 2600:1413:5000:12::1737:27e9, 2600:1413:5000:12::1737:27f7 |
Response IP | 23.44.4.139 |
Found | Yes |
Hash | 52ce6afd833385e8bcdcfe57101fb736397f5fea08043bcccae567764e8eb8d9 |
SimHash | 010129070774 |
Groups
*
Rule | Path |
---|---|
Disallow | /PSG/ |
Disallow | /psg/ |
Disallow | /XGS/ |
Disallow | /xgs/ |
Disallow | /corp/ |
Disallow | /CORP/ |
Disallow | /XOG/ |
Disallow | /xog/ |
Disallow | /supl/ |
Disallow | /SUPL/ |
Disallow | /de-de/search |
Comments