randomhouse.de
robots.txt
Robots Exclusion Standard data for randomhouse.de
Resource Scan
Scan Details
| Site Domain | randomhouse.de |
| Base Domain | randomhouse.de |
| Scan Status | Ok |
| Last Scan | 2025-12-13T21:28:38+00:00 |
| Next Scan | 2026-01-12T21:28:38+00:00 |
Last Scan
| Scanned | 2025-12-13T21:28:38+00:00 |
| URL | https://randomhouse.de/robots.txt |
| Redirect | https://www.penguin.de/robots.txt |
| Redirect Domain | www.penguin.de |
| Redirect Base | penguin.de |
| Domain IPs | 128.65.213.77 |
| Redirect IPs | 2001:4d50:f012:296::90, 2001:4d50:f016:637::130, 85.131.129.90, 85.131.131.130 |
| Response IP | 85.131.137.3 |
| Found | Yes |
| Hash | 6742e77ed8b14544ab4891d9479a775e2d7f0b4c9e6921832301bca327bba2d3 |
| SimHash | 7d173b61ae51 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /service/ |
| Allow | /service/search/ |
| Disallow | /mein-buchentdecker |
| Disallow | /suche |
| Disallow | /produktseite |
| Disallow | /*.epub$ |
| Disallow | *%7Bsearch_term_string%7D |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.penguin.de/service-sitemap-abffe57734fepngprh-sitemap_index.xml |