print.de
robots.txt

Robots Exclusion Standard data for print.de

Resource Scan

Scan Details

Site Domain print.de
Base Domain print.de
Scan Status Ok
Last Scan2025-09-02T01:39:06+00:00
Next Scan 2025-09-09T01:39:06+00:00

Last Scan

Scanned2025-09-02T01:39:06+00:00
URL https://print.de/robots.txt
Redirect https://www.print.de/robots.txt
Redirect Domain www.print.de
Redirect Base print.de
Domain IPs 157.230.77.103
Redirect IPs 157.230.77.103
Response IP 157.230.77.103
Found Yes
Hash 05a9d1ecbe604add73015fff8f2f890ac31779af23ccedd7ffe5b63443cf4a0f
SimHash 4910444163b3

Groups

*

Rule Path
Disallow /wp/wp-admin/
Allow /wp/wp-admin/admin-ajax.php

*

Rule Path
Disallow /heftarchiv/?article_search=
Disallow /?suche=
Disallow /?search=

Other Records

Field Value
sitemap https://www.print.de/sitemap.xml
sitemap https://www.print.de/sitemap-news.xml