penguinrandomhouse.com
robots.txt

Robots Exclusion Standard data for penguinrandomhouse.com

Resource Scan

Scan Details

Site Domain penguinrandomhouse.com
Base Domain penguinrandomhouse.com
Scan Status Ok
Last Scan 2024-05-01T23:50:41+00:00
Next Scan 2024-05-31T23:50:41+00:00

Last Scan

Scanned 2024-05-01T23:50:41+00:00
URL https://penguinrandomhouse.com/robots.txt
Redirect https://www.penguinrandomhouse.com/robots.txt
Redirect Domain www.penguinrandomhouse.com
Redirect Base penguinrandomhouse.com
Domain IPs 170.171.208.137
Redirect IPs 13.226.2.104, 13.226.2.64, 13.226.2.76, 13.226.2.93
Response IP 18.165.171.24
Found Yes
Hash 8a2381e3080a76b1b31efbb0385323a31dd6927fa39dbc7d6472ddf8537985a5
SimHash d821c8426f93

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Disallow /trackback/
Disallow /search/
Disallow /prh-internal-news/
Disallow /interactive/reading-preference
Allow /wp-admin/admin-ajax.php
Allow /wp-includes/css/dist/block-library/style.min.css
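
The Allow entries above sit inside paths that are otherwise disallowed, so the outcome for a given URL depends on which rule takes precedence. A minimal sketch of how a crawler might evaluate this group, assuming the longest-match precedence described in RFC 9309 (the most specific rule wins, and Allow wins a tie). The names RULES and is_allowed are hypothetical, and the matching is plain prefix comparison because none of the listed rules use the * or $ wildcards.

```python
# Rules copied from the "*" group listed above, in the same order.
RULES = [
    ("Disallow", "/wp-admin/"),
    ("Disallow", "/wp-includes/"),
    ("Disallow", "/wp-content/plugins/"),
    ("Disallow", "/wp-content/cache/"),
    ("Disallow", "/trackback/"),
    ("Disallow", "/search/"),
    ("Disallow", "/prh-internal-news/"),
    ("Disallow", "/interactive/reading-preference"),
    ("Allow", "/wp-admin/admin-ajax.php"),
    ("Allow", "/wp-includes/css/dist/block-library/style.min.css"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: prefix matching only; wildcard handling is not implemented.
    """
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        allow = kind == "Allow"
        # A longer matching rule wins; on equal length, Allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len = len(prefix)
            best_allow = allow
    return best_allow


if __name__ == "__main__":
    for p in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php", "/books/"):
        print(p, is_allowed(p))
```

Under this reading, /wp-admin/admin-ajax.php is crawlable even though /wp-admin/ is disallowed, because the Allow rule is the longer match. Parsers that apply the first matching rule in file order instead would reach the opposite result for that path, since the Disallow line comes first.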