panmacmillan.com
robots.txt

Robots Exclusion Standard data for panmacmillan.com

Resource Scan

Scan Details

Site Domain panmacmillan.com
Base Domain panmacmillan.com
Scan Status Ok
Last Scan2024-05-22T09:06:31+00:00
Next Scan 2024-06-21T09:06:31+00:00

Last Scan

Scanned2024-05-22T09:06:31+00:00
URL https://panmacmillan.com/robots.txt
Redirect https://www.panmacmillan.com/robots.txt
Redirect Domain www.panmacmillan.com
Redirect Base panmacmillan.com
Domain IPs 15.197.167.90
Redirect IPs 13.215.31.72, 175.41.180.202, 2406:da12:53f:c100::1f4, 2406:da12:53f:c101::1f4
Response IP 13.215.31.72
Found Yes
Hash e75f93da37ff7be34f0516a29d408e57c0b5a1cb1688969df55dd4bb1a69a49e
SimHash 61148d606fb2

Groups

*

Rule Path
Disallow /preview
Disallow /search

Other Records

Field Value
sitemap https://www.panmacmillan.com/sitemap-index.xml

Warnings

  • `host` is not a known field.