panmacmillan.com
robots.txt
Robots Exclusion Standard data for panmacmillan.com
Resource Scan
Scan Details
Site Domain | panmacmillan.com |
Base Domain | panmacmillan.com |
Scan Status | Ok |
Last Scan | 2024-05-22T09:06:31+00:00 |
Next Scan | 2024-06-21T09:06:31+00:00 |
Last Scan
Scanned | 2024-05-22T09:06:31+00:00 |
URL | https://panmacmillan.com/robots.txt |
Redirect | https://www.panmacmillan.com/robots.txt |
Redirect Domain | www.panmacmillan.com |
Redirect Base | panmacmillan.com |
Domain IPs | 15.197.167.90 |
Redirect IPs | 13.215.31.72, 175.41.180.202, 2406:da12:53f:c100::1f4, 2406:da12:53f:c101::1f4 |
Response IP | 13.215.31.72 |
Found | Yes |
Hash | e75f93da37ff7be34f0516a29d408e57c0b5a1cb1688969df55dd4bb1a69a49e |
SimHash | 61148d606fb2 |
Other Records
Field | Value |
---|---|
sitemap | https://www.panmacmillan.com/sitemap-index.xml |
Warnings
- `host` is not a known field.