us.macmillan.com
robots.txt

Robots Exclusion Standard data for us.macmillan.com

Resource Scan

Scan Details

Site Domain us.macmillan.com
Base Domain macmillan.com
Scan Status Ok
Last Scan2024-05-23T11:06:37+00:00
Next Scan 2024-06-22T11:06:37+00:00

Last Scan

Scanned2024-05-23T11:06:37+00:00
URL https://us.macmillan.com/robots.txt
Domain IPs 104.22.12.91, 104.22.13.91, 172.67.12.77
Response IP 104.22.12.91
Found Yes
Hash 63cd1d552ef5137168984f8ece621d842b4fb10b8ae5bd9f05ca0a45ec858c5b
SimHash a014d820e153

Groups

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /