cm.macmillan.org.uk
robots.txt

Robots Exclusion Standard data for cm.macmillan.org.uk

Resource Scan

Scan Details

Site Domain cm.macmillan.org.uk
Base Domain macmillan.org.uk
Scan Status Ok
Last Scan2024-11-11T13:53:10+00:00
Next Scan 2024-11-25T13:53:10+00:00

Last Scan

Scanned2024-11-11T13:53:10+00:00
URL https://cm.macmillan.org.uk/robots.txt
Domain IPs 18.155.68.44, 18.155.68.69, 18.155.68.80, 18.155.68.89, 2600:9000:23d2:1200:1c:a1c2:e440:93a1, 2600:9000:23d2:1e00:1c:a1c2:e440:93a1, 2600:9000:23d2:3a00:1c:a1c2:e440:93a1, 2600:9000:23d2:4a00:1c:a1c2:e440:93a1, 2600:9000:23d2:8400:1c:a1c2:e440:93a1, 2600:9000:23d2:9400:1c:a1c2:e440:93a1, 2600:9000:23d2:9800:1c:a1c2:e440:93a1, 2600:9000:23d2:cc00:1c:a1c2:e440:93a1
Response IP 18.155.68.89
Found Yes
Hash 72549c47e02b042b7aa6a00eddb02b896102d3d0aa4377c57a88f7f5bee68334
SimHash 41448a63c7b1

Groups

*

Rule Path
Allow /
Disallow /preview

Other Records

Field Value
sitemap https://cm.macmillan.org.uk/sitemap.xml

Warnings

  • `host` is not a known field.