Robots Exclusion Standard data for marcus.org
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | marcus.org |
Base Domain | marcus.org |
Scan Status | Ok |
Last Scan | 2025-08-23T23:19:10+00:00 |
Next Scan | 2025-09-22T23:19:10+00:00 |
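The two timestamps above imply a rescan interval, which can be checked with a short sketch. The variable names are illustrative and not part of the report; only the two timestamp strings come from the table.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2025-08-23T23:19:10+00:00")
next_scan = datetime.fromisoformat("2025-09-22T23:19:10+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)
```

For these values the interval is exactly 30 days. Whether the scanner always uses a fixed 30-day cycle is not stated in the report.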
Last Scan
Field | Value |
---|---|
Scanned | 2025-08-23T23:19:10+00:00 |
URL | https://www.marcus.org/robots.txt |
Domain IPs | 23.54.118.35, 23.54.118.36, 2600:1413:5000:12::1737:27ef, 2600:1413:5000:12::1737:27f9 |
Response IP | 23.45.207.80 |
Found | Yes |
Hash | ac13ebe525fdc8abeae39da5a81217f3262f93e64e3d102ef0f7b4cdedbecea1 |
SimHash | c4009cc2a6b2 |
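The Hash value is 64 hexadecimal characters, which is consistent with a SHA-256 digest. The report does not say which algorithm or which bytes were hashed, so the sketch below assumes SHA-256 over the raw response body. The helper names `body_digest` and `matches_report` are hypothetical.

```python
import hashlib

# Hash value copied from the Last Scan table.
REPORTED_HASH = "ac13ebe525fdc8abeae39da5a81217f3262f93e64e3d102ef0f7b4cdedbecea1"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """Check a fetched robots.txt body against the reported hash."""
    return body_digest(body) == REPORTED_HASH
```

A caller would pass the exact bytes returned for https://www.marcus.org/robots.txt. A mismatch could mean that the file changed since the scan or that the assumption about the algorithm is wrong.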
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /*.ashx$ |
Disallow | /sitecore/content/ |
Disallow | /Child-Health-Glossary/ |
Disallow | /donors-and-volunteers/test-acc-page |
Disallow | /Support-Childrens/More-Ways-to-Give/Toy-and-In-Kind-Donations |
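The first rule uses the `*` wildcard and the `$` end anchor, which plain prefix matching does not handle. The following is a minimal sketch of how a crawler might evaluate the five Disallow paths above, assuming the common interpretation in which `*` matches any run of characters and a trailing `$` pins the match to the end of the URL path. The function names are illustrative, and because the group has no Allow rules, no precedence logic is included.

```python
import re

# Disallow paths copied from the group above.
DISALLOW = [
    "/*.ashx$",
    "/sitecore/content/",
    "/Child-Health-Glossary/",
    "/donors-and-volunteers/test-acc-page",
    "/Support-Childrens/More-Ways-to-Give/Toy-and-In-Kind-Donations",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    # Without "$" the pattern is a prefix match; with it, the path must end here.
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


print(is_disallowed("/handlers/file.ashx"))
print(is_disallowed("/handlers/file.ashx?x=1"))
print(is_disallowed("/sitecore/content/home"))
```

Under these assumptions the first and third calls report a blocked path, while the second does not, because the query string follows `.ashx` and the `$` anchor requires the path to end there. Matching is case-sensitive, so `/child-health-glossary/` would not be blocked by the `/Child-Health-Glossary/` rule.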
Other Records
Field | Value |
---|---|
sitemap | https://www.choa.org/childrens_sitemap.xml |
sitemap | https://www.choa.org/give/foundation_sitemap.xml |
sitemap | https://www.marcus.org/marcus_sitemap.xml |
sitemap | https://www.strong4life.com/sitemap.xml |
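The sitemap records point at three different hosts, only one of which is the scanned domain. As a sketch, the file can be approximated from the records in this report and parsed with Python's standard `urllib.robotparser`. The reconstructed text is an assumption: the original casing, ordering, and any comments are not shown in the report.

```python
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser

# Approximate reconstruction of the file from the records in this report.
ROBOTS_TXT = """\
User-agent: *
Disallow: /*.ashx$
Disallow: /sitecore/content/
Disallow: /Child-Health-Glossary/
Disallow: /donors-and-volunteers/test-acc-page
Disallow: /Support-Childrens/More-Ways-to-Give/Toy-and-In-Kind-Donations
Sitemap: https://www.choa.org/childrens_sitemap.xml
Sitemap: https://www.choa.org/give/foundation_sitemap.xml
Sitemap: https://www.marcus.org/marcus_sitemap.xml
Sitemap: https://www.strong4life.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() (Python 3.8+) returns the Sitemap URLs, or None if there are none.
sitemaps = parser.site_maps() or []

# Collect the distinct hosts the sitemaps live on.
hosts = sorted({urlparse(url).netloc for url in sitemaps})
print(hosts)
```

Note that `urllib.robotparser` does not interpret the `*` and `$` pattern characters, so it is used here only to extract the sitemap records, not to evaluate the Disallow rules.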