acacialearning.com
robots.txt

Robots Exclusion Standard data for acacialearning.com

Resource Scan

Scan Details

Site Domain acacialearning.com
Base Domain acacialearning.com
Scan Status Ok
Last Scan 2025-07-03T09:32:21+00:00
Next Scan 2025-08-02T09:32:21+00:00

Last Scan

Scanned 2025-07-03T09:32:21+00:00
URL https://acacialearning.com/robots.txt
Domain IPs 20.90.134.17
Response IP 20.90.134.17
Found Yes
Hash 941f49abb75e81592910166164d7cc9ffb6d90701b2b3fac66ab707c85c04fc3
SimHash f10a60719604

Groups

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /
Disallow /aspnet_client/
Disallow /bin/
Disallow /config/
Disallow /data/
Disallow /install/
Disallow /macroScripts/
Disallow /masterpages/
Disallow /umbraco/
Disallow /backoffice/
Disallow /umbraco_client/
Disallow /usercontrols/
Disallow /xslt/
Disallow /contactform/
Disallow /Search/
Disallow /404/
Disallow /checkout/
Disallow /orderconfirmation/
Disallow /site-general-settings/
Disallow /landing-pages/*
Disallow /lp/*
Disallow /thank-you/
Disallow /thank-you-competition/
Disallow /thank-you-for-your-enquiry/
Allow /
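
The groups above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. It assumes the listing maps back to ordinary `User-agent:` / `Disallow:` / `Allow:` lines. The `ROBOTS_LINES` text is a reconstructed subset of the listing, not the site's actual file, and the agent name `examplebot` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed subset of the groups listed in the scan (an assumption,
# not a copy of the real robots.txt served by the site).
ROBOTS_LINES = [
    "User-agent: gptbot",
    "Disallow: /",
    "",
    "User-agent: perplexitybot",
    "Disallow: /",
    "Disallow: /umbraco/",
    "Disallow: /checkout/",
    "Allow: /",
]

BASE = "https://acacialearning.com"


def build_parser(lines):
    """Parse robots.txt lines held in memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


parser = build_parser(ROBOTS_LINES)

# gptbot has a single "Disallow /" rule, so the whole site is off limits.
gptbot_home = parser.can_fetch("gptbot", BASE + "/")

# In the perplexitybot group, "Disallow /" comes first; Python's parser
# applies the first rule that matches, so the trailing "Allow /" is never reached.
perplexity_checkout = parser.can_fetch("perplexitybot", BASE + "/checkout/")

# "examplebot" is a hypothetical agent with no group in this subset and
# no "*" group to fall back on, so the parser grants access.
unlisted_home = parser.can_fetch("examplebot", BASE + "/")

print(gptbot_home, perplexity_checkout, unlisted_home)
```

Python's parser evaluates rules in file order, so other crawlers that use a different precedence scheme (for example, longest-path matching) may resolve the `Disallow /` and `Allow /` pair in the perplexitybot group differently.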

Other Records

Field Value
sitemap https://acacialearning.com/sitemap/
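
The sitemap record can be read with the same standard-library parser. This is a small sketch that assumes the record appears in the file as a `Sitemap:` line and that Python 3.8 or later is available (for `site_maps()`). The `SITEMAP_LINES` list is reconstructed from the scan listing.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "Other Records" listing (an assumption about the
# exact line in the real file).
SITEMAP_LINES = ["Sitemap: https://acacialearning.com/sitemap/"]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES)

# site_maps() returns a list of sitemap URLs, or None when none are declared.
sitemaps = sitemap_parser.site_maps()
print(sitemaps)
```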