acacialearning.com
robots.txt
Robots Exclusion Standard data for acacialearning.com
Resource Scan
Scan Details
Site Domain | acacialearning.com |
Base Domain | acacialearning.com |
Scan Status | Ok |
Last Scan | 2025-07-03T09:32:21+00:00 |
Next Scan | 2025-08-02T09:32:21+00:00 |
Last Scan
Scanned | 2025-07-03T09:32:21+00:00 |
URL | https://acacialearning.com/robots.txt |
Domain IPs | 20.90.134.17 |
Response IP | 20.90.134.17 |
Found | Yes |
Hash | 941f49abb75e81592910166164d7cc9ffb6d90701b2b3fac66ab707c85c04fc3 |
SimHash | f10a60719604 |
Groups
perplexitybot
Rule | Path |
---|---|
Disallow | / |
Disallow | /aspnet_client/ |
Disallow | /bin/ |
Disallow | /config/ |
Disallow | /data/ |
Disallow | /install/ |
Disallow | /macroScripts/ |
Disallow | /masterpages/ |
Disallow | /umbraco/ |
Disallow | /backoffice/ |
Disallow | /umbraco_client/ |
Disallow | /usercontrols/ |
Disallow | /xslt/ |
Disallow | /contactform/ |
Disallow | /Search/ |
Disallow | /404/ |
Disallow | /checkout/ |
Disallow | /orderconfirmation/ |
Disallow | /site-general-settings/ |
Disallow | /landing-pages/* |
Disallow | /lp/* |
Disallow | /thank-you/ |
Disallow | /thank-you-competition/ |
Disallow | /thank-you-for-your-enquiry/ |
Allow | / |
Other Records
Field | Value |
---|---|
sitemap | https://acacialearning.com/sitemap/ |