canibuild.com
robots.txt

Robots Exclusion Standard data for canibuild.com

Resource Scan

Scan Details

Site Domain canibuild.com
Base Domain canibuild.com
Scan Status Ok
Last Scan 2025-11-17T00:07:59+00:00
Next Scan 2025-12-17T00:07:59+00:00

Last Scan

Scanned 2025-11-17T00:07:59+00:00
URL https://canibuild.com/robots.txt
Domain IPs 76.76.21.21
Response IP 76.76.21.21
Found Yes
Hash e364489f863e8cc211046d41a369dfa068c7931e65066ba67383f39fc16975cb
SimHash 5d1c0046e6c1

Groups

*

Rule Path
Disallow /api/
Disallow /admin/
Disallow /cgi-bin/
Disallow /server-scripts/
Disallow /private/
Disallow /*?*preview=true
Disallow /*?_storyblok=*
Disallow /*%26_storyblok%3D*
Disallow /*?_storyblok_release=*
Disallow /search/
Disallow /search?*
Allow /*.js$
Allow /*.css$
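
The rules in the * group combine plain prefixes with the * wildcard and the $ end anchor. As a rough illustration of how a crawler might evaluate them, here is a minimal Python sketch that assumes the longest-match precedence described in RFC 9309: the matching pattern with the most characters wins, and a tie goes to Allow. The names STAR_GROUP, pattern_to_regex and is_allowed are hypothetical and chosen for this example only. Percent-encoding normalization is left out, so this is not a complete matcher.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
STAR_GROUP = [
    ("disallow", "/api/"),
    ("disallow", "/admin/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/server-scripts/"),
    ("disallow", "/private/"),
    ("disallow", "/*?*preview=true"),
    ("disallow", "/*?_storyblok=*"),
    ("disallow", "/*%26_storyblok%3D*"),
    ("disallow", "/*?_storyblok_release=*"),
    ("disallow", "/search/"),
    ("disallow", "/search?*"),
    ("allow", "/*.js$"),
    ("allow", "/*.css$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path, rules=STAR_GROUP):
    """Return True if `path` (including any query string) may be crawled."""
    best = None  # (pattern length, is_allow) of the most specific match so far
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), directive == "allow")
            # Longer pattern wins; on equal length, allow (True) beats disallow.
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]
```

Under these assumptions, is_allowed("/api/users") is False and is_allowed("/static/app.js") is True. A path such as /api/client.js would also come out as allowed in this sketch, because the pattern /*.js$ is longer than /api/. Real crawlers may resolve such overlaps differently.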

oai-searchbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

claudebot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

perplexity-user

Rule Path
Allow /

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

gemini

Rule Path
Allow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

ccbot

Rule Path
Disallow /
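
The named groups above fall into two sets: crawlers given Allow / and crawlers given Disallow /. The following Python sketch shows how a crawler's product token could be mapped to its group, assuming the case-insensitive token matching and the fallback to the * group described in RFC 9309. The names ALLOW_ALL, BLOCK_ALL and group_for are hypothetical, and real crawlers may match tokens less strictly than the exact comparison used here.

```python
# User-agent tokens from the groups above whose only rule is "Allow /".
ALLOW_ALL = {
    "oai-searchbot", "chatgpt-user", "claudebot", "perplexitybot",
    "perplexity-user", "googlebot", "bingbot", "gemini",
}

# User-agent tokens from the groups above whose only rule is "Disallow /".
BLOCK_ALL = {
    "gptbot", "google-extended", "applebot-extended", "anthropic-ai",
    "cohere-ai", "bytespider", "amazonbot", "meta-externalagent", "ccbot",
}


def group_for(product_token):
    """Return which kind of group applies to a crawler's product token."""
    token = product_token.lower()  # group matching is case-insensitive
    if token in ALLOW_ALL:
        return "allow-all"
    if token in BLOCK_ALL:
        return "block-all"
    # Any crawler without a named group falls back to the "*" group.
    return "*"
```

For example, group_for("GPTBot") returns "block-all" and group_for("SomeOtherBot") returns "*". Under RFC 9309 a crawler that matches a named group follows only that group, so the path restrictions in the * group would not apply to the crawlers listed with Allow /.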

Other Records

Field Value
sitemap https://canibuild.com/sitemap.xml
sitemap https://canibuild.com/en-au/sitemap_pages.xml
sitemap https://canibuild.com/en-us/sitemap_pages.xml
sitemap https://canibuild.com/en-nz/sitemap_pages.xml