crayola.ca
robots.txt

Robots Exclusion Standard data for crayola.ca

Resource Scan

Scan Details

Site Domain crayola.ca
Base Domain crayola.ca
Scan Status Ok
Last Scan2025-11-26T10:31:10+00:00
Next Scan 2025-12-03T10:31:10+00:00

Last Scan

Scanned2025-11-26T10:31:10+00:00
URL https://crayola.ca/robots.txt
Redirect https://www.crayola.ca/robots.txt
Redirect Domain www.crayola.ca
Redirect Base crayola.ca
Domain IPs 3.169.71.101, 3.169.71.32, 3.169.71.42, 3.169.71.65
Redirect IPs 3.169.71.101, 3.169.71.32, 3.169.71.42, 3.169.71.65
Response IP 3.169.71.65
Found Yes
Hash 3c3511c67b5db623da828d4f4345d9664259e2cfa764814c4f874edd946e72dd
SimHash 415019763793

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://www.crayola.ca/sitemaps-1-sitemap.xml
sitemap https://www.crayola.ca/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://www.crayola.ca/
  • live - don't allow web crawlers to index cpresources/ or vendor/