simplicate.ca
robots.txt

Robots Exclusion Standard data for simplicate.ca

Resource Scan

Scan Details

Site Domain simplicate.ca
Base Domain simplicate.ca
Scan Status Ok
Last Scan 2025-11-01T13:00:45+00:00
Next Scan 2025-11-08T13:00:45+00:00

Last Scan

Scanned 2025-11-01T13:00:45+00:00
URL https://simplicate.ca/robots.txt
Domain IPs 104.21.18.32, 172.67.179.234, 2606:4700:3032::6815:1220, 2606:4700:3035::ac43:b3ea
Response IP 104.21.18.32
Found Yes
Hash 2beb77c32c1c30fe846de72b2665395466d1ab92ac08e92ecf96388aa00e345f
SimHash 41501d566791

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://simplicate.ca/sitemaps-1-sitemap.xml
sitemap https://simplicate.ca/sitemaps-1-sitemap.xml
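
To illustrate how a crawler might interpret the group and records listed above, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is reconstructed from this scan, so its exact layout is an assumption and may differ from the live file; the user agent name `ExampleBot`, the helper `build_parser`, and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan data above. The live file's exact
# layout (ordering, blank lines, comments) is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cpresources/
Disallow: /vendor/
Disallow: /.env
Disallow: /cache/

Sitemap: https://simplicate.ca/sitemaps-1-sitemap.xml
Sitemap: https://simplicate.ca/sitemaps-1-sitemap.xml
"""

BASE = "https://simplicate.ca"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical sample paths, chosen only to exercise each rule.
for path in ("/", "/about", "/vendor/autoload.php", "/.env", "/cache/page.html"):
    allowed = parser.can_fetch("ExampleBot", BASE + path)
    print(f"{path}: {'allowed' if allowed else 'disallowed'}")

# site_maps() returns the sitemap records in file order (Python 3.8+).
print(parser.site_maps())
```

Note that `urllib.robotparser` matches rules by simple path prefix, so a rule such as `/.env` would also cover any path that merely starts with `/.env`. Other crawlers may apply different matching rules.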

Comments

  • robots.txt for https://simplicate.ca/
  • live - don't allow web crawlers to index cpresources/ or vendor/