siteone.io
robots.txt

Robots Exclusion Standard data for siteone.io

Resource Scan

Scan Details

Site Domain siteone.io
Base Domain siteone.io
Scan Status Ok
Last Scan 2026-02-07T01:45:57+00:00
Next Scan 2026-02-21T01:45:57+00:00

Last Scan

Scanned 2026-02-07T01:45:57+00:00
URL https://siteone.io/robots.txt
Redirect https://www.siteone.io/robots.txt
Redirect Domain www.siteone.io
Redirect Base siteone.io
Domain IPs 104.21.70.207, 172.67.139.96, 2606:4700:3033::ac43:8b60, 2606:4700:3037::6815:46cf
Redirect IPs 104.21.70.207, 172.67.139.96, 2606:4700:3033::ac43:8b60, 2606:4700:3037::6815:46cf
Response IP 172.67.139.96
Found Yes
Hash dbdb99acacf7d52cc70469118bf7dab51007513da7dfa09b00a474520aff9608
SimHash c5641d567f93

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
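
To see how a crawler might apply the `*` group above, the rules can be fed to Python's standard `urllib.robotparser`. This is a minimal sketch: the robots.txt lines are reconstructed from the table rather than copied from the live file, and the `is_allowed` helper and the `ExampleBot` user agent are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups table; the live file's exact layout may differ.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /cpresources/",
    "Disallow: /vendor/",
    "Disallow: /.env",
    "Disallow: /cache/",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def is_allowed(path, agent="ExampleBot"):
    """Return True if `agent` may fetch `path` under the rules above.

    "ExampleBot" is a placeholder; any agent falls under the `*` group here.
    """
    return parser.can_fetch(agent, "https://www.siteone.io" + path)


for path in ("/", "/vendor/autoload.php", "/.env", "/cache/page.html"):
    print(path, is_allowed(path))
```

Note that `urllib.robotparser` matches rules by path prefix, so `Disallow /.env` also covers any path that merely starts with `/.env`.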

Other Records

Field Value
sitemap https://www.siteone.cz/sitemaps-1-sitemap.xml
sitemap https://www.siteone.at/sitemaps-1-sitemap.xml
sitemap https://www.siteone.io/sitemaps-1-sitemap.xml
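
The three sitemap records point at different hosts (`.cz`, `.at`, `.io`), so a consumer may want to group them by host before fetching. A rough sketch, assuming Python 3.8 or later (where `RobotFileParser.site_maps()` is available) and with the `Sitemap:` lines reconstructed from the table above:

```python
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

# Reconstructed from the Other Records table; not copied from the live file.
SITEMAP_LINES = [
    "Sitemap: https://www.siteone.cz/sitemaps-1-sitemap.xml",
    "Sitemap: https://www.siteone.at/sitemaps-1-sitemap.xml",
    "Sitemap: https://www.siteone.io/sitemaps-1-sitemap.xml",
]

parser = RobotFileParser()
parser.parse(SITEMAP_LINES)

# site_maps() returns None when no sitemap records were found.
sitemaps = parser.site_maps() or []
hosts = [urlsplit(url).hostname for url in sitemaps]

for host, url in zip(hosts, sitemaps):
    print(host, url)
```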

Comments

  • robots.txt for https://www.siteone.io/
  • live - don't allow web crawlers to index cpresources/ or vendor/