bocci.com
robots.txt

Robots Exclusion Standard data for bocci.com

Resource Scan

Scan Details

Site Domain bocci.com
Base Domain bocci.com
Scan Status Ok
Last Scan 2025-06-19T15:22:09+00:00
Next Scan 2025-07-19T15:22:09+00:00

Last Scan

Scanned 2025-06-19T15:22:09+00:00
URL https://bocci.com/robots.txt
Domain IPs 104.21.20.175, 172.67.193.68, 2606:4700:3036::6815:14af, 2606:4700:3037::ac43:c144
Response IP 172.67.193.68
Found Yes
Hash 15f2122eca667d8c1b05824e1a88a989c0d9a7dc3f0b9d14672476331db8dff6
SimHash cb301b362fb1
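
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm, so the sketch below only assumes SHA-256 over the raw response body; the function name `content_hash` is hypothetical, and the SimHash value is not reproduced here.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return a hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    The report itself does not state which algorithm it uses.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Placeholder body for illustration, not the real file contents.
    sample = b"User-agent: *\nDisallow: /vendor/\n"
    print(content_hash(sample))
```

Comparing a freshly computed digest with the stored one would be one way to tell whether the file changed between scans, provided the assumption about the algorithm holds.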

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://bocci.com/sitemaps-1-sitemap.xml
sitemap https://omerarbel.com/sitemaps-1-sitemap.xml
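
The group rules and sitemap records above can be checked with Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt from the listed records only; the real file's exact layout and comments are not reproduced, and the agent name `ExampleBot`, the helper `is_allowed`, and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the records in this report, not the verbatim file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cpresources/
Disallow: /vendor/
Disallow: /.env
Disallow: /cache/
Sitemap: https://bocci.com/sitemaps-1-sitemap.xml
Sitemap: https://omerarbel.com/sitemaps-1-sitemap.xml
"""


def build_parser() -> RobotFileParser:
    """Parse the reconstructed rules without any network access."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return build_parser().can_fetch(agent, "https://bocci.com" + path)


if __name__ == "__main__":
    for path in ["/", "/vendor/autoload.php", "/.env", "/cache/page.html"]:
        print(path, is_allowed(path))
    # site_maps() requires Python 3.8 or newer.
    print(build_parser().site_maps())
```

Because the only group is `*`, every crawler gets the same answer. The standard-library parser matches Disallow values as simple path prefixes, so any path that begins with one of the four listed values is treated as blocked.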

Comments

  • robots.txt for https://bocci.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/