dispo.cc
robots.txt

Robots Exclusion Standard data for dispo.cc

Resource Scan

Scan Details

Site Domain dispo.cc
Base Domain dispo.cc
Scan Status Ok
Last Scan2025-11-04T12:15:06+00:00
Next Scan 2025-11-11T12:15:06+00:00

Last Scan

Scanned2025-11-04T12:15:06+00:00
URL https://dispo.cc/robots.txt
Domain IPs 104.21.21.49, 172.67.196.106, 2606:4700:3033::ac43:c46a, 2606:4700:3034::6815:1531
Response IP 172.67.196.106
Found Yes
Hash 70c7fc66d461255d728cd5a63dffaadccf8a1369c41dc4ba7dc54cadbaa2c646
SimHash 68081df22fb0

Groups

amazonbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
Disallow /api$
Disallow /weka/user/loginstate$
Disallow /rss.xml$

Other Records

Field Value
sitemap https://dispo.cc/sitemaps-2-sitemap.xml
sitemap https://dispo.cc/news.xml

Comments

  • robots.txt for https://dispo.cc/
  • live - don't allow web crawlers
  • live - don't allow web crawlers to index cpresources/ or vendor/