mwckigali.com
robots.txt

Robots Exclusion Standard data for mwckigali.com

Resource Scan

Scan Details

Site Domain mwckigali.com
Base Domain mwckigali.com
Scan Status Ok
Last Scan 2025-11-09T20:09:58+00:00
Next Scan 2025-12-09T20:09:58+00:00

Last Scan

Scanned 2025-11-09T20:09:58+00:00
URL https://www.mwckigali.com/robots.txt
Domain IPs 104.18.28.29, 104.18.29.29, 2606:4700::6812:1c1d, 2606:4700::6812:1d1d
Response IP 104.18.28.29
Found Yes
Hash d776b19cafef26a678e83cd3d2ee0897d9c14bba7a3a78a4dd10cb3cc7e7c498
SimHash 43281d722593
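
The Hash value above is 64 hexadecimal digits long, which is the length of a SHA-256 digest. The scan does not say which algorithm or which bytes were hashed, so the sketch below only assumes SHA-256 over the raw response body; `body_digest` and `matches_scan` are hypothetical helper names, not part of any scanner.

```python
import hashlib

# Hash value reported by the scan above.
REPORTED_HASH = "d776b19cafef26a678e83cd3d2ee0897d9c14bba7a3a78a4dd10cb3cc7e7c498"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body."""
    return hashlib.sha256(body).hexdigest()


def matches_scan(body: bytes) -> bool:
    """Compare a fetched body against the reported hash (assumed to be SHA-256)."""
    return body_digest(body) == REPORTED_HASH
```

If the assumption holds, `matches_scan` would return True only for a body that is byte-for-byte identical to the one scanned on 2025-11-09.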

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
Disallow /*.apk
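
The rules above can be checked against a URL path with a small matcher. This is a minimal sketch, assuming the wildcard semantics described in RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end, and rules otherwise match as path prefixes). The names `rule_to_regex` and `is_disallowed` are hypothetical. Because this group has no Allow rules, any matching Disallow rule is treated as blocking the path.

```python
import re

# Disallow rules for the "*" group, copied from the scan above.
DISALLOW = ["/cpresources/", "/vendor/", "/.env", "/cache/", "/*.apk"]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape each literal piece, then join the pieces with ".*" for every "*".
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """Return True if any Disallow rule matches from the start of the path."""
    # re.match anchors at the beginning, which gives prefix matching.
    return any(rule_to_regex(rule).match(path) for rule in rules)
```

For example, `is_disallowed("/downloads/app.apk")` returns True because of the `/*.apk` rule, while `is_disallowed("/about")` returns False.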

Other Records

Field Value
sitemap https://www.mwckigali.com/sitemaps-4-sitemap.xml

Comments

  • robots.txt for https://www.mwckigali.com/
  • default - don't allow web crawlers to index cpresources/ or vendor/