harrahssocal.com
robots.txt

Robots Exclusion Standard data for harrahssocal.com

Resource Scan

Scan Details

Site Domain harrahssocal.com
Base Domain harrahssocal.com
Scan Status Ok
Last Scan2026-02-13T07:17:11+00:00
Next Scan 2026-02-20T07:17:11+00:00

Last Scan

Scanned2026-02-13T07:17:11+00:00
URL https://harrahssocal.com/robots.txt
Domain IPs 104.21.38.139, 172.67.223.45, 2606:4700:3030::6815:268b, 2606:4700:3037::ac43:df2d
Response IP 172.67.223.45
Found Yes
Hash b7ffeb4f8ec9f0f3866ea2b87f554baa83bcfa082c206bb90bbbae6c44348af0
SimHash 40701d726799

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://harrahssocal.com/sitemaps-1-sitemap.xml
sitemap https://harrahssocal.com/es/sitemaps-1-sitemap.xml
sitemap https://harrahssocal.com/zh-hant/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://harrahssocal.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/