twentythree.com
robots.txt

Robots Exclusion Standard data for twentythree.com

Resource Scan

Scan Details

Site Domain twentythree.com
Base Domain twentythree.com
Scan Status Ok
Last Scan2025-11-03T19:23:43+00:00
Next Scan 2025-11-10T19:23:43+00:00

Last Scan

Scanned2025-11-03T19:23:43+00:00
URL https://twentythree.com/robots.txt
Redirect https://www.twentythree.com/robots.txt
Redirect Domain www.twentythree.com
Redirect Base twentythree.com
Domain IPs 151.101.1.120, 151.101.129.120, 151.101.193.120, 151.101.65.120
Redirect IPs 151.101.1.120, 151.101.129.120, 151.101.193.120, 151.101.65.120
Response IP 151.101.193.120
Found Yes
Hash 47bf24b1fef8fcaba3e885685cb9a73beaae30e847482a8c8ac8da597382fa7b
SimHash 416899623796

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://www.twentythree.com/sitemaps-1-sitemap.xml
sitemap https://www.ranguinc.com/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://www.twentythree.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/