origo.com
robots.txt

Robots Exclusion Standard data for origo.com

Resource Scan

Scan Details

Site Domain origo.com
Base Domain origo.com
Scan Status Ok
Last Scan2026-02-20T01:45:57+00:00
Next Scan 2026-02-27T01:45:57+00:00

Last Scan

Scanned2026-02-20T01:45:57+00:00
URL https://origo.com/robots.txt
Domain IPs 13.135.242.150, 13.41.21.241, 2a05:d01c:9c5:ab00:c1f1:9ce3:c8a9:db71, 2a05:d01c:9c5:ab01:d5d:992f:ae7e:1afb, 2a05:d01c:9c5:ab02:9657:9769:cf2d:fc74, 3.10.254.107
Response IP 13.135.242.150
Found Yes
Hash d32c1fbb3a1bf94fc82f30f0209429b59d31f0bb3752660f79e21f021288edd7
SimHash 41701d563f92

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://origo.com/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://origo.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/