codeocean.com
robots.txt

Robots Exclusion Standard data for codeocean.com

Resource Scan

Scan Details

Site Domain codeocean.com
Base Domain codeocean.com
Scan Status Ok
Last Scan2024-10-05T06:44:41+00:00
Next Scan 2024-11-04T06:44:41+00:00

Last Scan

Scanned2024-10-05T06:44:41+00:00
URL https://codeocean.com/robots.txt
Domain IPs 13.33.30.16, 13.33.30.21, 13.33.30.26, 13.33.30.44
Response IP 13.33.30.21
Found Yes
Hash 19f473077194d7e4d9336ecef6d29b226f1fc3cef801d96912cc984f44ac03d6
SimHash c8d09aa3c994

Groups

*

Rule Path
Disallow /dashboard$
Disallow /portal
Disallow /v1/algorithm/
Disallow /v2/algorithm/
Disallow /test/embedding-options.html
Disallow /recent-capsules

hubspot page fetcher/1.0 http://www.hubspot.com/ web-crawlers@hubspot.com
hubspot url preview/1.0

Rule Path
Allow /

Other Records

Field Value
sitemap https://codeocean.com/sitemap.xml