chalearn.org
robots.txt

Robots Exclusion Standard data for chalearn.org

Resource Scan

Scan Details

Site Domain chalearn.org
Base Domain chalearn.org
Scan Status Ok
Last Scan2025-10-08T00:50:08+00:00
Next Scan 2025-11-07T00:50:08+00:00

Last Scan

Scanned2025-10-08T00:50:08+00:00
URL http://chalearn.org/robots.txt
Redirect http://www.chalearn.org/robots.txt
Redirect Domain www.chalearn.org
Redirect Base chalearn.org
Domain IPs 199.34.228.100
Redirect IPs 199.34.228.100
Response IP 199.34.228.100
Found Yes
Hash 1f5161b7f00a1fa0f2e8f57d08e3baf957bdd191c41e68e6b679f23845dfd327
SimHash 0154dc562793

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/

Other Records

Field Value
sitemap http://www.chalearn.org/sitemap.xml