ioccc.org
robots.txt

Robots Exclusion Standard data for ioccc.org

Resource Scan

Scan Details

Site Domain ioccc.org
Base Domain ioccc.org
Scan Status Ok
Last Scan2024-09-21T23:47:49+00:00
Next Scan 2024-10-21T23:47:49+00:00

Last Scan

Scanned2024-09-21T23:47:49+00:00
URL https://ioccc.org/robots.txt
Redirect https://www.ioccc.org/robots.txt
Redirect Domain www.ioccc.org
Redirect Base ioccc.org
Domain IPs 185.199.108.153, 185.199.109.153, 185.199.110.153, 185.199.111.153, 2606:50c0:8000::153, 2606:50c0:8001::153, 2606:50c0:8002::153, 2606:50c0:8003::153
Redirect IPs 185.199.108.153, 185.199.109.153, 185.199.110.153, 185.199.111.153, 2606:50c0:8000::153, 2606:50c0:8001::153, 2606:50c0:8002::153, 2606:50c0:8003::153
Response IP 185.199.110.153
Found Yes
Hash 30f56e363a66409ef2ec91e5ac6cf190697bd5f1a6b086e15e3da80e3891bfa4
SimHash c01e175dcd5b

Groups

sistrix

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

*

No rules defined. All paths allowed.

Comments

  • If you are an official mirror you know what the user agent field should be!
  • If you want to be an ioccc mirror contact <mirror-request@ioccc.org>. You
  • must include IOCCC 2023 in the subject line or your email will bounce.
  • Added due to slurping up everything as fast as it could
  • I watched this bot for 4 consecutive days, it indexed the whole site 4 times
  • each day (using GETs)
  • Repeated indexing