ageinghacker.net
robots.txt

Robots Exclusion Standard data for ageinghacker.net

Resource Scan

Scan Details

Site Domain ageinghacker.net
Base Domain ageinghacker.net
Scan Status Ok
Last Scan2025-10-10T00:08:14+00:00
Next Scan 2025-10-24T00:08:14+00:00

Last Scan

Scanned2025-10-10T00:08:14+00:00
URL https://ageinghacker.net/robots.txt
Domain IPs 82.221.139.216
Response IP 82.221.139.216
Found Yes
Hash cf80b1ac4b03db8ba78733b88f4a1df32bd55f7c834248b6b431b824b848435b
SimHash be907351defb

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /teaching/old/old-lipn/DLL-2011/rattrapage/corpora/
Disallow /teaching/old/intro-prog-2015/words/
Disallow /teaching/old/intro-prog-2015/words/data/
Disallow /teaching/programming-python/corpora/
Disallow /git
Disallow /git/

the knowledge ai

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mozilla/5.0 (compatible; mj12bot/v1.4.8; http://mj12bot.com/)

Rule Path
Disallow /

ahrefsbot/7.0

Rule Path
Disallow /

Other Records

Field Value
sitemap https://ageinghacker.net/sitemap.xml.gz

Comments

  • Apparently I could add my RSS and Atom feeds as "Sitemap:" here, but my
  • feeds contain a subset of what is already in sitemap.xml
  • This should be enough to avoid indexing mailing list archives.
  • Literary works, off-topic for me. These come from progrmaming projects for
  • my classes.
  • Disallow: /teaching/programming-python/corpora/ancient-greek--phaenomena--utf-8
  • Disallow: /teaching/programming-python/corpora/arabic--one-thousand-and-one-nights--utf-8
  • Disallow: /teaching/programming-python/corpora/bulgarian--something--utf-8
  • Disallow: /teaching/programming-python/corpora/catalan--something--utf-8
  • Disallow: /teaching/programming-python/corpora/chinese--the-advocate--lai-ho--utf-8
  • Disallow: /teaching/programming-python/corpora/czech--something--utf-8
  • Disallow: /teaching/programming-python/corpora/danish--something--utf-8
  • Disallow: /teaching/programming-python/corpora/dutch--something--utf-8
  • Disallow: /teaching/programming-python/corpora/english--narrative-of-a-gordon-pym--poe--utf-8
  • Disallow: /teaching/programming-python/corpora/esperanto--something--utf-8
  • Disallow: /teaching/programming-python/corpora/farsi--something--utf-8
  • Disallow: /teaching/programming-python/corpora/finnish--something--utf-8
  • Disallow: /teaching/programming-python/corpora/french--la-recherche-de-labsolu--balzac--utf-8
  • Disallow: /teaching/programming-python/corpora/german--der-prozess--kafka--utf-8
  • Disallow: /teaching/programming-python/corpora/hebrew--something--utf-8
  • Disallow: /teaching/programming-python/corpora/hungarian--something--utf-8
  • Disallow: /teaching/programming-python/corpora/icelandic--something--utf-8
  • Disallow: /teaching/programming-python/corpora/italian--le-avventure-di-pinocchio--collodi--utf-8
  • Disallow: /teaching/programming-python/corpora/japanese--kairo-ko--soseki--utf-8
  • Disallow: /teaching/programming-python/corpora/korean--burning-mountain--cha-pomsok--utf-8
  • Disallow: /teaching/programming-python/corpora/latin--something--cicero--utf-8
  • Disallow: /teaching/programming-python/corpora/modern-greek--stuff--utf-8
  • Disallow: /teaching/programming-python/corpora/norwegian--something--utf-8
  • Disallow: /teaching/programming-python/corpora/polish--something--utf-8
  • Disallow: /teaching/programming-python/corpora/portuguese--something--utf-8
  • Disallow: /teaching/programming-python/corpora/russian--childhood--tolstoy--utf-8
  • Disallow: /teaching/programming-python/corpora/spanish--el-ingenioso-hidalgo-don-quijote-de-la-mancha--cervantes--utf-8
  • Disallow: /teaching/programming-python/corpora/swedish--something--utf-8
  • Disallow: /teaching/programming-python/corpora/tagalog--something--utf-8
  • Disallow: /teaching/programming-python/corpora/turkish--something--utf-8
  • Disallow: /teaching/programming-python/corpora/urdu--something--utf-8
  • Disallow: /teaching/programming-python/corpora/corpora-version-1.tar.gz
  • Disallow: /teaching/programming-python/corpora/langues.py
  • These URLs are pre-rewriting. Do I need this?
  • Disallow: /lipn-stuff/teaching/DLL-2011/rattrapage/corpora
  • Disallow: /lipn-stuff/teaching/DLL-2011/rattrapage/corpora/
  • Do not scan tag indices: they contain redundant information, if the rest
  • of the site has been crawled.
  • Tentative: wildcards are commonly supported by crawlers, but the
  • specification does not define them.
  • Disallow: /blog/tags
  • Disallow: /blog/tags/
  • Disallow: /blog/tags/*/*
  • Old cgit pages
  • For humans but not interesting to index. Now they have become
  • redirections, but that does not matter.
  • Disallow: /git/*
  • Disallow: /gitstats
  • Disallow: /gitstats/
  • Disallow: /gitstats/*
  • Particularly wasteful bots
  • Bots I just dislike
  • The “Ahrefs online marketing toolset” is something I never want to have
  • anything to do with.