librecmc.org
robots.txt

Robots Exclusion Standard data for librecmc.org

Resource Scan

Scanned	2025-10-07T03:55:28+00:00
URL	https://librecmc.org/robots.txt
Domain IPs	198.140.141.86
Response IP	198.140.141.86
Found	Yes
Hash	dfef6ff7e50d32046ae707376ce0c9696b66d43fa09e1825da92c61079fe6206
SimHash	3e178951c6f5

Rule	Path
Disallow
Disallow	/tarpit

Rule

Path

Disallow

/tarpit

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

The following two lines are saying: all user agents are allowed to crawl all of the site
Another way of saying the same thing could be:
User-agent: *
Allow: /
You don't really need any of these, if you don't put anything the implicit message is the same: allow all crwalers to the full site
The following lines are listing all known (to the date of this file) AI crawlers
The last line is saying: all of the above bots are not allowed to crawl any part of the site

Back to top