/.well-known/

Robots Exclusion Standard data for aar.li

Resource Scan

Scan Details

Site Domain	aar.li
Base Domain	aar.li
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-11-09T20:12:13+00:00
Next Scan	2026-01-08T20:12:13+00:00

Last Successful Scan

Scanned	2025-09-11T05:45:52+00:00
URL	https://aar.li/robots.txt
Domain IPs	104.21.86.178, 172.67.223.77, 2606:4700:3032::ac43:df4d, 2606:4700:3037::6815:56b2
Response IP	172.67.223.77
Found	Yes
Hash	3e89252af5c8ea5f2889cb92865d82967504ac1233db9f2b4b4453e32c8547ff
SimHash	769e4901c1e4

Groups

ai2bot
ai2bot-dolma
amazonbot
anthropic-ai
applebot-extended
brightbot 1.0
bytespider
ccbot
chatgpt-user
claude-web
claudebot
cohere-ai
cohere-training-data-crawler
crawlspace
diffbot
duckassistbot
facebookbot
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
isscyberriskcrawler
kangaroo bot
meta-externalagent
meta-externalfetcher
oai-searchbot
omgili
omgilibot
pangubot
perplexitybot
perplexity-user
petalbot
scrapy
semrushbot-ocob
semrushbot-swa
sidetrade indexer bot
timpibot
velenpublicwebcrawler
webzio-extended
youbot

Rule	Path
Disallow	/
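
The group listing and rule above can be checked programmatically. The following is a minimal sketch, assuming the listed agents share a single group whose only rule is `Disallow: /` (the raw file is not reproduced here, so the lines below are a reconstruction). It uses Python's standard `urllib.robotparser` and includes only three agents from the list. The helper `is_blocked` and the agent name `examplebot` are hypothetical, chosen for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the group: a subset of the listed agents,
# all sharing the single "Disallow: /" rule reported by the scan.
ROBOTS_LINES = [
    "User-agent: gptbot",
    "User-agent: claudebot",
    "User-agent: ccbot",
    "Disallow: /",
]


def is_blocked(agent: str, url: str = "https://aar.li/") -> bool:
    """Return True if the reconstructed rules forbid `agent` from fetching `url`."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)  # parse in-memory lines; no network request
    return not parser.can_fetch(agent, url)


if __name__ == "__main__":
    # "examplebot" is a hypothetical agent that is not in the group.
    for agent in ("GPTBot", "ClaudeBot", "examplebot"):
        print(agent, "blocked" if is_blocked(agent) else "allowed")
```

Because the reconstructed lines contain no `User-agent: *` group, an agent that matches none of the listed names falls through to the parser's default, which is to allow the fetch.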