fed.hafs.in
robots.txt

Robots Exclusion Standard data for fed.hafs.in

Archived Snapshots

Resource Scan

Scan Details

Site Domain	fed.hafs.in
Base Domain	hafs.in
Scan Status	Ok
Last Scan	2026-02-14T05:22:49+00:00
Next Scan	2026-03-16T05:22:49+00:00

Last Scan

Scanned	2026-02-14T05:22:49+00:00
URL	https://fed.hafs.in/robots.txt
Domain IPs	104.21.70.249, 172.67.141.4, 2606:4700:3030::6815:46f9, 2606:4700:3035::ac43:8d04
Response IP	172.67.141.4
Found	Yes
Hash	72b552d481dc930c782493fb4f844ec403b5997d7f1df3d1ef476cd41f2e2b3f
SimHash	366edb51408e

Groups

addsearchbot
ai2bot
ai2bot-deepresearcheval
ai2bot-dolma
aihitbot
amazon-kendra
amazonbot
amazonbuyforme
andibot
anomura
anthropic-ai
applebot
applebot-extended
atlassian-bot
awario
bedrockbot
bigsur.ai
bravebot
brightbot 1.0
buddybot
bytespider
ccbot
channel3bot
chatglm-spider
chatgpt agent
chatgpt-user
claude-searchbot
claude-user
claude-web
claudebot
cloudflare-autorag
cloudvertexbot
cohere-ai
cohere-training-data-crawler
cotoyogi
crawl4ai
crawlspace
datenbank crawler
deepseekbot
devin
diffbot
duckassistbot
echobot bot
echoboxbot
facebookbot
facebookexternalhit
factset_spyderbot
firecrawlagent
friendlycrawler
gemini-deep-research
google-cloudvertexbot
google-extended
google-firebase
google-notebooklm
googleagent-mariner
googleother
googleother-image
googleother-video
gptbot
iaskbot
iaskspider
iaskspider/2.0
iboubot
icc-crawler
imagesiftbot
imagespider
img2dataset
isscyberriskcrawler
kangaroo bot
klaviyoaibot
kunatocrawler
laion-huggingface-processor
laiondownloader
lcc
linerbot
linguee bot
linkupbot
manus-user
meta-externalagent
meta-externalagent
meta-externalfetcher
meta-externalfetcher
meta-webindexer
mistralai-user
mistralai-user/1.0
mycentralaiscraperbot
netestate imprint crawler
notebooklm
novaact
oai-searchbot
omgili
omgilibot
openai
operator
pangubot
panscient
panscient.com
perplexity-user
perplexitybot
petalbot
phindbot
poggio-citations
poseidon research crawler
qualifiedbot
quillbot
quillbot.com
sbintuitionsbot
scrapy
semrushbot-ocob
semrushbot-swa
shapbot
sidetrade indexer bot
spider
terracotta
thinkbot
tiktokspider
timpibot
twinagent
velenpublicwebcrawler
wardbot
webzio-extended
webzio-extended
wpbot
wrtnbot
yak
yandexadditional
yandexadditionalbot
youbot
zanistabot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

awariorssbot
awariosmartbot
dataforseobot
magpie-crawler
meltwater
peer39_crawler
peer39_crawler/1.0
piplbot
scoop.it
seekr

Rule	Path
Disallow	/

Rule

Path

Disallow

/

wellknownbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Disallow	/api/
Disallow	/auth/
Disallow	/oauth/
Disallow	/check_your_email
Disallow	/wait_for_approval
Disallow	/account_disabled
Disallow	/signup
Disallow	/fileserver/
Disallow	/users/
Disallow	/emoji/
Disallow	/admin
Disallow	/user
Disallow	/settings/
Disallow	/about/suspended
Disallow	/.well-known/webfinger
Disallow	/.well-known/nodeinfo
Disallow	/nodeinfo/

Rule

Path

Disallow

/api/

Disallow

/auth/

Disallow

/oauth/

Disallow

/check_your_email

Disallow

/wait_for_approval

Disallow

/account_disabled

Disallow

/signup

Disallow

/fileserver/

Disallow

/users/

Disallow

/emoji/

Disallow

/admin

Disallow

/user

Disallow

/settings/

Disallow

/about/suspended

Disallow

/.well-known/webfinger

Disallow

/.well-known/nodeinfo

Disallow

/nodeinfo/

Other Records

Field	Value
crawl-delay	500

Field

Value

crawl-delay

500

Back to top

Comments

GoToSocial robots.txt -- to edit, see internal/api/util/robots.go
More info @ https://developers.google.com/search/docs/crawling-indexing/robots/intro
AI scrapers and the like.
https://github.com/ai-robots-txt/ai.robots.txt/
Marketing/SEO "intelligence" data scrapers
Well-known.dev crawler. Indexes stuff under /.well-known.
https://well-known.dev/about/
Rules for everything else.
API endpoints.
Auth/Sign in endpoints.
Fileserver/media.
Fedi S2S API endpoints.
Settings panels.
Domain blocklist.
Webfinger endpoint.
Disallow nodeinfo

Back to top

fed.hafs.inrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

awariorssbotawariosmartbotdataforseobotmagpie-crawlermeltwaterpeer39_crawlerpeer39_crawler/1.0piplbotscoop.itseekr

wellknownbot

*

Other Records

Comments

fed.hafs.in
robots.txt

awariorssbot
awariosmartbot
dataforseobot
magpie-crawler
meltwater
peer39_crawler
peer39_crawler/1.0
piplbot
scoop.it
seekr