fed.hafs.in
robots.txt
Robots Exclusion Standard data for fed.hafs.in
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | fed.hafs.in |
| Base Domain | hafs.in |
| Scan Status | Ok |
| Last Scan | 2026-02-14T05:22:49+00:00 |
| Next Scan | 2026-03-16T05:22:49+00:00 |
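
The two timestamps above imply a rescan interval, which can be checked with a short calculation. This is a minimal sketch; the variable names are illustrative, and it is only an assumption that the scanner uses a fixed interval rather than some other scheduling rule.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2026-02-14T05:22:49+00:00")
next_scan = datetime.fromisoformat("2026-03-16T05:22:49+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(f"Rescan interval: {interval.days} days")
```

For these two values the interval comes out to exactly 30 days.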
Last Scan
| Field | Value |
|---|---|
| Scanned | 2026-02-14T05:22:49+00:00 |
| URL | https://fed.hafs.in/robots.txt |
| Domain IPs | 104.21.70.249, 172.67.141.4, 2606:4700:3030::6815:46f9, 2606:4700:3035::ac43:8d04 |
| Response IP | 172.67.141.4 |
| Found | Yes |
| Hash | 72b552d481dc930c782493fb4f844ec403b5997d7f1df3d1ef476cd41f2e2b3f |
| SimHash | 366edb51408e |
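
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a change check could be sketched as follows; `body_digest` and `matches_recorded_hash` are hypothetical helper names, and the exact bytes the scanner hashes are unknown.

```python
import hashlib

# Hash value copied from the Last Scan table.
RECORDED_HASH = "72b552d481dc930c782493fb4f844ec403b5997d7f1df3d1ef476cd41f2e2b3f"

def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

def matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body hashes to the recorded value, i.e. the file is unchanged."""
    return body_digest(body) == recorded.lower()
```

A mismatch would only indicate that the fetched bytes differ from what was scanned, or that the assumption about the algorithm is wrong.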
Groups
addsearchbot
ai2bot
ai2bot-deepresearcheval
ai2bot-dolma
aihitbot
amazon-kendra
amazonbot
amazonbuyforme
andibot
anomura
anthropic-ai
applebot
applebot-extended
atlassian-bot
awario
bedrockbot
bigsur.ai
bravebot
brightbot 1.0
buddybot
bytespider
ccbot
channel3bot
chatglm-spider
chatgpt agent
chatgpt-user
claude-searchbot
claude-user
claude-web
claudebot
cloudflare-autorag
cloudvertexbot
cohere-ai
cohere-training-data-crawler
cotoyogi
crawl4ai
crawlspace
datenbank crawler
deepseekbot
devin
diffbot
duckassistbot
echobot bot
echoboxbot
facebookbot
facebookexternalhit
factset_spyderbot
firecrawlagent
friendlycrawler
gemini-deep-research
google-cloudvertexbot
google-extended
google-firebase
google-notebooklm
googleagent-mariner
googleother
googleother-image
googleother-video
gptbot
iaskbot
iaskspider
iaskspider/2.0
iboubot
icc-crawler
imagesiftbot
imagespider
img2dataset
isscyberriskcrawler
kangaroo bot
klaviyoaibot
kunatocrawler
laion-huggingface-processor
laiondownloader
lcc
linerbot
linguee bot
linkupbot
manus-user
meta-externalagent
meta-externalfetcher
meta-webindexer
mistralai-user
mistralai-user/1.0
mycentralaiscraperbot
netestate imprint crawler
notebooklm
novaact
oai-searchbot
omgili
omgilibot
openai
operator
pangubot
panscient
panscient.com
perplexity-user
perplexitybot
petalbot
phindbot
poggio-citations
poseidon research crawler
qualifiedbot
quillbot
quillbot.com
sbintuitionsbot
scrapy
semrushbot-ocob
semrushbot-swa
shapbot
sidetrade indexer bot
spider
terracotta
thinkbot
tiktokspider
timpibot
twinagent
velenpublicwebcrawler
wardbot
webzio-extended
wpbot
wrtnbot
yak
yandexadditional
yandexadditionalbot
youbot
zanistabot
| Rule | Path |
|---|---|
| Disallow | / |
awariorssbot
awariosmartbot
dataforseobot
magpie-crawler
meltwater
peer39_crawler
peer39_crawler/1.0
piplbot
scoop.it
seekr
| Rule | Path |
|---|---|
| Disallow | / |
*
| Rule | Path |
|---|---|
| Disallow | /api/ |
| Disallow | /auth/ |
| Disallow | /oauth/ |
| Disallow | /check_your_email |
| Disallow | /wait_for_approval |
| Disallow | /account_disabled |
| Disallow | /signup |
| Disallow | /fileserver/ |
| Disallow | /users/ |
| Disallow | /emoji/ |
| Disallow | /admin |
| Disallow | /user |
| Disallow | /settings/ |
| Disallow | /about/suspended |
| Disallow | /.well-known/webfinger |
| Disallow | /.well-known/nodeinfo |
| Disallow | /nodeinfo/ |
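
The effect of the groups above can be explored with Python's standard `urllib.robotparser`. The sketch below rebuilds a partial robots.txt from the tables: only four of the blocked agents are included for brevity, and `examplebot` is a hypothetical crawler name that falls through to the `*` group. It is a reconstruction under those assumptions, not the site's actual file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical subset of the agents that are given "Disallow: /".
BLOCKED_AGENTS = ["gptbot", "claudebot", "ccbot", "bytespider"]

# Paths disallowed for the wildcard (*) group, as listed in the table.
WILDCARD_DISALLOWS = [
    "/api/", "/auth/", "/oauth/", "/check_your_email", "/wait_for_approval",
    "/account_disabled", "/signup", "/fileserver/", "/users/", "/emoji/",
    "/admin", "/user", "/settings/", "/about/suspended",
    "/.well-known/webfinger", "/.well-known/nodeinfo", "/nodeinfo/",
]

# Assemble the reconstructed file line by line.
lines = [f"User-agent: {agent}" for agent in BLOCKED_AGENTS]
lines += ["Disallow: /", "", "User-agent: *"]
lines += [f"Disallow: {path}" for path in WILDCARD_DISALLOWS]

parser = RobotFileParser()
parser.parse(lines)

def allowed(agent: str, path: str) -> bool:
    """Return True if the agent may fetch the given path on fed.hafs.in."""
    return parser.can_fetch(agent, "https://fed.hafs.in" + path)

for agent, path in [("gptbot", "/"), ("examplebot", "/"),
                    ("examplebot", "/api/v1/instance"), ("examplebot", "/about")]:
    print(agent, path, allowed(agent, path))
```

Rules are prefix matches, so `Disallow: /user` also covers `/users/` and any other path beginning with `/user`. Python's parser matches agent names by substring, so a short group name such as `spider` would apply to any crawler whose name contains it; other parsers may match differently.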
Other Records
| Field | Value |
|---|---|
| crawl-delay | 500 |
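
`crawl-delay` is a non-standard extension that crawlers honoring it usually read as a number of seconds between requests. The report lists it outside any group, so attaching it to the `*` group below is an assumption, as is the crawler name `examplebot`. Under that reading, a rough sketch of what the value implies:

```python
from urllib.robotparser import RobotFileParser

parser = RobotFileParser()
parser.parse([
    "User-agent: *",
    "Crawl-delay: 500",   # assumed to belong to the wildcard group
    "Disallow: /api/",
])

# Delay in seconds that applies to a crawler matched by the * group.
delay = parser.crawl_delay("examplebot")

# Upper bound on requests per day if the crawler waits `delay` seconds each time.
SECONDS_PER_DAY = 24 * 60 * 60
requests_per_day = SECONDS_PER_DAY // delay
print(f"delay={delay}s, at most {requests_per_day} requests per day")
```

At 500 seconds per request, a compliant crawler would make at most 172 requests in a day.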