nihophawa.com.vn
robots.txt
Robots Exclusion Standard data for nihophawa.com.vn
Resource Scan
Scan Details
| Site Domain | nihophawa.com.vn |
| Base Domain | nihophawa.com.vn |
| Scan Status | Ok |
| Last Scan | 2026-02-20T15:01:41+00:00 |
| Next Scan | 2026-03-22T15:01:41+00:00 |
Last Scan
| Scanned | 2026-02-20T15:01:41+00:00 |
| URL | https://nihophawa.com.vn/robots.txt |
| Domain IPs | 103.90.234.208 |
| Response IP | 103.90.234.208 |
| Found | Yes |
| Hash | 2af826321fb421c52f6fd991d41b6e9f2006f1ffeb1b8ef07347caa1e486ee1c |
| SimHash | ff5cca514086 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
addsearchbot
ai2bot
ai2bot-deepresearcheval
ai2bot-dolma
aihitbot
amazon-kendra
amazonbot
amazonbuyforme
andibot
anomura
anthropic-ai
applebot
applebot-extended
atlassian-bot
awario
bedrockbot
bigsur.ai
bravebot
brightbot 1.0
buddybot
bytespider
ccbot
channel3bot
chatglm-spider
chatgpt agent
chatgpt-user
claude-searchbot
claude-user
claude-web
claudebot
cloudflare-autorag
cloudvertexbot
cohere-ai
cohere-training-data-crawler
cotoyogi
crawl4ai
crawlspace
datenbank crawler
deepseekbot
devin
diffbot
duckassistbot
echobot bot
echoboxbot
facebookbot
facebookexternalhit
factset_spyderbot
firecrawlagent
friendlycrawler
gemini-deep-research
google-cloudvertexbot
google-extended
google-firebase
google-notebooklm
googleagent-mariner
googleother
googleother-image
googleother-video
gptbot
iaskbot
iaskspider
iaskspider/2.0
iboubot
icc-crawler
imagesiftbot
imagespider
img2dataset
isscyberriskcrawler
kangaroo bot
klaviyoaibot
kunatocrawler
laion-huggingface-processor
laiondownloader
lcc
linerbot
linguee bot
linkupbot
manus-user
meta-externalagent
meta-externalagent
meta-externalfetcher
meta-externalfetcher
meta-webindexer
mistralai-user
mistralai-user/1.0
mycentralaiscraperbot
netestate imprint crawler
notebooklm
novaact
oai-searchbot
omgili
omgilibot
openai
operator
pangubot
panscient
panscient.com
perplexity-user
perplexitybot
petalbot
phindbot
poggio-citations
poseidon research crawler
qualifiedbot
quillbot
quillbot.com
sbintuitionsbot
scrapy
semrushbot-ocob
semrushbot-swa
shapbot
sidetrade indexer bot
spider
tavilybot
terracotta
thinkbot
tiktokspider
timpibot
twinagent
velenpublicwebcrawler
wardbot
webzio-extended
webzio-extended
wpbot
wrtnbot
yak
yandexadditional
yandexadditionalbot
youbot
zanistabot
| Rule | Path |
|---|---|
| Disallow | / |
*
| Rule | Path |
|---|---|
| Disallow |
Other Records
| Field | Value |
|---|---|
| sitemap | https://nihophawa.com.vn/sitemap_index.xml |
Warnings
- `content-signal` is not a known field.
Comments