Robots Exclusion Standard data for nihophawa.com.vn

Resource Scan

Scan Details

Site Domain nihophawa.com.vn
Base Domain nihophawa.com.vn
Scan Status Ok
Last Scan 2026-02-20T15:01:41+00:00
Next Scan 2026-03-22T15:01:41+00:00

Last Scan

Scanned 2026-02-20T15:01:41+00:00
URL https://nihophawa.com.vn/robots.txt
Domain IPs 103.90.234.208
Response IP 103.90.234.208
Found Yes
Hash 2af826321fb421c52f6fd991d41b6e9f2006f1ffeb1b8ef07347caa1e486ee1c
SimHash ff5cca514086

Groups

*

Rule Path
Allow /

addsearchbot
ai2bot
ai2bot-deepresearcheval
ai2bot-dolma
aihitbot
amazon-kendra
amazonbot
amazonbuyforme
andibot
anomura
anthropic-ai
applebot
applebot-extended
atlassian-bot
awario
bedrockbot
bigsur.ai
bravebot
brightbot 1.0
buddybot
bytespider
ccbot
channel3bot
chatglm-spider
chatgpt agent
chatgpt-user
claude-searchbot
claude-user
claude-web
claudebot
cloudflare-autorag
cloudvertexbot
cohere-ai
cohere-training-data-crawler
cotoyogi
crawl4ai
crawlspace
datenbank crawler
deepseekbot
devin
diffbot
duckassistbot
echobot bot
echoboxbot
facebookbot
facebookexternalhit
factset_spyderbot
firecrawlagent
friendlycrawler
gemini-deep-research
google-cloudvertexbot
google-extended
google-firebase
google-notebooklm
googleagent-mariner
googleother
googleother-image
googleother-video
gptbot
iaskbot
iaskspider
iaskspider/2.0
iboubot
icc-crawler
imagesiftbot
imagespider
img2dataset
isscyberriskcrawler
kangaroo bot
klaviyoaibot
kunatocrawler
laion-huggingface-processor
laiondownloader
lcc
linerbot
linguee bot
linkupbot
manus-user
meta-externalagent
meta-externalagent
meta-externalfetcher
meta-externalfetcher
meta-webindexer
mistralai-user
mistralai-user/1.0
mycentralaiscraperbot
netestate imprint crawler
notebooklm
novaact
oai-searchbot
omgili
omgilibot
openai
operator
pangubot
panscient
panscient.com
perplexity-user
perplexitybot
petalbot
phindbot
poggio-citations
poseidon research crawler
qualifiedbot
quillbot
quillbot.com
sbintuitionsbot
scrapy
semrushbot-ocob
semrushbot-swa
shapbot
sidetrade indexer bot
spider
tavilybot
terracotta
thinkbot
tiktokspider
timpibot
twinagent
velenpublicwebcrawler
wardbot
webzio-extended
webzio-extended
wpbot
wrtnbot
yak
yandexadditional
yandexadditionalbot
youbot
zanistabot

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://nihophawa.com.vn/sitemap_index.xml
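The groups and sitemap record above can be checked with a short script. This is a minimal sketch, not the scanner's own method: the sample text is a hypothetical reconstruction that keeps only three of the listed user agents (gptbot, claudebot, ccbot), since the report does not show the raw file, and the `build_parser` helper is an illustrative name. It uses Python's standard `urllib.robotparser`, whose matching rules may differ in detail from other crawlers' parsers.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the reported groups. The real file lists
# many more user agents in the Disallow group; only three are kept here.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: gptbot
User-agent: claudebot
User-agent: ccbot
Disallow: /

User-agent: *
Disallow:

Sitemap: https://nihophawa.com.vn/sitemap_index.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(SAMPLE_ROBOTS_TXT)
home = "https://nihophawa.com.vn/"

# Named agents fall under the Disallow group; any other agent falls back
# to the first wildcard group, which allows everything.
for agent in ("gptbot", "claudebot", "googlebot"):
    print(agent, parser.can_fetch(agent, home))

# site_maps() is available in Python 3.8 and later.
print(parser.site_maps())
```

With this sample, the named agents are refused and an unlisted agent such as googlebot is allowed. Note that `urllib.robotparser` uses only the first `*` group it encounters, so the second wildcard group (the empty `Disallow`) has no effect in this parser; here both wildcard groups permit everything, so the outcome is the same either way.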

Comments

  • BEGIN Tadu WAF Managed content
  • END Tadu WAF Managed Content
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK

Warnings

  • `content-signal` is not a known field.