linux.org.tw
robots.txt

Robots Exclusion Standard data for linux.org.tw

Resource Scan

Scan Details

Site Domain linux.org.tw
Base Domain linux.org.tw
Scan Status Ok
Last Scan2025-08-21T01:48:11+00:00
Next Scan 2025-08-28T01:48:11+00:00

Last Scan

Scanned2025-08-21T01:48:11+00:00
URL https://linux.org.tw/robots.txt
Domain IPs 185.141.24.188
Response IP 185.141.24.188
Found Yes
Hash 12901c89785442fa907119b35d8cb0614268d76595172ef2fa76b719a8d29d6e
SimHash f25cd310b580

Groups

ahrefsbot
ahrefssiteaudit
amazonbot
antbot
applebot
awariobot
barkrowler
blexbot
buck
bytespider
claudebot
criteobot
dataforseobot
dotbot
ds-robot
facebookexternalhit
fidget-spinner-bot
gptbot
grapeshotcrawler
linkdexbot
magpie-crawler
mediatoolkitbot
meta-externalagent
mj12bot
my-tiny-bot
petalbot
proximic
qwantify
rainbot
seekportbot
semrushbot
semrushbot-ba
semrushbot-coub
semrushbot-ct
semrushbot-si
semrushbot-swa
siteauditbot
splitsignalbot
test-bot
thesis-research-bot
timpibot
tinytestbot
trendictionbot
yak
yandexbot
yisouspider

Rule Path
Disallow /

*

Rule Path
Allow /