Robots Exclusion Standard data for linux.org.tw
Resource Scan
Scan Details
Site Domain | linux.org.tw |
Base Domain | linux.org.tw |
Scan Status | Ok |
Last Scan | 2025-08-21T01:48:11+00:00 |
Next Scan | 2025-08-28T01:48:11+00:00 |
Last Scan
Scanned | 2025-08-21T01:48:11+00:00 |
URL | https://linux.org.tw/robots.txt |
Domain IPs | 185.141.24.188 |
Response IP | 185.141.24.188 |
Found | Yes |
Hash | 12901c89785442fa907119b35d8cb0614268d76595172ef2fa76b719a8d29d6e |
SimHash | f25cd310b580 |
Groups
ahrefsbot
ahrefssiteaudit
amazonbot
antbot
applebot
awariobot
barkrowler
blexbot
buck
bytespider
claudebot
criteobot
dataforseobot
dotbot
ds-robot
facebookexternalhit
fidget-spinner-bot
gptbot
grapeshotcrawler
linkdexbot
magpie-crawler
mediatoolkitbot
meta-externalagent
mj12bot
my-tiny-bot
petalbot
proximic
qwantify
rainbot
seekportbot
semrushbot
semrushbot-ba
semrushbot-coub
semrushbot-ct
semrushbot-si
semrushbot-swa
siteauditbot
splitsignalbot
test-bot
thesis-research-bot
timpibot
tinytestbot
trendictionbot
yak
yandexbot
yisouspider
Rule | Path |
---|---|
Disallow | / |
*
Rule | Path |
---|---|
Allow | / |
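The two groups above can be read as: every listed user agent is denied the whole site, and any other agent falls through to the `*` group and is allowed. A minimal sketch of that evaluation, using Python's standard `urllib.robotparser`, follows. The `ROBOTS_TXT` text is an illustrative reconstruction with only three of the listed agents (the scan does not show the file's literal contents), and `ExampleCrawler`, `build_parser`, and `is_allowed` are hypothetical names introduced for the example.

```python
from urllib.robotparser import RobotFileParser

# Illustrative reconstruction only: three of the listed agents share one
# Disallow group; every other agent falls through to the "*" group.
# The real file's exact text and ordering are not shown in the scan.
ROBOTS_TXT = """\
User-agent: gptbot
User-agent: claudebot
User-agent: bytespider
Disallow: /

User-agent: *
Allow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text from a string instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(agent: str, url: str) -> bool:
    """Return True if `agent` may fetch `url` under ROBOTS_TXT."""
    return build_parser(ROBOTS_TXT).can_fetch(agent, url)


if __name__ == "__main__":
    # "ExampleCrawler" is a made-up agent that is not in the listed group.
    for agent in ("gptbot", "claudebot", "ExampleCrawler"):
        print(agent, is_allowed(agent, "https://linux.org.tw/"))
```

Note that `urllib.robotparser` matches agent names case-insensitively and by substring, which may differ from how a particular crawler interprets the same file.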