m.kanshuwang.tw
robots.txt

Robots Exclusion Standard data for m.kanshuwang.tw

Resource Scan

Scan Details

Site Domain m.kanshuwang.tw
Base Domain kanshuwang.tw
Scan Status Ok
Last Scan2025-04-15T23:27:49+00:00
Next Scan 2025-05-15T23:27:49+00:00

Last Scan

Scanned2025-04-15T23:27:49+00:00
URL https://m.kanshuwang.tw/robots.txt
Domain IPs 104.21.84.245, 172.67.199.107, 2606:4700:3030::6815:54f5, 2606:4700:3033::ac43:c76b
Response IP 104.21.84.245
Found Yes
Hash c47987935fb809d32c45444911ee1e85893be360be9122819ce363e4466bd53c
SimHash 0a1ec062c8b3

Groups

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

riddler

Rule Path
Disallow /

screenerbot

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
crawl-delay 120

Other Records

Field Value
sitemap /sitemap/index.xml

Warnings

  • 2 invalid lines.