Robots Exclusion Standard data for kanshuwang.tw

Resource Scan

Scan Details

Site Domain kanshuwang.tw
Base Domain kanshuwang.tw
Scan Status Ok
Last Scan 2025-05-05T00:45:17+00:00
Next Scan 2025-05-12T00:45:17+00:00

Last Scan

Scanned 2025-05-05T00:45:17+00:00
URL https://kanshuwang.tw/robots.txt
Redirect https://www.kanshuwang.tw/robots.txt
Redirect Domain www.kanshuwang.tw
Redirect Base kanshuwang.tw
Domain IPs 104.21.84.245, 172.67.199.107, 2606:4700:3030::6815:54f5, 2606:4700:3033::ac43:c76b
Redirect IPs 104.21.84.245, 172.67.199.107, 2606:4700:3030::6815:54f5, 2606:4700:3033::ac43:c76b
Response IP 104.21.84.245
Found Yes
Hash c47987935fb809d32c45444911ee1e85893be360be9122819ce363e4466bd53c
SimHash 0a1ec062c8b3
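
The Hash value is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not state which algorithm or input the scanner uses, so the following is only a sketch of how such a content fingerprint could be computed, assuming SHA-256 over the raw response body. The function name `fingerprint` and the sample bytes are hypothetical.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the actual file served by the site.
sample = b"User-agent: *\nDisallow:\n"
digest = fingerprint(sample)
print(len(digest))  # 64 hex characters, the same length as the Hash field
```

Comparing this digest between two scans would show whether the file changed byte for byte; the SimHash field presumably serves a looser, similarity-based comparison, but its construction is not described here.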

Groups

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

riddler

Rule Path
Disallow /

screenerbot

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

*

Rule Path
Disallow (empty)
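
The groups above block the named crawlers from the whole site, while the `*` group carries a Disallow rule with an empty path, which conventionally means nothing is blocked. A minimal sketch of checking this with Python's standard `urllib.robotparser` follows. The rule text is an abbreviated reconstruction (three of the listed agents plus the wildcard group); the real file's exact wording, casing, and ordering are assumptions, and `examplebot` is a hypothetical agent name.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction of the groups listed in the report.
# This is NOT the literal file content, only the rules as reported.
ROBOTS_LINES = [
    "User-agent: mj12bot",
    "Disallow: /",
    "",
    "User-agent: ahrefsbot",
    "Disallow: /",
    "",
    "User-agent: sogou spider",
    "Disallow: /",
    "",
    "User-agent: *",
    "Disallow:",  # empty path: no restriction for unlisted agents
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

URL = "https://www.kanshuwang.tw/"


def allowed(agent: str) -> bool:
    """Report whether the given user agent may fetch the site root."""
    return parser.can_fetch(agent, URL)


print(allowed("mj12bot"))     # listed agent, blocked by "Disallow: /"
print(allowed("examplebot"))  # hypothetical unlisted agent, falls under "*"
```

Note that `urllib.robotparser` matches agent names case-insensitively and by substring, so other parsers may treat edge cases such as the space in `sogou spider` differently.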

Other Records

Field Value
crawl-delay 120
sitemap /sitemap/index.xml
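
The sitemap value is recorded as a path rather than a full URL. The Sitemaps protocol expects a complete URL on this line, so a consumer may ignore a bare path. As a sketch, assuming a consumer chooses to resolve the path against the robots.txt URL reached after the redirect, the standard `urllib.parse.urljoin` gives:

```python
from urllib.parse import urljoin

# Final robots.txt URL after the redirect, as listed under Last Scan.
ROBOTS_URL = "https://www.kanshuwang.tw/robots.txt"
# Sitemap value exactly as reported.
SITEMAP_VALUE = "/sitemap/index.xml"

sitemap_url = urljoin(ROBOTS_URL, SITEMAP_VALUE)
print(sitemap_url)  # https://www.kanshuwang.tw/sitemap/index.xml
```

Whether the site actually serves a sitemap at that address is not confirmed by the scan data.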

Warnings

  • 2 invalid lines.