haoshici.com
robots.txt

Robots Exclusion Standard data for haoshici.com

Resource Scan

Scan Details

Site Domain haoshici.com
Base Domain haoshici.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2026-02-02T16:00:45+00:00
Next Scan 2026-02-09T16:00:45+00:00

Last Successful Scan

Scanned2024-04-13T14:31:12+00:00
URL https://haoshici.com/robots.txt
Domain IPs 104.21.21.212, 172.67.200.91, 2606:4700:3030::ac43:c85b, 2606:4700:3037::6815:15d4
Response IP 172.67.200.91
Found Yes
Hash 8e3c04d9325f63a174a41681cfbc91f06ea7e0b74b07ca999909f1703b63f1ed
SimHash 1f3f521437cb

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /*.do
Disallow /*.json
Disallow /*.cfm
Disallow /*.giff
Disallow /login*
Disallow /zh-tw/login*
Disallow /join.html
Disallow /zh-tw/join.html
Disallow /*.html?aov=*
Disallow /zh-tw/*.html?aov=*
Disallow /*.html?qr

baiduspider

Rule Path
Disallow /zh-tw/verse-*.html
Disallow /zh-tw/search-*.html
Disallow /verse-*.html?type=*
Disallow /verse-*.html?cat=*
Disallow /verse-*.html?dynasty=*

yisouspider
sogou web spider

Rule Path
Disallow /zh-tw/

amazonbot

Rule Path
Disallow /zh-tw/verse-*.html
Disallow /zh-tw/search-*.html
Disallow /verse-*.html
Disallow /search-*.html

bingbot
googlebot

Rule Path
Disallow /zh-tw/verse-*.html?type=*
Disallow /zh-tw/verse-*.html?cat=*
Disallow /verse-*.html?type=*
Disallow /verse-*.html?cat=*

Warnings

  • 1 invalid line.