ww38.livehongkong.org
robots.txt

Robots Exclusion Standard data for ww38.livehongkong.org

Resource Scan

Scan Details

Site Domain ww38.livehongkong.org
Base Domain livehongkong.org
Scan Status Ok
Last Scan2024-10-31T07:25:37+00:00
Next Scan 2024-11-30T07:25:37+00:00

Last Scan

Scanned2024-10-31T07:25:37+00:00
URL http://ww38.livehongkong.org/robots.txt
Domain IPs 75.2.120.224
Response IP 75.2.120.224
Found Yes
Hash 39573113c30ec2fab1e8632281c0bd93c5c01a352f9ef6c30cbb5fb6ed1bea61
SimHash 44175844450a

Groups

googlebot

Rule Path
Disallow /*?
Disallow /munin*

baiduspider

Rule Path
Disallow /*?
Disallow /munin*

yandexbot

Rule Path
Disallow /*?
Disallow /munin*

ichiro

Rule Path
Disallow /*?
Disallow /munin*

sogou spider

Rule Path
Disallow /*?
Disallow /munin*

sosospider

Rule Path
Disallow /*?
Disallow /munin*

youdaobot

Rule Path
Disallow /*?
Disallow /munin*

yetibot

Rule Path
Disallow /*?
Disallow /munin*

bingbot

Rule Path
Disallow /*?
Disallow /munin*

Other Records

Field Value
crawl-delay 2

yahoo! slurp

Rule Path
Disallow /*?
Disallow /munin*

Other Records

Field Value
crawl-delay 2

rdfbot

Rule Path
Disallow /*?
Disallow /munin*

seznambot

Rule Path
Disallow /*?
Disallow /munin*

ia_archiver

Rule Path
Disallow /munin*

mediapartners-google

Rule Path
Disallow /munin*

Warnings

  • `request-rate` is not a known field.