readingclash.cc
robots.txt

Robots Exclusion Standard data for readingclash.cc

Resource Scan

Scan Details

Site Domain readingclash.cc
Base Domain readingclash.cc
Scan Status Ok
Last Scan 2024-11-04T22:15:08+00:00
Next Scan 2024-12-04T22:15:08+00:00

Last Scan

Scanned 2024-11-04T22:15:08+00:00
URL https://readingclash.cc/robots.txt
Domain IPs 104.21.50.203, 172.67.166.201, 2606:4700:3033::6815:32cb, 2606:4700:3033::ac43:a6c9
Response IP 172.67.166.201
Found Yes
Hash 39573113c30ec2fab1e8632281c0bd93c5c01a352f9ef6c30cbb5fb6ed1bea61
SimHash 44175844450a

Groups

googlebot

Rule Path
Disallow /*?
Disallow /munin*

baiduspider

Rule Path
Disallow /*?
Disallow /munin*

yandexbot

Rule Path
Disallow /*?
Disallow /munin*

ichiro

Rule Path
Disallow /*?
Disallow /munin*

sogou spider

Rule Path
Disallow /*?
Disallow /munin*

sosospider

Rule Path
Disallow /*?
Disallow /munin*

youdaobot

Rule Path
Disallow /*?
Disallow /munin*

yetibot

Rule Path
Disallow /*?
Disallow /munin*

bingbot

Rule Path
Disallow /*?
Disallow /munin*

Other Records

Field Value
crawl-delay 2

yahoo! slurp

Rule Path
Disallow /*?
Disallow /munin*

Other Records

Field Value
crawl-delay 2

rdfbot

Rule Path
Disallow /*?
Disallow /munin*

seznambot

Rule Path
Disallow /*?
Disallow /munin*

ia_archiver

Rule Path
Disallow /munin*

mediapartners-google

Rule Path
Disallow /munin*
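
The Disallow paths in the groups above rely on the `*` wildcard. As a rough illustration of how a crawler might evaluate them, the sketch below translates each pattern into a regular expression, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, and this is not necessarily how any listed crawler implements matching.

```python
import re


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed convention: `*` matches any run of characters and a
    trailing `$` anchors the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces, then join them with `.*` for each `*`.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns: list) -> bool:
    """Return True if any pattern matches the path from its start."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


# Rule sets copied from the groups in this report.
MOST_GROUPS = ["/*?", "/munin*"]   # e.g. googlebot, bingbot
MUNIN_ONLY = ["/munin*"]           # ia_archiver, mediapartners-google

for path in ["/books", "/page?x=1", "/munin/graph"]:
    print(path, is_disallowed(path, MOST_GROUPS), is_disallowed(path, MUNIN_ONLY))
```

Under these assumptions, `/*?` blocks any URL that carries a query string, while `/munin*` blocks everything whose path begins with `/munin`. The two groups that list only `/munin*` would still be allowed to fetch URLs with query strings.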

Warnings

  • `request-rate` is not a known field.
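
A warning like the one above can be produced by comparing each field name in the file against a list of recognized fields. The sketch below shows one way to do that. The `KNOWN_FIELDS` set is an assumption (the report only confirms that `disallow` and `crawl-delay` are recognized and that `request-rate` is not), and `SAMPLE` is a hypothetical file written for illustration, not the actual contents of this site's robots.txt.

```python
# Assumed set of recognized fields; the scanner's real list is not published here.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "crawl-delay", "sitemap"}


def find_unknown_fields(robots_txt: str) -> list:
    """Return unrecognized field names, lowercased, in first-seen order."""
    unknown = []
    for line in robots_txt.splitlines():
        # Drop comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in unknown:
            unknown.append(field)
    return unknown


# Hypothetical input: the bot name and the request-rate value are made up.
SAMPLE = """\
User-agent: examplebot
Disallow: /munin*
Crawl-delay: 2
Request-rate: 1/5
"""

for field in find_unknown_fields(SAMPLE):
    print(f"`{field}` is not a known field.")
```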