cleanmedia.com
robots.txt

Robots Exclusion Standard data for cleanmedia.com

Resource Scan

Scan Details

Site Domain cleanmedia.com
Base Domain cleanmedia.com
Scan Status Ok
Last Scan2024-11-03T23:38:30+00:00
Next Scan 2024-12-03T23:38:30+00:00

Last Scan

Scanned2024-11-03T23:38:30+00:00
URL https://cleanmedia.com/robots.txt
Domain IPs 185.53.178.53
Response IP 185.53.178.53
Found Yes
Hash 39573113c30ec2fab1e8632281c0bd93c5c01a352f9ef6c30cbb5fb6ed1bea61
SimHash 44175844450a

Groups

googlebot

Rule Path
Disallow /*?
Disallow /munin*

baiduspider

Rule Path
Disallow /*?
Disallow /munin*

yandexbot

Rule Path
Disallow /*?
Disallow /munin*

ichiro

Rule Path
Disallow /*?
Disallow /munin*

sogou spider

Rule Path
Disallow /*?
Disallow /munin*

sosospider

Rule Path
Disallow /*?
Disallow /munin*

youdaobot

Rule Path
Disallow /*?
Disallow /munin*

yetibot

Rule Path
Disallow /*?
Disallow /munin*

bingbot

Rule Path
Disallow /*?
Disallow /munin*

Other Records

Field Value
crawl-delay 2

yahoo! slurp

Rule Path
Disallow /*?
Disallow /munin*

Other Records

Field Value
crawl-delay 2

rdfbot

Rule Path
Disallow /*?
Disallow /munin*

seznambot

Rule Path
Disallow /*?
Disallow /munin*

ia_archiver

Rule Path
Disallow /munin*

mediapartners-google

Rule Path
Disallow /munin*

Warnings

  • `request-rate` is not a known field.