dataecology.net
robots.txt

Robots Exclusion Standard data for dataecology.net

Resource Scan

Scan Details

Site Domain dataecology.net
Base Domain dataecology.net
Scan Status Ok
Last Scan2024-10-03T21:02:54+00:00
Next Scan 2024-11-02T21:02:54+00:00

Last Scan

Scanned2024-10-03T21:02:54+00:00
URL http://dataecology.net/robots.txt
Domain IPs 20.37.145.104
Response IP 20.37.145.104
Found Yes
Hash 258e27ff1e29700aca46c21558559e9c64b1ab33d1c1a118b19c683d6a29cb64
SimHash c9864642fb11

Groups

baiduspider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

cutestat

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

naverbot
yeti

Rule Path
Disallow /

yandex

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

linespider

Rule Path
Disallow /

coccocbot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot
*

Rule Path
Disallow /AFqdtDRUNzL2wSumJz6H
Disallow /*.axd$
Disallow /*.axd
Disallow /ScriptResource.axd
Disallow /WebResource.axd
Disallow /scriptresource.axd
Disallow /webresource.axd