genetichomeland.com
robots.txt

Robots Exclusion Standard data for genetichomeland.com

Resource Scan

Scan Details

Site Domain genetichomeland.com
Base Domain genetichomeland.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-08-05T22:06:49+00:00
Next Scan 2024-11-03T22:06:49+00:00

Last Successful Scan

Scanned2023-01-14T05:41:24+00:00
URL https://www.genetichomeland.com/robots.txt
Domain IPs 66.85.139.26
Response IP 66.85.139.26
Found Yes
Hash 7f30100205b3df2c71e989749a1d0cb7073d994c4818401cda58c4e9b01bb34f
SimHash 7714c5376b88

Groups

*

Rule Path
Disallow /mgt/
Disallow /accountmgt/
Disallow /privacypolicy/
Disallow /catalog/
Disallow /404.asp

Other Records

Field Value
crawl-delay 20

bingbot

Rule Path
Disallow /mgt/
Disallow /accountmgt/
Disallow /privacypolicy/
Disallow /404.asp
Disallow /support/

Other Records

Field Value
crawl-delay 30

baiduspider

Rule Path
Disallow /

baiduspider/2.0

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

sogou news spider

Rule Path
Disallow /

sogou orion spider

Rule Path
Disallow /

chinasospider

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

sogou spider2

Rule Path
Disallow /

sogou blog

Rule Path
Disallow /

yoozbot

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

plukkie

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

euripbot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

paracrawl

Rule Path
Disallow /

scrapy/1.5.0

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

velenpublicwebcrawler (velen.io)

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

semrushbot/2~bl

Rule Path
Disallow /

pcore-http

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

crawler.feedback+wc@gmail.com

Rule Path
Disallow /

cyotekwebcopy/1.0

Rule Path
Disallow /

centurybot9@gmail.com

Rule Path
Disallow /

crawler (crawler.feedback@gmail.com)

Rule Path
Disallow /

crawler

Rule Path
Disallow /

barkrowler/0.7 (+http://www.exensa.com/crawl)

Rule Path
Disallow /

go-http-client/1.1

Rule Path
Disallow /

test crawl

Rule Path
Disallow /

scalaj-http/1.0

Rule Path
Disallow /

bubing

Rule Path
Disallow /

wotbox/2.01

Rule Path
Disallow /

ccbot/2.0

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

ebibot

Rule Path
Disallow /

pcore-http/v0.24.5

Rule Path
Disallow /

testitest1

Rule Path
Disallow /

vegi bot

Rule Path
Disallow /

istellabot/t.1

Rule Path
Disallow /

istellabot/t.1.13

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

ltx71 - (http://ltx71.com/)

Rule Path
Disallow /

booglebot2

Rule Path
Disallow /

booglebot

Rule Path
Disallow /

booglebot 2.0

Rule Path
Disallow /

booglebot/2.0

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

influencebo

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

acoonbot

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

businessdbbot

Rule Path
Disallow /

superfeedr

Rule Path
Disallow /

flipboardproxy

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

rogerbot/1.0

Rule Path
Disallow /

flipboardproxy

Rule Path
Disallow /

swebot

Rule Path
Disallow /

swebot

Rule Path
Disallow /

www.80legs.com

Rule Path
Disallow /

nerdbynature.bot

Rule Path
Disallow /

comodospider

Rule Path
Disallow /

comodospider/nutch-1.2

Rule Path
Disallow /

daumoa

Rule Path
Disallow /

beetlebot

Rule Path
Disallow /

niki-bot

Rule Path
Disallow /

riddler

Rule Path
Disallow /

spbot

Rule Path
Disallow /

icarus6

Rule Path
Disallow /

icarus6

Rule Path
Disallow /

icarus

Rule Path
Disallow /

icarus

Rule Path
Disallow /

icarus6j

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

knelson

Rule Path
Disallow /

knelson/0.9

Rule Path
Disallow /

wotbox/2.01

Rule Path
Disallow /

blexbot/1.0

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /welcome/

dataforseobot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.genetichomeland.com/sitemap.xml

Comments

  • Clean-param: AD&SI&DB&SM /welcome/dnapedigree.asp
  • Clean-param: RL&TN&HgOnly /welcome/dnamarkerindex.asp
  • Bing - Microsoft
  • Baidu China
  • jike.com / chinaso.com chinese search engine
  • YouDau
  • Yandex Russia
  • AhrefsSEOBot
  • www.majestic12.co.uk
  • Disallow: /
  • 4/18/2019 additions from: http://www.hakank.org/robots.txt
  • User-agent: Flipboard
  • Disallow: /

Warnings

  • 10 invalid lines.