clinicalconnection.com
robots.txt

Robots Exclusion Standard data for clinicalconnection.com

Resource Scan

Scan Details

Site Domain clinicalconnection.com
Base Domain clinicalconnection.com
Scan Status Ok
Last Scan 2024-10-31T22:29:18+00:00
Next Scan 2024-11-30T22:29:18+00:00

Last Scan

Scanned 2024-10-31T22:29:18+00:00
URL https://clinicalconnection.com/robots.txt
Redirect https://www.clinicalconnection.com/robots.txt
Redirect Domain www.clinicalconnection.com
Redirect Base clinicalconnection.com
Domain IPs 104.46.115.222
Redirect IPs 104.46.115.222
Response IP 104.46.115.222
Found Yes
Hash 541ed5f4d6aec63968b90713ba9255a07f65ccb9ccd760b72246af754ec63603
SimHash 50642062e46a
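
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say whether the body is normalized before hashing. Under that assumption, a minimal sketch of how a scanner could fingerprint a fetched robots.txt body and detect a change between scans might look like this. The function names `fingerprint` and `has_changed` are hypothetical.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the report's Hash field is a plain SHA-256 of the raw
    response bytes. The scanner may hash something else entirely.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against the hash stored by the last scan."""
    return fingerprint(body) != previous_hash.lower()
```

An exact hash only reports whether anything changed at all. The separate SimHash field suggests the scanner also keeps a similarity fingerprint for near-duplicate detection, which this sketch does not attempt to reproduce.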

Groups

*

Rule Path
Disallow /bin/
Disallow /masters/
Disallow /comps/
Disallow /Content/
Disallow /Scripts/
Disallow /health-news/
Disallow /clinical-trials-from-other-databases/
Disallow /search-clinical-trials-from-other-databases/
Disallow /study-participant/
Disallow /nearby-studies-in-zipcode/
Disallow /clinic-admin/*
Allow /study-participant/login
Allow /clinic-admin/
Allow /clinic-admin/study-center-join
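
The * group mixes Disallow and Allow rules on overlapping paths, so the outcome for a given URL depends on how a crawler resolves conflicts. The sketch below assumes the longest-match convention described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie); individual crawlers may behave differently. The names `RULES`, `_to_regex`, and `is_allowed` are hypothetical, and the rule list is copied from the group above.

```python
import re

# Rules of the "*" group, in the order listed in the report.
RULES = [
    ("disallow", "/bin/"),
    ("disallow", "/masters/"),
    ("disallow", "/comps/"),
    ("disallow", "/Content/"),
    ("disallow", "/Scripts/"),
    ("disallow", "/health-news/"),
    ("disallow", "/clinical-trials-from-other-databases/"),
    ("disallow", "/search-clinical-trials-from-other-databases/"),
    ("disallow", "/study-participant/"),
    ("disallow", "/nearby-studies-in-zipcode/"),
    ("disallow", "/clinic-admin/*"),
    ("allow", "/study-participant/login"),
    ("allow", "/clinic-admin/"),
    ("allow", "/clinic-admin/study-center-join"),
]


def _to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern ('*' wildcard, optional '$' anchor)."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Apply longest-match precedence; a path matching no rule is allowed."""
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if _to_regex(pattern).match(path):
            allow = kind == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


for p in ["/study-participant/login", "/study-participant/profile",
          "/clinic-admin/study-center-join", "/clinic-admin/dashboard"]:
    print(p, "allowed" if is_allowed(p) else "blocked")
```

Under this length-based reading, `Disallow /clinic-admin/*` (15 characters) outranks `Allow /clinic-admin/` (14 characters), so only the longer `Allow /clinic-admin/study-center-join` rule carves out an exception in that directory. Matching is case-sensitive here, so `/Content/` and `/content/` are treated as different paths.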

ahrefsbot

Rule Path
Allow /

mj12bot

Rule Path
Allow /

semrushbot

Rule Path
Allow /

semrushbot-sa

Rule Path
Allow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

sitesucker

Rule Path
Disallow /

webcopy

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

acunetix

Rule Path
Disallow /

nessus

Rule Path
Disallow /

nikto

Rule Path
Disallow /

sqlmap

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

jobs.de-robot

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

seodiver

Rule Path
Disallow /

spbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

obot

Rule Path
Disallow /

fr-crawler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

cloudservermarketspider

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

careerbot

Rule Path
Disallow /

lipperhey-kaus-australis

Rule Path
Disallow /

seoscanners.net

Rule Path
Disallow /

metajobbot

Rule Path
Disallow /

spiderbot

Rule Path
Disallow /

linkstats

Rule Path
Disallow /

jobboersebot

Rule Path
Disallow /

iccrawler

Rule Path
Disallow /

plista

Rule Path
Disallow /

domain re-animator bot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

um-ic

Rule Path
Disallow /

mindupbot

Rule Path
Disallow /

sg-orbiter

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

kraken

Rule Path
Disallow /

plukkie

Rule Path
Disallow /

safednsbot

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

openhosebot

Rule Path
Disallow /

thumbsniper

Rule Path
Disallow /

r6_commentreader

Rule Path
Disallow /

implisensebot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

adscanner

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

python/3.5 aiohttp

Rule Path
Disallow /

toweya.com

Rule Path
Disallow /

netestate

Rule Path
Disallow /

bubing

Rule Path
Disallow /

linguee

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

domaincrawler

Rule Path
Disallow /

indeedbot

Rule Path
Disallow /

garlikcrawler

Rule Path
Disallow /

gosign-security-crawler

Rule Path
Disallow /

siteliner

Rule Path
Disallow /

sabsimbot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

sogou spider2

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.clinicalconnection.com/sitemap.xml

Comments

  • Allow specific pages within disallowed paths
  • Allow specific bots used for competitive analysis and SEO tools
  • Disallow specific non-essential crawlers
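
The policy summarized in these comments only takes effect if a crawler picks the group written for it rather than the * fallback. As a rough illustration, the sketch below selects a group by case-insensitive substring match against the User-Agent string and prefers the longest matching name, which is one common convention and not necessarily what any particular crawler does. The name `select_group`, the shortened `GROUP_NAMES` list, and the example User-Agent strings are assumptions for illustration.

```python
# A small subset of the group names listed in the report, plus the fallback.
GROUP_NAMES = [
    "*",
    "ahrefsbot", "mj12bot", "semrushbot", "semrushbot-sa",  # Allow /
    "ccbot", "yandexbot", "baiduspider", "dotbot",          # Disallow /
]


def select_group(user_agent: str, group_names=GROUP_NAMES) -> str:
    """Return the most specific group name found in the User-Agent, else '*'."""
    ua = user_agent.lower()
    matches = [name for name in group_names if name != "*" and name in ua]
    # Prefer the longest name so "semrushbot-sa" beats "semrushbot".
    return max(matches, key=len) if matches else "*"


print(select_group("Mozilla/5.0 (compatible; SemrushBot-SA/0.97)"))
print(select_group("CCBot/2.0"))
print(select_group("ExampleBrowser/1.0"))
```

A crawler that lands in one of the named groups follows only that group's rules, so the allowed SEO bots are not subject to the path restrictions in the * group.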

Warnings

  • 2 invalid lines.