corpdetails.com
robots.txt

Robots Exclusion Standard data for corpdetails.com

Resource Scan

Scan Details

Site Domain corpdetails.com
Base Domain corpdetails.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-07-08T16:48:21+00:00
Next Scan 2024-10-06T16:48:21+00:00

Last Successful Scan

Scanned2021-10-19T16:45:11+00:00
URL https://www.corpdetails.com/robots.txt
Found Yes
Hash b11debfc133b5cca4f32a0a3732c82354eaf1c21287b982ff4adce73d3e051de
SimHash b363f35426a6

Groups

*

Rule Path
Disallow /images/
Disallow /corp/captcha
Disallow /post-comment
Disallow /FBLogin
Disallow /saveData.jsp
Disallow /claim-company
Disallow /FBApp
Disallow /one-step-login

mediapartners-google

Rule Path
Allow /

a\ .net web crawler

Rule Path
Disallow /

aipbot

Rule Path
Disallow /

amorphiccrawler

Rule Path
Disallow /

art-online.com\ 0.9(beta)

Rule Path
Disallow /

asterias

Rule Path
Disallow /

backdoorbot

Rule Path
Disallow /

baconsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

becomebot

Rule Path
Disallow /

black.hole

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

bleeding-sucker-bot

Rule Path
Disallow /

boitho

Rule Path
Disallow /

bot\ mailto:craftbot@yahoo.com

Rule Path
Disallow /

bot/1.0

Rule Path
Disallow /

botalot

Rule Path
Disallow /

builtbottough

Rule Path
Disallow /

bullseye

Rule Path
Disallow /

bunnyslippers

Rule Path
Disallow /

bwc/0.3

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

caliperbot

Rule Path
Disallow /

camelstampede

Rule Path
Disallow /

cazoodlebot

Rule Path
Disallow /

chat catcher

Rule Path
Disallow /

cherrypickerse

Rule Path
Disallow /

cherrypickerelite

Rule Path
Disallow /

chinaclaw

Rule Path
Disallow /

clearware\ web\ browser

Rule Path
Disallow /

cn_com_viewer

Rule Path
Disallow /

com_viewer

Rule Path
Disallow /

converacrawler

Rule Path
Disallow /

cowbot

Rule Path
Disallow /

crescent

Rule Path
Disallow /

custo

Rule Path
Disallow /

diibot/1.2

Rule Path
Disallow /

disco

Rule Path
Disallow /

distilled-cache

Rule Path
Disallow /

download\ demon

Rule Path
Disallow /

dloader

Rule Path
Disallow /

dloader(naverrobot)/1.5

Rule Path
Disallow /

bordermanager

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

drupal

Rule Path
Disallow /

dumbot

Rule Path
Disallow /

ec_robot

Rule Path
Disallow /

ecairn-grabber

Rule Path
Disallow /

ecatch

Rule Path
Disallow /

e-societyrobot

Rule Path
Disallow /

eirgrabber

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

erocrawler

Rule Path
Disallow /

exabot

Rule Path
Disallow /

express\ webpictures

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

eyenetie

Rule Path
Disallow /

fairshare

Rule Path
Disallow /

flashget

Rule Path
Disallow /

fr_crawler

Rule Path
Disallow /

gaisbot

Rule Path
Disallow /

gazopabot

Rule Path
Disallow /

gazz

Rule Path
Disallow /

geniebot

Rule Path
Disallow /

getleft

Rule Path
Disallow /

getright

Rule Path
Disallow /

getweb!

Rule Path
Disallow /

ginxbot

Rule Path
Disallow /

go!zilla

Rule Path
Disallow /

go.zilla

Rule Path
Disallow /

go-ahead-got-it

Rule Path
Disallow /

grabnet

Rule Path
Disallow /

grafula

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

grubng

Rule Path
Disallow /

gsa-crawler

Rule Path
Disallow /

gurujibot

Rule Path
Disallow /

harvest

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

hmview

Rule Path
Disallow /

holmesbot

Rule Path
Disallow /

http_load 29jun2005

Rule Path
Disallow /

httrack

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

image\ stripper

Rule Path
Disallow /

image\ sucker

Rule Path
Disallow /

imagebot

Rule Path
Disallow /

indy\ library

Rule Path
Disallow /

infonavirobot

Rule Path
Disallow /

interget

Rule Path
Disallow /

internet\ ninja

Rule Path
Disallow /

isidorus

Rule Path
Disallow /

isiteamspider

Rule Path
Disallow /

java

Rule Path
Disallow /

jennybot

Rule Path
Disallow /

jetcar

Rule Path
Disallow /

joc\ web\ spider

Rule Path
Disallow /

jyxobot

Rule Path
Disallow /

kalooga/kaloogabot

Rule Path
Disallow /

larbin

Rule Path
Disallow /

larbin_2.6.3

Rule Path
Disallow /

larbin_2.6.2\ larbin2.6.2@unspecified.mail

Rule Path
Disallow /

larbin_2.6.3\ larbin2.6.3@unspecified.mail

Rule Path
Disallow /

larbin-experimental

Rule Path
Disallow /

largesmall\ crawler

Rule Path
Disallow /

leechftp

Rule Path
Disallow /

lexibot

Rule Path
Disallow /

lexxebot

Rule Path
Disallow /

librabot

Rule Path
Disallow /

libweb/clshttp

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

linko

Rule Path
Disallow /

linkscan

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

linkverifier

Rule Path
Disallow /

lickity_split_spider

Rule Path
Disallow /

lmspider

Rule Path
Disallow /

lwp::simple

Rule Path
Disallow /

lwp-trivial

Rule Path
Disallow /

lwp-trivial/1.38

Rule Path
Disallow /

lwp-trivial/1.40

Rule Path
Disallow /

lwp-request

Rule Path
Disallow /

magnet

Rule Path
Disallow /

mag-net

Rule Path
Disallow /

mass\ downloader

Rule Path
Disallow /

mata.hari

Rule Path
Disallow /

memo

Rule Path
Disallow /

metalogger

Rule Path
Disallow /

midown\ tool

Rule Path
Disallow /

miixpc

Rule Path
Disallow /

mirror

Rule Path
Disallow /

mister\ pix

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

modiphibot

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

msnptc

Rule Path
Disallow /

naverbot-1.0

Rule Path
Disallow /

naverrobot

Rule Path
Disallow /

navroad

Rule Path
Disallow /

nearsite

Rule Path
Disallow /

netants

Rule Path
Disallow /

netspider

Rule Path
Disallow /

net\ vampire

Rule Path
Disallow /

netzip

Rule Path
Disallow /

nicerspro

Rule Path
Disallow /

nl-crawler

Rule Path
Disallow /

noteworthybot

Rule Path
Disallow /

noxtrumbot

Rule Path
Disallow /

npbot

Rule Path
Disallow /

nutchcvs

Rule Path
Disallow /

octopus

Rule Path
Disallow /

offline\ explorer

Rule Path
Disallow /

offline\ navigator

Rule Path
Disallow /

oozbot

Rule Path
Disallow /

openfind

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

pagegrabber

Rule Path
Disallow /

page-store

Rule Path
Disallow /

papa\ foto

Rule Path
Disallow /

p.arthur 1.1

Rule Path
Disallow /

pavuk

Rule Path
Disallow /

pcbrowser

Rule Path
Disallow /

pdfbot

Rule Path
Disallow /

peter\ wang/nutch-0.9

Rule Path
Disallow /

pita

Rule Path
Disallow /

pockey-gethtml/4.11.6

Rule Path
Disallow /

pockey-gethtml

Rule Path
Disallow /

poprlbot

Rule Path
Disallow /

propowerbot/2.14

Rule Path
Disallow /

program shareware

Rule Path
Disallow /

prowebwalker

Rule Path
Disallow /

psbot

Rule Path
Disallow /

python-urllib

Rule Path
Disallow /

queryn.metasearch

Rule Path
Disallow /

quepasacreep

Rule Path
Disallow /

rdfbot

Rule Path
Disallow /

r6_commentreader

Rule Path
Disallow /

realdownload

Rule Path
Disallow /

reget

Rule Path
Disallow /

renlifangbot

Rule Path
Disallow /

repomonkey

Rule Path
Disallow /

rma

Rule Path
Disallow /

outfoxbot

Rule Path
Disallow /

scooperbot

Rule Path
Disallow /

sherlock

Rule Path
Disallow /

simplecrawler

Rule Path
Disallow /

siphon

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

shablastbot

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

sitesucker

Rule Path
Disallow /

sisi

Rule Path
Disallow /

slysearch

Rule Path
Disallow /

smartdownload

Rule Path
Disallow /

sna-0.0.1

Rule Path
Disallow /

sna-0.0.1\ mikeelliott@hotmail.com

Rule Path
Disallow /

snapbot

Rule Path
Disallow /

snappreviewbot

Rule Path
Disallow /

sosoblogspider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

spanner

Rule Path
Disallow /

space\ bison

Rule Path
Disallow /

speedy\ spider

Rule Path
Disallow /

sphere\ scout

Rule Path
Disallow /

spinn3r

Rule Path
Disallow /

sqwidgebot

Rule Path
Disallow /

superbot

Rule Path
Disallow /

superhttp

Rule Path
Disallow /

surfbot

Rule Path
Disallow /

suzuran

Rule Path
Disallow /

sygolbot

Rule Path
Disallow /

szukacz/1.4

Rule Path
Disallow /

tailsweepbot

Rule Path
Disallow /

takeout

Rule Path
Disallow /

tanner\ spider/nutch-1.1

Rule Path
Disallow /

tapuzbot

Rule Path
Disallow /

tarantula\ experimental\ crawler

Rule Path
Disallow /

telnet

Rule Path
Disallow /

teleport\ pro

Rule Path
Disallow /

telesoft

Rule Path
Disallow /

the.intraformant

Rule Path
Disallow /

thenomad

Rule Path
Disallow /

tighttwatbot

Rule Path
Disallow /

titan

Rule Path
Disallow /

tineye

Rule Path
Disallow /

tocrawl/urldispatcher

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

turingos

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

urly.warning

Rule Path
Disallow /

voilabot

Rule Path
Disallow /

voideye

Rule Path
Disallow /

voyager

Rule Path
Disallow /

watchfire\ webxm

Rule Path
Disallow /

web\ image\ collector

Rule Path
Disallow /

web\ sucker

Rule Path
Disallow /

webalta crawler

Rule Path
Disallow /

webauto

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webclipping

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webemailextrac

Rule Path
Disallow /

webenhancer

Rule Path
Disallow /

webfetch

Rule Path
Disallow /

webgo\ is

Rule Path
Disallow /

webcollage

Rule Path
Disallow /

web.image.collector

Rule Path
Disallow /

webleacher

Rule Path
Disallow /

webmasterworldforumbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

websauger

Rule Path
Disallow /

website\ extractor

Rule Path
Disallow /

website\ quester

Rule Path
Disallow /

webster.pro

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webwhacker

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wget

Rule Path
Disallow /

wget/1.8.2

Rule Path
Disallow /

wells\ search\ ii

Rule Path
Disallow /

west\ wind\ internet\ protocols

Rule Path
Disallow /

widow

Rule Path
Disallow /

wordpress

Rule Path
Disallow /

wordpress\/2.6.2

Rule Path
Disallow /

wwwoffle

Rule Path
Disallow /

www-collector-e

Rule Path
Disallow /

xaldon\ webspider

Rule Path
Disallow /

xenu

Rule Path
Disallow /

xenu\ link\ sleuth

Rule Path
Disallow /

xrss

Rule Path
Disallow /

yacy

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

yanga\ worldsearch\ bot

Rule Path
Disallow /

yebolbot

Rule Path
Disallow /

yeti/1.0

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

xmind

Rule Path
Disallow /

zeus

Rule Path
Disallow /

zibber

Rule Path
Disallow /

zibber-v0.1

Rule Path
Disallow /

zixybot

Rule Path
Disallow /

zmeu

Rule Path
Disallow /

Warnings

  • 4 invalid lines.