jcomjeune.com
robots.txt

Robots Exclusion Standard data for jcomjeune.com

Resource Scan

Scan Details

Site Domain jcomjeune.com
Base Domain jcomjeune.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't establish SSL connection.
Last Scan 2024-11-14T21:11:50+00:00
Next Scan 2025-01-13T21:11:50+00:00

Last Successful Scan

Scanned 2024-09-16T21:10:13+00:00
URL http://jcomjeune.com/robots.txt
Redirect https://www.cidj.com/robots.txt
Redirect Domain www.cidj.com
Redirect Base cidj.com
Domain IPs 176.31.146.49
Redirect IPs 176.31.146.49
Response IP 176.31.146.49
Found Yes
Hash 2d3a44f295e45e0f019c6bc20a2af0d732481c59f230c675a10475cc5f61003b
SimHash b4961348c764

Groups

*

Rule Path
Allow /core/*.css$
Allow /core/*.css?
Allow /core/*.js$
Allow /core/*.js?
Allow /core/*.gif
Allow /core/*.jpg
Allow /core/*.jpeg
Allow /core/*.png
Allow /core/*.svg
Allow /core/*.webp
Allow /profiles/*.css$
Allow /profiles/*.css?
Allow /profiles/*.js$
Allow /profiles/*.js?
Allow /profiles/*.gif
Allow /profiles/*.jpg
Allow /profiles/*.jpeg
Allow /profiles/*.png
Allow /profiles/*.svg
Disallow /core/
Disallow /profiles/
Disallow /README.txt
Disallow /web.config
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips
Disallow /node/add/
Disallow /search/
Disallow /user/register
Disallow /user/password
Disallow /user/login
Disallow /user/logout
Disallow /index.php/admin/
Disallow /index.php/comment/reply/
Disallow /index.php/filter/tips
Disallow /index.php/node/add/
Disallow /index.php/search/
Disallow /index.php/user/password
Disallow /index.php/user/register
Disallow /index.php/user/login
Disallow /index.php/user/logout
Disallow /se-connecter
Disallow /index.php/se-connecter
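
The `*` group above mixes Allow rules that use `*` and `$` with broader Disallow prefixes, so the result for a given path depends on how a crawler resolves the overlap. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie); individual crawlers may behave differently. The function names and the shortened RULES list are illustrative and are not part of the scan data.

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Every other character, including '?', is treated literally.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))

# A shortened, illustrative subset of the '*' group listed above.
RULES = [
    ("allow", "/core/*.css$"),
    ("allow", "/core/*.css?"),
    ("allow", "/core/*.js$"),
    ("allow", "/core/*.png"),
    ("disallow", "/core/"),
    ("disallow", "/admin/"),
    ("disallow", "/user/login"),
]

def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` may be fetched under longest-match precedence."""
    best_len = -1
    best_allow = True  # a path matched by no rule is allowed
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns act as prefixes.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_won_by_allow = len(pattern) == best_len and allow
            if longer or tie_won_by_allow:
                best_len = len(pattern)
                best_allow = allow
    return best_allow

if __name__ == "__main__":
    for sample in ("/core/misc/drupal.css", "/core/install.php", "/node/1"):
        print(sample, is_allowed(sample))
```

Under these assumptions a stylesheet below /core/ stays fetchable because `/core/*.css$` is longer than `/core/`, while other files under /core/ fall back to the Disallow rule.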

applebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

googlebot

Rule Path
Allow /

facebot

Rule Path
Allow /

slurp

Rule Path
Allow /

adequat

Rule Path
Disallow /

adequat-systems

Rule Path
Disallow /

alexibot

Rule Path
Disallow /

alvinetspider

Rule Path
Disallow /

amisoftware

Rule Path
Disallow /

antenne hatena

Rule Path
Disallow /

apocalxexplorerbot

Rule Path
Disallow /

argus

Rule Path
Disallow /

ask n read

Rule Path
Disallow /

asknread.com

Rule Path
Disallow /

asterias

Rule Path
Disallow /

augure

Rule Path
Disallow /

augure

Rule Path
Disallow /

auramundi

Rule Path
Disallow /

backdoorbot/1.0

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

bizinformation

Rule Path
Disallow /

black hole

Rule Path
Disallow /

bloodhound

Rule Path
Disallow /

blowfish/1.0

Rule Path
Disallow /

botalot

Rule Path
Disallow /

builtbottough

Rule Path
Disallow /

bullseye/1.0

Rule Path
Disallow /

bunnyslippers

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

cegbfeieh

Rule Path
Disallow /

cheesebot

Rule Path
Disallow /

cherrypicker

Rule Path
Disallow /

cherrypickerelite/1.0

Rule Path
Disallow /

cherrypickerse/1.0

Rule Path
Disallow /

cision

Rule Path
Disallow /

coexel

Rule Path
Disallow /

converacrawler

Rule Path
Disallow /

copyrightcheck

Rule Path
Disallow /

corporama

Rule Path
Disallow /

cosmos

Rule Path
Disallow /

crescent

Rule Path
Disallow /

crescent internet toolpak http ole control v.1.0

Rule Path
Disallow /

cydralspider

Rule Path
Disallow /

digimind

Rule Path
Disallow /

disco pump 3.1

Rule Path
Disallow /

dittospyder

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

downloadexpress

Rule Path
Disallow /

edd

Rule Path
Disallow /

ellisphere

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

erocrawler

Rule Path
Disallow /

europresse

Rule Path
Disallow /

explore

Rule Path
Disallow /

eureka

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

factiva

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

fetch

Rule Path
Disallow /

flamingo_searchengine

Rule Path
Disallow /

foobot

Rule Path
Disallow /

gammaspider

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

harvest/1.5

Rule Path
Disallow /

hloader

Rule Path
Disallow /

httplib

Rule Path
Disallow /

httrack

Rule Path
Disallow /

httrack 3.0

Rule Path
Disallow /

humanlinks

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

igentia

Rule Path
Disallow /

indexer

Rule Path
Disallow /

infonavirobot

Rule Path
Disallow /

infoseek

Rule Path
Disallow /

jennybot

Rule Path
Disallow /

jetbot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

kantar

Rule Path
Disallow /

kbcrawl

Rule Path
Disallow /

kenjin spider

Rule Path
Disallow /

knowings

Rule Path
Disallow /

larbin

Rule Path
Disallow /

leadbox

Rule Path
Disallow /

lexibot

Rule Path
Disallow /

libweb/clshttp

Rule Path
Disallow /

libwww

Rule Path
Disallow /

linkfluence

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

linko

Rule Path
Disallow /

linkscan/8.1a unix

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

lwp-trivial

Rule Path
Disallow /

lwp-trivial/1.34

Rule Path
Disallow /

manageo

Rule Path
Disallow /

mata hari

Rule Path
Disallow /

mediacompil

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

mention

Rule Path
Disallow /

microsoft url control - 5.01.4511

Rule Path
Disallow /

microsoft url control - 6.00.8169

Rule Path
Disallow /

miixpc

Rule Path
Disallow /

miixpc/4.2

Rule Path
Disallow /

mister pix

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

moget

Rule Path
Disallow /

moget/2.1

Rule Path
Disallow /

moreover

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

ms search 4.0 robot

Rule Path
Disallow /

ms search 5.0 robot

Rule Path
Disallow /

mytwip

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

netants

Rule Path
Disallow /

netattache

Rule Path
Disallow /

netmechanic

Rule Path
Disallow /

newscan-online

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

newzbin

Rule Path
Disallow /

nicerspro

Rule Path
Disallow /

npbot

Rule Path
Disallow /

objectssearch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

openfind

Rule Path
Disallow /

openindexspider

Rule Path
Disallow /

opinion-tracker

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

pimptrain

Rule Path
Disallow /

proxem

Rule Path
Disallow /

propowerbot/2.14

Rule Path
Disallow /

prowebwalker

Rule Path
Disallow /

psbot

Rule Path
Disallow /

quepasacreep

Rule Path
Disallow /

queryn metasearch

Rule Path
Disallow /

qwam content intelligence

Rule Path
Disallow /

raven

Rule Path
Disallow /

readability.com

Rule Path
Disallow /

repomonkey

Rule Path
Disallow /

rma

Rule Path
Disallow /

scoop.it

Rule Path
Disallow /

score3

Rule Path
Disallow /

sightupbot

Rule Path
Disallow /

sindup

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

sitesucker

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou head spider

Rule Path
Disallow /

sogou pic spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

spankbot

Rule Path
Disallow /

spanner

Rule Path
Disallow /

speedy

Rule Path
Disallow /

spotter

Rule Path
Disallow /

suggybot

Rule Path
Disallow /

superbot

Rule Path
Disallow /

superbot/2.6

Rule Path
Disallow /

suzuran

Rule Path
Disallow /

synthesio

Rule Path
Disallow /

szukacz/1.4

Rule Path
Disallow /

talkwater

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

telesoft

Rule Path
Disallow /

the intraformant

Rule Path
Disallow /

thenomad

Rule Path
Disallow /

tighttwatbot

Rule Path
Disallow /

titan

Rule Path
Disallow /

tocrawl/urldispatcher

Rule Path
Disallow /

toscrawler

Rule Path
Disallow /

trendeo

Rule Path
Disallow /

trendybuzz

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

true_robot/1.0

Rule Path
Disallow /

tunitinbot

Rule Path
Disallow /

turingos

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

up2news

Rule Path
Disallow /

urlpouls

Rule Path
Disallow /

urly warning

Rule Path
Disallow /

vecteurplus

Rule Path
Disallow /

verif

Rule Path
Disallow /

verticalsearch

Rule Path
Disallow /

vci

Rule Path
Disallow /

vsw

Rule Path
Disallow /

wapspider

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

web image collector

Rule Path
Disallow /

webauto

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webbandit/3.50

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webcopy

Rule Path
Disallow /

webenhancer

Rule Path
Disallow /

webmasterworldforumbot

Rule Path
Disallow /

webmirror

Rule Path
Disallow /

websauger

Rule Path
Disallow /

website extractor

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

website quester

Rule Path
Disallow /

webster pro

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webstripper/2.02

Rule Path
Disallow /

webzinger

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wget

Rule Path
Disallow /

wikiofeedbot

Rule Path
Disallow /

winello

Rule Path
Disallow /

winhttrack

Rule Path
Disallow /

www-collector-e

Rule Path
Disallow /

xenu link sleuth/1.3.8

Rule Path
Disallow /

yacy

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

youmag

Rule Path
Disallow /

yrspider

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

zeus

Rule Path
Disallow /

zite

Rule Path
Disallow /

zookabot

Rule Path
Disallow /

zyborg

Rule Path
Disallow /
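
Because several crawlers have their own group, the `*` group applies only to user agents that match no named group. The sketch below illustrates that selection with Python's standard `urllib.robotparser`, using a hand-written three-group excerpt in robots.txt syntax (an assumption reconstructed from the groups listed above, not the original file). The agent name `examplebot` is hypothetical. Note that `urllib.robotparser` does not interpret `*` or `$` inside paths, so the excerpt uses plain prefixes only.

```python
from urllib.robotparser import RobotFileParser

# Illustrative excerpt: one catch-all group and two named groups.
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /admin/

User-agent: googlebot
Allow: /

User-agent: baiduspider
Disallow: /
"""

def build_parser(text: str = ROBOTS_EXCERPT) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser

def may_fetch(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the excerpt above."""
    return build_parser().can_fetch(agent, path)

if __name__ == "__main__":
    for agent in ("googlebot", "baiduspider", "examplebot"):
        for path in ("/admin/", "/node/1"):
            print(agent, path, may_fetch(agent, path))
```

In this excerpt `googlebot` is governed only by its own `Allow /` group, so the `/admin/` restriction from the `*` group does not reach it, while an unlisted agent falls back to the `*` group.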

Comments

  • robots.txt
  • This file is to prevent the crawling and indexing of certain parts of your site by web crawlers and spiders run by sites like Yahoo! and Google. By telling these "robots" where not to go on your site, you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see: http://www.robotstxt.org/robotstxt.html
  • CSS, JS, Images
  • Directories
  • Files
  • Paths (clean URLs)
  • Paths (no clean URLs)
  • Allowed robots
  • Excluded robots
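
The comments above note that the file is only honoured at the root of the host. As a small sketch of that rule, the hypothetical helper below derives the robots.txt location for any page URL by keeping the scheme and host and replacing the path; it uses only the standard `urllib.parse` module.

```python
from urllib.parse import urlsplit, urlunsplit

def robots_url(page_url: str) -> str:
    """Return the root-level robots.txt URL for the host serving `page_url`."""
    parts = urlsplit(page_url)
    # Keep scheme and host; drop the original path, query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))

if __name__ == "__main__":
    print(robots_url("http://example.com/site/page.html"))
```

A file placed at http://example.com/site/robots.txt is never produced by this derivation, which matches the "Ignored" case in the comments.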

Warnings

  • 11 invalid lines.