ijbox.fr
robots.txt

Robots Exclusion Standard data for ijbox.fr

Resource Scan

Scan Details

Site Domain ijbox.fr
Base Domain ijbox.fr
Scan Status Ok
Last Scan2024-11-01T05:16:10+00:00
Next Scan 2024-12-01T05:16:10+00:00

Last Scan

Scanned2024-11-01T05:16:10+00:00
URL https://ijbox.fr/robots.txt
Redirect https://www.ijbox.fr/robots.txt
Redirect Domain www.ijbox.fr
Redirect Base ijbox.fr
Domain IPs 176.31.146.53
Redirect IPs 176.31.146.53
Response IP 176.31.146.53
Found Yes
Hash 29cd9a57b55a8877f0d5991572b6fadc2070dbb2fdbb3858478d37f86b8613af
SimHash b4961308c764

Groups

*

Rule Path
Allow /core/*.css$
Allow /core/*.css?
Allow /core/*.js$
Allow /core/*.js?
Allow /core/*.gif
Allow /core/*.jpg
Allow /core/*.jpeg
Allow /core/*.png
Allow /core/*.svg
Allow /profiles/*.css$
Allow /profiles/*.css?
Allow /profiles/*.js$
Allow /profiles/*.js?
Allow /profiles/*.gif
Allow /profiles/*.jpg
Allow /profiles/*.jpeg
Allow /profiles/*.png
Allow /profiles/*.svg
Disallow /core/
Disallow /profiles/
Disallow /README.txt
Disallow /web.config
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips
Disallow /node/add/
Disallow /search/
Disallow /user/register
Disallow /user/password
Disallow /user/login
Disallow /user/logout
Disallow /index.php/admin/
Disallow /index.php/comment/reply/
Disallow /index.php/filter/tips
Disallow /index.php/node/add/
Disallow /index.php/search/
Disallow /index.php/user/password
Disallow /index.php/user/register
Disallow /index.php/user/login
Disallow /index.php/user/logout
Disallow /se-connecter
Disallow /index.php/se-connecter

applebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

googlebot

Rule Path
Allow /

facebot

Rule Path
Allow /

slurp

Rule Path
Allow /

adequat

Rule Path
Disallow /

adequat-systems

Rule Path
Disallow /

alexibot

Rule Path
Disallow /

alvinetspider

Rule Path
Disallow /

amisoftware

Rule Path
Disallow /

antenne hatena

Rule Path
Disallow /

apocalxexplorerbot

Rule Path
Disallow /

argus

Rule Path
Disallow /

ask n read

Rule Path
Disallow /

asknread.com

Rule Path
Disallow /

asterias

Rule Path
Disallow /

augure

Rule Path
Disallow /

augure

Rule Path
Disallow /

auramundi

Rule Path
Disallow /

backdoorbot/1.0

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

bizinformation

Rule Path
Disallow /

black hole

Rule Path
Disallow /

bloodhound

Rule Path
Disallow /

blowfish/1.0

Rule Path
Disallow /

botalot

Rule Path
Disallow /

builtbottough

Rule Path
Disallow /

bullseye/1.0

Rule Path
Disallow /

bunnyslippers

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

cegbfeieh

Rule Path
Disallow /

cheesebot

Rule Path
Disallow /

cherrypicker

Rule Path
Disallow /

cherrypickerelite/1.0

Rule Path
Disallow /

cherrypickerse/1.0

Rule Path
Disallow /

cision

Rule Path
Disallow /

coexel

Rule Path
Disallow /

converacrawler

Rule Path
Disallow /

copyrightcheck

Rule Path
Disallow /

corporama

Rule Path
Disallow /

cosmos

Rule Path
Disallow /

crescent

Rule Path
Disallow /

crescent internet toolpak http ole control v.1.0

Rule Path
Disallow /

cydralspider

Rule Path
Disallow /

digimind

Rule Path
Disallow /

disco pump 3.1

Rule Path
Disallow /

dittospyder

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

downloadexpress

Rule Path
Disallow /

edd

Rule Path
Disallow /

ellisphere

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

erocrawler

Rule Path
Disallow /

europresse

Rule Path
Disallow /

explore

Rule Path
Disallow /

eureka

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

factiva

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

fetch

Rule Path
Disallow /

flamingo_searchengine

Rule Path
Disallow /

foobot

Rule Path
Disallow /

gammaspider

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

harvest/1.5

Rule Path
Disallow /

hloader

Rule Path
Disallow /

httplib

Rule Path
Disallow /

httrack

Rule Path
Disallow /

httrack 3.0

Rule Path
Disallow /

humanlinks

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

igentia

Rule Path
Disallow /

indexer

Rule Path
Disallow /

infonavirobot

Rule Path
Disallow /

infoseek

Rule Path
Disallow /

jennybot

Rule Path
Disallow /

jetbot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

kantar

Rule Path
Disallow /

kbcrawl

Rule Path
Disallow /

kenjin spider

Rule Path
Disallow /

knowings

Rule Path
Disallow /

larbin

Rule Path
Disallow /

leadbox

Rule Path
Disallow /

lexibot

Rule Path
Disallow /

libweb/clshttp

Rule Path
Disallow /

libwww

Rule Path
Disallow /

linkfluence

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

linko

Rule Path
Disallow /

linkscan/8.1a unix

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

lwp-trivial

Rule Path
Disallow /

lwp-trivial/1.34

Rule Path
Disallow /

manageo

Rule Path
Disallow /

mata hari

Rule Path
Disallow /

mediacompil

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

mention

Rule Path
Disallow /

microsoft url control - 5.01.4511

Rule Path
Disallow /

microsoft url control - 6.00.8169

Rule Path
Disallow /

miixpc

Rule Path
Disallow /

miixpc/4.2

Rule Path
Disallow /

mister pix

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

moget

Rule Path
Disallow /

moget/2.1

Rule Path
Disallow /

moreover

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

ms search 4.0 robot

Rule Path
Disallow /

ms search 5.0 robot

Rule Path
Disallow /

mytwip

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

netants

Rule Path
Disallow /

netattache

Rule Path
Disallow /

netmechanic

Rule Path
Disallow /

newscan-online

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

newzbin

Rule Path
Disallow /

nicerspro

Rule Path
Disallow /

npbot

Rule Path
Disallow /

objectssearch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

openfind

Rule Path
Disallow /

openindexspider

Rule Path
Disallow /

opinion-tracker

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

pimptrain

Rule Path
Disallow /

proxem

Rule Path
Disallow /

propowerbot/2.14

Rule Path
Disallow /

prowebwalker

Rule Path
Disallow /

psbot

Rule Path
Disallow /

quepasacreep

Rule Path
Disallow /

queryn metasearch

Rule Path
Disallow /

qwam content intelligence

Rule Path
Disallow /

raven

Rule Path
Disallow /

readability.com

Rule Path
Disallow /

repomonkey

Rule Path
Disallow /

rma

Rule Path
Disallow /

scoop.it

Rule Path
Disallow /

score3

Rule Path
Disallow /

sightupbot

Rule Path
Disallow /

sindup

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

sitesucker

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou head spider

Rule Path
Disallow /

sogou pic spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

spankbot

Rule Path
Disallow /

spanner

Rule Path
Disallow /

speedy

Rule Path
Disallow /

spotter

Rule Path
Disallow /

suggybot

Rule Path
Disallow /

superbot

Rule Path
Disallow /

superbot/2.6

Rule Path
Disallow /

suzuran

Rule Path
Disallow /

synthesio

Rule Path
Disallow /

szukacz/1.4

Rule Path
Disallow /

talkwater

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

telesoft

Rule Path
Disallow /

the intraformant

Rule Path
Disallow /

thenomad

Rule Path
Disallow /

tighttwatbot

Rule Path
Disallow /

titan

Rule Path
Disallow /

tocrawl/urldispatcher

Rule Path
Disallow /

toscrawler

Rule Path
Disallow /

trendeo

Rule Path
Disallow /

trendybuzz

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

true_robot/1.0

Rule Path
Disallow /

tunitinbot

Rule Path
Disallow /

turingos

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

up2news

Rule Path
Disallow /

urlpouls

Rule Path
Disallow /

urly warning

Rule Path
Disallow /

vecteurplus

Rule Path
Disallow /

verif

Rule Path
Disallow /

verticalsearch

Rule Path
Disallow /

vci

Rule Path
Disallow /

vsw

Rule Path
Disallow /

wapspider

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

web image collector

Rule Path
Disallow /

webauto

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webbandit/3.50

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webcopy

Rule Path
Disallow /

webenhancer

Rule Path
Disallow /

webmasterworldforumbot

Rule Path
Disallow /

webmirror

Rule Path
Disallow /

websauger

Rule Path
Disallow /

website extractor

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

website quester

Rule Path
Disallow /

webster pro

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webstripper/2.02

Rule Path
Disallow /

webzinger

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wget

Rule Path
Disallow /

wikiofeedbot

Rule Path
Disallow /

winello

Rule Path
Disallow /

winhttrack

Rule Path
Disallow /

www-collector-e

Rule Path
Disallow /

xenu link sleuth/1.3.8

Rule Path
Disallow /

yacy

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

youmag

Rule Path
Disallow /

yrspider

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

zeus

Rule Path
Disallow /

zite

Rule Path
Disallow /

zookabot

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

Comments

  • robots.txt
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/robotstxt.html
  • CSS, JS, Images
  • Directories
  • Disallow: /sites/*/files/
  • Files
  • Paths (clean URLs)
  • Paths (no clean URLs)
  • Robots admis
  • Robots exclus

Warnings

  • 6 invalid lines.