jho.fr
robots.txt

Robots Exclusion Standard data for jho.fr

Resource Scan

Scan Details

Site Domain jho.fr
Base Domain jho.fr
Scan Status Ok
Last Scan2024-06-11T06:14:19+00:00
Next Scan 2024-07-11T06:14:19+00:00

Last Scan

Scanned2024-06-11T06:14:19+00:00
URL https://jho.fr/robots.txt
Redirect https://www.jho.fr/robots.txt
Redirect Domain www.jho.fr
Redirect Base jho.fr
Domain IPs 75.2.60.5
Redirect IPs 13.251.96.10, 18.139.194.139, 2406:da18:880:3800::c8, 2406:da18:b3d:e202::64
Response IP 18.139.194.139
Found Yes
Hash be5fd08d54f49af181d4011e694e4614070626be9e894d8af428d5a3b51d0e25
SimHash 1259f670c5a2

Groups

*

Rule Path
Disallow /checkout/
Disallow /lp/ong/
Disallow /lp/fu-sport/
Disallow /lp/abo/
Disallow /lp/culotte/
Disallow /lp/nuitplus-longue-nuit/
Disallow /lp/nuitplus-vraiment-longues/
Disallow /lp/nuitplus-regles-abondantes/
Disallow /carte-cadeau/panier/
Disallow /carte-cadeau/confirmation/
Disallow /carte-cadeau/login/

surveybot

Rule Path
Disallow /

mediapartners-google*

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

alexibot

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

xenu’s

Rule Path
Disallow /

xenu’s link sleuth 1.1c

Rule Path
Disallow /

admantx

Rule Path
Disallow /

archive-org.com

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

betabot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

cmscrawler

Rule Path
Disallow /

contextad bot

Rule Path
Disallow /

cognitiveseo

Rule Path
Disallow /

crystalsemantics

Rule Path
Disallow /

domainoptima

Rule Path
Disallow /

domainsigma

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

exabot

Rule Path
Disallow /

fr-crawler

Rule Path
Disallow /

genieo

Rule Path
Disallow /

golden-praga

Rule Path
Disallow /

httpclient

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

james bot

Rule Path
Disallow /

leikibot

Rule Path
Disallow /

libcurl

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

lipperhey

Rule Path
Disallow /

livelap

Rule Path
Disallow /

lssrocket

Rule Path
Disallow /

magpie

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

memorybot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

nerdybot

Rule Path
Disallow /

netseer

Rule Path
Disallow /

nutch

Rule Path
Disallow /

pleasebot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

safesearch

Rule Path
Disallow /

semalt

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

stackoverflow

Rule Path
Disallow /

riddler

Rule Path
Disallow /

ru_bot

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

showyoubot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

spiderbot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

tineye

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

twitter

Rule Path
Disallow /

twittmemebot

Rule Path
Disallow /

umbot

Rule Path
Disallow /

voilabot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.jho.fr/sitemap.xml

Comments

  • Blocage de certains bots inutiles

Warnings

  • 4 invalid lines.