24meteo.fr
robots.txt

Robots Exclusion Standard data for 24meteo.fr

Resource Scan

Scan Details

Site Domain 24meteo.fr
Base Domain 24meteo.fr
Scan Status Ok
Last Scan2024-09-30T00:30:59+00:00
Next Scan 2024-10-07T00:30:59+00:00

Last Scan

Scanned2024-09-30T00:30:59+00:00
URL https://24meteo.fr/robots.txt
Domain IPs 46.105.145.216
Response IP 46.105.145.216
Found Yes
Hash 5e47ff90ef8a00c05437318866973d4645c54765fecdf72b96862998a57d1549
SimHash 695453c087a9

Groups

googlebot

Rule Path
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz$
Disallow /*.swf$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$
Disallow /wp-content/uploads/*

google favicon

Rule Path
Disallow /wp-content/uploads/*

googlebot-image
googlebot-video

Rule Path
Disallow /wp-content/uploads/*

google-read-aloud

Rule Path
Allow /*

google-read-aloud
googlebot-news
google-speakr
duplexweb-google
googleweblight

Rule Path
Allow /*

adsbot-google-mobile
adsbot-google
mediapartners-google

Rule Path
Allow /*

bingbot
yahoo! slurp
teoma
baiduspider
yandex
applebot
exabot
ia_archiver
qwantify
wikiwix
duckduckbot
pinterest
coccocbot
coccocbot-web
yeti
sogou web spider
sogou
seekbot
seekport
seekport crawler
linguee
deusu
turnitinbot

Rule Path
Allow /*

*

Rule Path
Disallow /wp-content/uploads/*

mj12bot
orthogaffe
ubicrawler
doc
zao
sitecheck.internetseer.com
zealbot
msiecrawler
sitesnagger
webstripper
webcopier
fetch
offline explorer
teleport
teleportpro
webzip
linko
httrack
microsoft.url.control
xenu
larbin
libwww
zyborg
download ninja
fast
wget
npbot
webreaper
mojeekbot
cliqzbot
istellabot
psbot
coccocbot-image
spbot
proximic
bizinformation
blexbot
riddler
ltx71
magpie-crawler
grapeshot
grapeshotcrawler
gigablastopensource
bubing
linkdexbot
linkdexbot/2.2
seokicks
seokicks-robot
panscient.com
webdatastats
zoominfobot
ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.24meteo.fr/sitemap_index.xml

Comments

  • On empeche l'indexation des fichiers sensibles
  • Bloquer Google favicon
  • Bloquer Google Image, vidéo
  • Autoriser Google Recherche vocale (?)
  • Autoriser Google Recherche contenu (?), News
  • Autoriser Google mobile ads et Google ads, media partner
  • Autoriser Bing, Yahoo, Ask, Baidu, Yandex, Apple, Exalead, Alexa, Qwant, Wikipedia,DuckDuckBot
  • Pour les autres robots
  • spamming bot, badbot et robots trop gourmands
  • On indique au spider le lien vers notre sitemap
  • @@@@@@@@@@@&
  • @/ .@@@@(.&@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@# (
  • (.%,, **% .@@@, %@@@* ,@@@@@@@@@@@@ *
  • &@@ ,@@, &@@& @@@@@ %

Warnings

  • 28 invalid lines.