media1.ledevoir.com
robots.txt

Robots Exclusion Standard data for media1.ledevoir.com

Resource Scan

Scan Details

Site Domain media1.ledevoir.com
Base Domain ledevoir.com
Scan Status Ok
Last Scan 2024-09-20T23:49:40+00:00
Next Scan 2024-09-27T23:49:40+00:00
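
The two timestamps above are exactly seven days apart, which suggests a weekly rescan. A minimal sketch of that arithmetic, assuming the interval is fixed at seven days (the report does not state the schedule, and `SCAN_INTERVAL` is a name made up for this example):

```python
from datetime import datetime, timedelta

# Assumption: the scanner revisits weekly; inferred from the two timestamps only.
SCAN_INTERVAL = timedelta(days=7)

# Timestamp copied from the "Last Scan" field of this report.
last_scan = datetime.fromisoformat("2024-09-20T23:49:40+00:00")
next_scan = last_scan + SCAN_INTERVAL

print(next_scan.isoformat())
```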

Last Scan

Scanned 2024-09-20T23:49:40+00:00
URL https://media1.ledevoir.com/robots.txt
Domain IPs 151.101.130.132, 151.101.194.132, 151.101.2.132, 151.101.66.132
Response IP 199.232.46.132
Found Yes
Hash 27b25016219e39fc077cd0b3305421d1f45326cc52dee90b42834c6158e77992
SimHash e859153fe3e1
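
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. A minimal sketch of how such a fingerprint could be used to detect a changed robots.txt between scans, under that assumption (`fingerprint` and `has_changed` are hypothetical helpers, not part of the scanner):

```python
import hashlib

def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a fetched robots.txt body."""
    return hashlib.sha256(body).hexdigest()

def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a newly fetched body."""
    return fingerprint(body) != previous_hash

# Illustrative content only, not the real file served by the site.
old_body = b"User-agent: *\nDisallow: /recherche*\n"
new_body = old_body + b"Crawl-delay: 5\n"

stored = fingerprint(old_body)
print(has_changed(stored, old_body))
print(has_changed(stored, new_body))
```

An exact hash changes on any byte difference. The SimHash field presumably serves the complementary purpose of spotting near-identical files, but its parameters are not given here.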

Groups

*

Rule Path
Disallow /recherche*
Disallow /article-favori-modifier/

Other Records

Field Value
crawl-delay 5
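
The crawl-delay field is not part of the core Robots Exclusion Protocol (RFC 9309). Crawlers that honor it commonly read the value as a number of seconds to wait between requests. A minimal sketch of spacing requests that way, assuming that interpretation (the paths and the `fetch_schedule` helper are hypothetical):

```python
from datetime import datetime, timedelta

CRAWL_DELAY_SECONDS = 5  # value from the crawl-delay record above

def fetch_schedule(paths, start, delay=CRAWL_DELAY_SECONDS):
    """Pair each path with the earliest time it should be requested."""
    return [(path, start + timedelta(seconds=i * delay))
            for i, path in enumerate(paths)]

# Hypothetical paths, used only to show the spacing.
start = datetime(2024, 9, 20, 12, 0, 0)
plan = fetch_schedule(["/a.jpg", "/b.jpg", "/c.jpg"], start)

for path, when in plan:
    print(when.time(), path)
```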

ahrefsbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

archive-it

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claude

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

gnowit

Rule Path
Disallow /

gnowitnewsbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

linkfluence

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

mediatoolkit.com

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

muckrack

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

openai

Rule Path
Disallow /

otmedia

Rule Path
Disallow /

perplexity

Rule Path
Disallow /

scoopit

Rule Path
Disallow /

scpitspi-rs

Rule Path
Disallow /

semantic-visions.com

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

trendiction

Rule Path
Disallow /

turnitin

Rule Path
Disallow /

youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.ledevoir.com/sitemap.xml
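
The groups above can be read as follows: a crawler uses the group that names it, and falls back to the `*` group otherwise. A minimal sketch of that lookup, using a subset of the listed groups and a small hand-written matcher for the `*` wildcard in paths. This is a simplification and not a full RFC 9309 implementation: it handles only Disallow rules and matches user agents by exact lowercase token. `GROUPS`, `pattern_to_regex` and `is_allowed` are names made up for this example.

```python
import re

# Subset of the groups listed in this report; rule paths copied from the scan.
GROUPS = {
    "*": ["/recherche*", "/article-favori-modifier/"],
    "gptbot": ["/"],
    "claudebot": ["/"],
}

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex matched from the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # '*' matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(user_agent: str, path: str) -> bool:
    """Return False if any Disallow rule of the applicable group matches the path."""
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    return not any(pattern_to_regex(rule).match(path) for rule in rules)

print(is_allowed("GPTBot", "/societe/1"))
print(is_allowed("SomeCrawler", "/recherche?q=climat"))
print(is_allowed("SomeCrawler", "/societe/1"))
```

The matcher is written by hand because, as far as I know, Python's standard `urllib.robotparser` performs plain prefix matching and would not treat the `*` in `/recherche*` as a wildcard.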

Comments

  • Le Devoir content is made available under our terms and conditions of use.
  • Any other use is not permitted, including but not limited to: large language models (LLMs), machine learning and/or artificial intelligence-related activities.
  • Contact droits@ledevoir.com for assistance.