rtl.fr
robots.txt

Robots Exclusion Standard data for rtl.fr

Resource Scan

Scan Details

Site Domain rtl.fr
Base Domain rtl.fr
Scan Status Ok
Last Scan2024-05-24T05:12:26+00:00
Next Scan 2024-05-31T05:12:26+00:00

Last Scan

Scanned2024-05-24T05:12:26+00:00
URL https://rtl.fr/robots.txt
Redirect https://www.rtl.fr/robots.txt
Redirect Domain www.rtl.fr
Redirect Base rtl.fr
Domain IPs 2a0a:1580:2000:2::e, 89.248.208.32
Redirect IPs 151.101.1.91, 151.101.129.91, 151.101.193.91, 151.101.65.91
Response IP 199.232.45.91
Found Yes
Hash 381588941ba79fe6bf4857b6bb4708f49b0829982428fbeea6d025ae122b3935
SimHash 22f8992ad5f7

Groups

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

*

Rule Path
Disallow /_header_nav_ajax.html
Disallow /emission/laurent-gerra/ecouter/*2010
Disallow /emission/laurent-gerra/ecouter/*2011
Disallow /emission/laurent-gerra/ecouter/*janvier-2012
Disallow /3338/
Disallow /recherche*
Disallow /archive/
Disallow /vote/
Disallow /partager/
Disallow /ajax/
Disallow /outils/
Disallow /recherche*
Disallow /connect/
Disallow /resultats-examens/
Disallow /widgets*
Disallow /emploi*
Disallow /code-promo*

Other Records

Field Value
sitemap https://www.rtl.fr/sitemap-news.xml
sitemap https://www.rtl.fr/sitemap.xml

Comments

  • Disable OpenAI bots
  • Disable Google AI bot