resultat-bac.linternaute.com
robots.txt

Robots Exclusion Standard data for resultat-bac.linternaute.com

Resource Scan

Scan Details

Site Domain resultat-bac.linternaute.com
Base Domain linternaute.com
Scan Status Ok
Last Scan2024-11-10T08:29:16+00:00
Next Scan 2024-11-17T08:29:16+00:00

Last Scan

Scanned2024-11-10T08:29:16+00:00
URL https://resultat-bac.linternaute.com/robots.txt
Domain IPs 118.215.80.128
Response IP 118.215.80.128
Found Yes
Hash 5c4e276e426eb80dac671f89a49c08bf69727579168860ca561c2d685de8835a
SimHash cd1483b07cdb

Groups

mediapartners-google*

Rule Path
Disallow

*

Rule Path
Disallow /*?print
Disallow /*xhr
Disallow /recherche/
Disallow /*?candidate-label=
Disallow /academie-*/candidat-
Disallow /candidat
Disallow /candidat-*

trendkite-akashic-crawler

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://resultat-bac.linternaute.com/sitemap/

Comments

  • resultat-bac.linternaute.com
  • Block https://opensiteexplorer.org/dotbot
  • Block http://ahrefs.com/robot/
  • Block https://dataforseo.com/dataforseo-bot
  • Block https://www.semrush.com/bot/