editorialink.fr
robots.txt

Robots Exclusion Standard data for editorialink.fr

Resource Scan

Scan Details

Site Domain editorialink.fr
Base Domain editorialink.fr
Scan Status Ok
Last Scan 2025-12-02T03:26:13+00:00
Next Scan 2026-01-01T03:26:13+00:00

Last Scan

Scanned 2025-12-02T03:26:13+00:00
URL https://editorialink.fr/robots.txt
Domain IPs 18.119.18.18
Response IP 18.119.18.18
Found Yes
Hash 170697b27bd5e99a4c23d899aad1d8cd9b28acba85ebc7fdaa00bf931514da84
SimHash ed3dfae102b0

Groups

*

Rule Path
Allow /
Disallow /wp-admin/
Disallow /login/
Disallow /admin/
Disallow /cart/
Disallow /checkout/
Disallow /thank-you/
Disallow /*.php$
Disallow /*?*session=
Disallow /*?*sort=
Disallow /*?*filter=
Disallow /*?*add-to-cart=
Allow /wp-admin/admin-ajax.php
Allow /wp-content/uploads/
Allow /assets/
Allow /css/
Allow /js/
Allow /images/
Allow /_next/static/
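
The `*` group mixes plain prefixes with wildcard patterns (`*`, and `$` as an end anchor), so the order of the rows does not decide the outcome by itself. The following is a minimal Python sketch of how a crawler following RFC 9309 might evaluate these rules: the longest matching pattern wins, and a tie goes to Allow. The names `pattern_to_regex`, `RULES`, and `is_allowed` are hypothetical helpers written for this illustration; they are not part of the scanner or of any library, and real crawlers may differ in details such as percent-encoding.

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

# Rules of the "*" group, copied from the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/login/"),
    ("disallow", "/admin/"),
    ("disallow", "/cart/"),
    ("disallow", "/checkout/"),
    ("disallow", "/thank-you/"),
    ("disallow", "/*.php$"),
    ("disallow", "/*?*session="),
    ("disallow", "/*?*sort="),
    ("disallow", "/*?*filter="),
    ("disallow", "/*?*add-to-cart="),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/wp-content/uploads/"),
    ("allow", "/assets/"),
    ("allow", "/css/"),
    ("allow", "/js/"),
    ("allow", "/images/"),
    ("allow", "/_next/static/"),
]

def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` (including any query string) may be fetched."""
    best = None  # (pattern length, is_allow); tuple order breaks ties toward allow
    for verb, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), verb == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule at all means the path is allowed.
    return True if best is None else best[1]
```

Under this sketch, `/wp-admin/admin-ajax.php` stays fetchable because its Allow pattern is longer than both `/wp-admin/` and `/*.php$`, while `/wp-admin/options.php` and `/shop?sort=price` are blocked.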

gptbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

google-extended

Rule Path
Allow /

microsoft-extended

Rule Path
Allow /

claudebot

Rule Path
Allow /

claude-user

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

perplexity-user

Rule Path
Allow /

meta-externalagent

Rule Path
Allow /

ccbot

Rule Path
Allow /

amazonbot

Rule Path
Allow /

applebot-extended

Rule Path
Allow /
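
Each named group above contains only `Allow /`. As I read RFC 9309, a crawler applies the rules of the group that matches its own product token and falls back to `*` only when no named group matches; the groups are not merged. The sketch below illustrates that selection in Python. `GROUPS` and `select_group` are hypothetical names for this example, the `*` group is abbreviated to three of its rules, and only two of the named groups are shown.

```python
# Groups as listed in the scan (abbreviated; see the tables above for the full data).
GROUPS = {
    "*": [("allow", "/"), ("disallow", "/wp-admin/"), ("disallow", "/cart/")],
    "gptbot": [("allow", "/")],
    "claudebot": [("allow", "/")],
}

def select_group(product_token: str, groups=GROUPS):
    """Return the rules of the group named after the crawler, else the '*' group.

    Group names are compared case-insensitively.
    """
    return groups.get(product_token.lower(), groups.get("*", []))
```

With this selection, `select_group("GPTBot")` yields only `Allow /`, so the Disallow rows of the `*` group would not apply to the listed AI crawlers unless they are repeated inside each named group.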

Other Records

Field Value
sitemap https://editorialink.fr/sitemap.xml

Comments

  • Editorialink robots.txt: open indexing + minimal blocking of sensitive areas
  • Date: 2025-10-28
  • Default rules
  • Private areas / areas not useful to the index
  • Do NOT block all parameters. Target only the noise:
  • Front-end rendering (leave the assets open)
  • Explicit permission for AI/LLM bots
  • Sitemap