actu44.fr
robots.txt

Robots Exclusion Standard data for actu44.fr

Resource Scan

Scan Details

Site Domain actu44.fr
Base Domain actu44.fr
Scan Status Ok
Last Scan2026-03-31T23:35:30+00:00
Next Scan 2026-04-07T23:35:30+00:00

Last Scan

Scanned2026-03-31T23:35:30+00:00
URL https://actu44.fr/robots.txt
Domain IPs 109.234.162.237
Response IP 109.234.162.237
Found Yes
Hash 2ea49180782983f2a677c5d2d774f76ee9da1d5c9f4e2834bf10e681fde6eb93
SimHash 05105d404d50

Groups

gptbot

Rule Path
Allow /

oai-searchbot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

claude-web

Rule Path
Allow /

claudebot

Rule Path
Allow /

google-extended

Rule Path
Allow /

applebot-extended

Rule Path
Allow /

anthropic-ai

Rule Path
Allow /

cohere-ai

Rule Path
Allow /

amazonbot

Rule Path
Allow /

*

Rule Path
Allow /
Disallow /secupress-53fb2346/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /xmlrpc.php
Disallow /shopdetail/
Disallow /%3Ashodetail
Disallow /event/
Disallow /events/
Disallow /author/
Disallow /media/
Disallow /feed/
Disallow /trackback/
Disallow /*/*/feed/
Disallow /*/*/trackback/
Disallow /sample-page-2/
Disallow /horoscope/
Disallow /*?amp=
Disallow /*?amp=1
Disallow /*?noamp=
Disallow /*?s=
Disallow /*?fbclid=
Disallow /*?random-post=
Disallow /*?ical=
Disallow /*?outlook-ical=
Disallow /*?tribe-bar-date=
Allow /wp-content/
Allow /*.css$
Allow /*.js$
Allow /*.png$
Allow /*.jpg$
Allow /*.jpeg$
Allow /*.gif$
Allow /*.webp$
Allow /*.woff$
Allow /*.woff2$
Allow /*.ttf$

Other Records

Field Value
sitemap https://www.actu44.fr/sitemap.xml

Comments

  • ============================================
  • Actu44.fr - robots.txt
  • Mise à jour : 25 mars 2026
  • ============================================
  • ============================================
  • BOTS IA — EXPLICITEMENT AUTORISÉS
  • ============================================
  • ============================================
  • RÈGLES GÉNÉRALES
  • ============================================
  • --- Sécurité ---
  • --- Séquelles piratage (maintenu par précaution) ---
  • --- Anciens événements (plugin désactivé) ---
  • --- Pages inutiles ---
  • --- Paramètres générateurs de contenu dupliqué ---
  • --- Autoriser ressources statiques ---
  • ============================================
  • SITEMAPS
  • ============================================