madintouch.com
robots.txt

Robots Exclusion Standard data for madintouch.com

Resource Scan

Scan Details

Site Domain madintouch.com
Base Domain madintouch.com
Scan Status Ok
Last Scan 2025-04-14T10:33:08+00:00
Next Scan 2025-04-21T10:33:08+00:00

Last Scan

Scanned 2025-04-14T10:33:08+00:00
URL https://madintouch.com/robots.txt
Redirect https://www.madintouch.com/robots.txt
Redirect Domain www.madintouch.com
Redirect Base madintouch.com
Domain IPs 104.21.91.192, 172.67.178.94, 2606:4700:3034::6815:5bc0, 2606:4700:3036::ac43:b25e
Redirect IPs 104.21.91.192, 172.67.178.94, 2606:4700:3034::6815:5bc0, 2606:4700:3036::ac43:b25e
Response IP 172.67.178.94
Found Yes
Hash 2361eb63d896c3c15f3c03791c49a2a134df783d6672d479672fdcf12fbd83b9
SimHash 332458c0e2b0

Groups

*

Rule Path
Disallow /search/?q=
Allow /
Disallow /wp-content/plugins/ai-engine/
Disallow /wp-content/uploads/ai-engine/
Disallow /chatbot.js
Disallow /wp-json/mwai/
Disallow /wp-json/ai-engine/
Disallow /wp-json/mwai/chatbot/
Disallow /chatbot/
Disallow /*?ver=
Disallow /*chatbot%3D
Disallow /cdn-cgi/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-login.php
Disallow /wp-register.php
Disallow /wp-signup.php
Disallow /xmlrpc.php
Disallow /wp-config.php
Disallow /wp-config-sample.php
Disallow /readme.html
Disallow /license.txt
Disallow /wp-content/*.log$
Disallow /dating/
Disallow /wp-content/cache/
Disallow /wp-content/wp-rocket-config/
Disallow /wp-content/plugins/wp-rocket/
Disallow /wp-content/advanced-cache.php
Disallow /*?author=*
Disallow /page/*?*
Disallow /search/
Disallow */feed/
Disallow */trackback/
Disallow */comments/
Disallow /wp-content/debug.log
Disallow /wp-content/uploads/wpo-backup/
Disallow /*.sql$
Disallow /*.gz$
Disallow /wp-content/updraft/
Disallow /wp-content/uploads/backups/
Disallow /wp-json/
Allow /wp-json/wp/v2/posts
Allow /wp-json/wp/v2/pages
Allow /wp-json/wp/v2/categories
Allow /wp-json/wp/v2/tags
Allow /wp-content/uploads/
Allow /wp-content/themes/*/assets/
Allow /wp-includes/js/
Allow /wp-includes/css/
Allow /*.css$
Allow /*.js$
Allow /*.jpg$
Allow /*.jpeg$
Allow /*.png$
Allow /*.gif$
Allow /*.webp$
Allow /*.avif$
Allow /*.svg$
Allow /*.pdf$
Allow /*?utm_source=*
Allow /*?utm_medium=*
Allow /*?utm_campaign=*
Allow /*?p=*
Allow /*?page_id=*
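
The `*` group above mixes Allow and Disallow rules with `*` and `$` wildcards, so the order of the lines alone does not decide the outcome. The following is a minimal sketch of how a crawler following RFC 9309 might evaluate them, assuming the longest matching pattern wins and Allow wins ties. The names `pattern_to_regex`, `is_allowed`, and `RULES` are hypothetical, only a subset of the rules listed above is included, and percent-encoding normalization is left out.

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(path: str, rules: list[tuple[str, str]]) -> bool:
    """Return True if `path` may be fetched under `rules`.

    Assumption: the longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow

# Subset of the `*` group shown above (illustrative, not the full file).
RULES = [
    ("allow", "/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/wp-json/"),
    ("allow", "/wp-json/wp/v2/posts"),
    ("disallow", "/wp-content/cache/"),
    ("disallow", "/*?ver="),
    ("allow", "/*.css$"),
]

print(is_allowed("/wp-admin/admin-ajax.php", RULES))     # True
print(is_allowed("/wp-json/mwai/", RULES))               # False
print(is_allowed("/wp-content/cache/min/a.css", RULES))  # False
```

Under these assumptions, `Allow /*.css$` does not reopen stylesheets inside `/wp-content/cache/`, because the Disallow pattern for that directory is longer than the Allow pattern.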

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

ahrefsbot

Rule Path
Allow /

semrushbot

Rule Path
Allow /

blexbot

Rule Path
Allow /

screaming frog seo spider

Rule Path
Allow /

sitebulb

Rule Path
Allow /

searchatlas

Rule Path
Allow /

gptbot

Rule Path
Allow /

claudebot

Rule Path
Allow /

anthropic-ai

Rule Path
Allow /

yandexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sogou

Rule Path
Disallow /

yandeximages

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

dotbot

Rule Path
Disallow /
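
Because each named crawler above has its own group, a crawler that finds a group for its own product token follows that group and ignores the `*` group. The sketch below illustrates that selection step, assuming a simple case-insensitive comparison of the whole token. `select_group` and the labels it returns are hypothetical names for illustration, and real crawlers may match the token differently.

```python
# Groups from the scan above, keyed by the policy they carry.
ALLOW_ALL = {
    "googlebot", "bingbot", "ahrefsbot", "semrushbot", "blexbot",
    "screaming frog seo spider", "sitebulb", "searchatlas",
    "gptbot", "claudebot", "anthropic-ai",
}
BLOCK_ALL = {
    "yandexbot", "mj12bot", "petalbot", "bytespider", "sogou",
    "yandeximages", "xovibot", "backlinkcrawler", "megaindex", "dotbot",
}

def select_group(product_token: str) -> str:
    """Pick the group a crawler would follow (hypothetical labels).

    Assumption: tokens are compared case-insensitively and as a whole;
    a crawler with no group of its own falls back to the `*` group.
    """
    token = product_token.lower()
    if token in ALLOW_ALL:
        return "allow-all"   # group contains only `Allow /`
    if token in BLOCK_ALL:
        return "block-all"   # group contains only `Disallow /`
    return "default"         # the `*` group with the detailed rules

for name in ("Googlebot", "DotBot", "DuckDuckBot"):
    print(name, "->", select_group(name))
```

One consequence worth noting: the detailed Disallow rules in the `*` group apply only to crawlers that land in the fallback case, not to the crawlers whose own group is just `Allow /`.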

Other Records

Field Value
sitemap https://www.madintouch.com/sitemap_index.xml

Comments

  • robots.txt file optimized for Madintouch.com
  • Last updated: 09/03/2025
  • Default configuration for all robots
  • Protect WordPress core files and directories
  • Protect log files but allow other .txt files
  • Block the /dating/ section
  • Block WP Rocket cache folders
  • Block search pages, pagination, and scraping
  • Block sensitive files
  • REST API - allow only public endpoints
  • Explicitly allow access to assets
  • Allowed parameters (clean URLs)
  • Main search engines
  • Other legitimate bots
  • AI bots
  • Block malicious bots
  • Sitemap - canonical URL using sitemap_index.xml

Warnings

  • 2 invalid lines.