www.calendrier.dusportif.fr
robots.txt

Robots Exclusion Standard data for www.calendrier.dusportif.fr

Resource Scan

Scan Details

Site Domain www.calendrier.dusportif.fr
Base Domain dusportif.fr
Scan Status Ok
Last Scan2024-11-05T12:58:18+00:00
Next Scan 2024-12-05T12:58:18+00:00

Last Scan

Scanned2024-11-05T12:58:18+00:00
URL https://www.calendrier.dusportif.fr/robots.txt
Domain IPs 46.105.120.31
Response IP 46.105.120.31
Found Yes
Hash 18b4c61d8c308d7247609ef1b4d290cd365308d3f3322d2cd9243285a8794f37
SimHash 2966d59771d1

Groups

ia_archiver

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

*

Rule Path
Disallow /

duckduckbot
feedfetcher-google
googlebot
mediapartners-google
googlebot-mobile
googlebot-image
googlebot-pagespeed
exabot
bingbot
msnbot
facebookexternalhit
slurp
qwantify

Rule Path
Disallow /*.pdf
Disallow /a-propos
Disallow /a_propos
Disallow /annoncer
Disallow /confirmation*
Disallow /documents*
Disallow /documents/
Disallow /documents/*
Disallow /go-www
Disallow /lapage
Disallow /maj_epreuve*
Disallow /mel/
Disallow /mel/*
Disallow /mel*
Disallow /resultats/
Disallow /resultats/*
Disallow /resultats*
Disallow /no-content/
Disallow /photos*
Disallow /photos/
Disallow /photos/*
Disallow /plugins/
Disallow /recherche
Disallow /scripts/
Disallow /soumettre*
Disallow /templates/
Disallow /vers/*
Disallow /vers/
Disallow /voir_mel

Comments

  • debut filtrage robots
  • fin filtrage robots