dusportif.fr
robots.txt

Robots Exclusion Standard data for dusportif.fr

Resource Scan

Scan Details

Site Domain dusportif.fr
Base Domain dusportif.fr
Scan Status Ok
Last Scan2024-11-02T19:42:58+00:00
Next Scan 2024-11-09T19:42:58+00:00

Last Scan

Scanned2024-11-02T19:42:58+00:00
URL https://dusportif.fr/robots.txt
Redirect https://www.dusportif.fr/robots.txt
Redirect Domain www.dusportif.fr
Redirect Base dusportif.fr
Domain IPs 46.105.120.31
Redirect IPs 46.105.120.31
Response IP 46.105.120.31
Found Yes
Hash c2ecbff1ec91b22f9c005ab62b4f6691d783f83bc7ed0fc7373113a00efc2ac4
SimHash 0561d5b56191

Groups

ia_archiver

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /

duckduckbot
feedfetcher-google
googlebot
mediapartners-google
googlebot-mobile
googlebot-image
googlebot-pagespeed
exabot
bingbot
msnbot
facebookexternalhit
slurp
qwantify

Rule Path
Disallow /a-propos
Disallow /a_propos
Disallow /lapage
Disallow /contact
Disallow /annoncer
Disallow /recherche
Disallow /documents/
Disallow /documents/*
Disallow /documents*
Disallow /soumettre*
Disallow /maj_epreuve*
Disallow /confirmation*
Disallow /voir_mel
Disallow /plugins/
Disallow /templates/
Disallow /resultats/
Disallow /no-content/
Disallow /thumbs/
Disallow /thumbs/*
Disallow /mel/
Disallow /go-www
Disallow /vers/
Disallow /*.pdf

Comments

  • debut filtrage robots
  • fin filtrage robots