coeurmarseillais.fr
robots.txt

Robots Exclusion Standard data for coeurmarseillais.fr

Resource Scan

Scan Details

Site Domain coeurmarseillais.fr
Base Domain coeurmarseillais.fr
Scan Status Ok
Last Scan2024-11-11T03:22:34+00:00
Next Scan 2024-11-18T03:22:34+00:00

Last Scan

Scanned2024-11-11T03:22:34+00:00
URL https://coeurmarseillais.fr/robots.txt
Redirect https://www.coeurmarseillais.fr/robots.txt
Redirect Domain www.coeurmarseillais.fr
Redirect Base coeurmarseillais.fr
Domain IPs 104.21.34.24, 172.67.196.187, 2606:4700:3031::ac43:c4bb, 2606:4700:3037::6815:2218
Redirect IPs 104.21.34.24, 172.67.196.187, 2606:4700:3031::ac43:c4bb, 2606:4700:3037::6815:2218
Response IP 172.67.196.187
Found Yes
Hash 90d0d9c0ff108bc0beb20102655d85429a31a148ddfc304e248918ea27a74b1e
SimHash 54174d450223

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz$
Disallow /*.cgi$

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

Comments

  • On élimine ce répertoire sensible présent sur certains serveurs
  • On désindexe tous les fichiers qui n'ont pas lieu de l'être