breat.fr
robots.txt

Robots Exclusion Standard data for breat.fr

Resource Scan

Scan Details

Site Domain breat.fr
Base Domain breat.fr
Scan Status Ok
Last Scan2024-09-07T11:54:44+00:00
Next Scan 2024-10-07T11:54:44+00:00

Last Scan

Scanned2024-09-07T11:54:44+00:00
URL https://breat.fr/robots.txt
Domain IPs 104.21.51.37, 172.67.220.123, 2606:4700:3035::ac43:dc7b, 2606:4700:3037::6815:3325
Response IP 172.67.220.123
Found Yes
Hash b6807d0a8d8409e3e0dd63c17958fa7b3965975aa141cbbf4b0719b1f014ce07
SimHash dc3e684002b6

Groups

*

Rule Path
Allow /
Disallow /.ftpquota
Disallow /.htaccess
Disallow /.htpasswd
Disallow /BingSiteAuth.xml
Disallow /default_index.html
Disallow /f46af2a7-8ca0-411c-8ff3-f9803c0886d6.html
Disallow /fileal.txt
Disallow /FileFox.txt
Disallow /FileJoker.txt
Disallow /mywot931787a897194177edb6.html
Disallow /nortonsw_aa4a4e20-9033-0.html
Disallow /phpinfolws.php
Disallow /pinterest-fdf04.html
Disallow /SjuT9fqG156pg21k.txt
Disallow /yandex_6404c24fe9c6bfd1.html
Disallow /ai.breat.fr/
Disallow /comments/
Disallow /erreurs.breat.fr/
Disallow /includes/
Disallow /medical.breat.fr/
Disallow /nas.breat.fr/
Disallow /static/
Disallow /stash.breat.fr/
Disallow /stats.breat.fr/
Disallow /test.breat.fr/
Disallow /*/config.php
Disallow /*/default_index.html

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://breat.fr/sitemap.xml

Comments

  • Algolia-Crawler-Verif: 3F0740239616609F