/.well-known/

Log In Sign Up

breat.fr
robots.txt

Robots Exclusion Standard data for breat.fr

Archived Snapshots

Resource Scan

Scan Details

Site Domain	breat.fr
Base Domain	breat.fr
Scan Status	Ok
Last Scan	2024-09-07T11:54:44+00:00
Next Scan	2024-10-07T11:54:44+00:00

Last Scan

Scanned	2024-09-07T11:54:44+00:00
URL	https://breat.fr/robots.txt
Domain IPs	104.21.51.37, 172.67.220.123, 2606:4700:3035::ac43:dc7b, 2606:4700:3037::6815:3325
Response IP	172.67.220.123
Found	Yes
Hash	b6807d0a8d8409e3e0dd63c17958fa7b3965975aa141cbbf4b0719b1f014ce07
SimHash	dc3e684002b6

Groups

*

Rule

Path

Allow

/

Disallow

/.ftpquota

Disallow

/.htaccess

Disallow

/.htpasswd

Disallow

/BingSiteAuth.xml

Disallow

/default_index.html

Disallow

/f46af2a7-8ca0-411c-8ff3-f9803c0886d6.html

Disallow

/fileal.txt

Disallow

/FileFox.txt

Disallow

/FileJoker.txt

Disallow

/mywot931787a897194177edb6.html

Disallow

/nortonsw_aa4a4e20-9033-0.html

Disallow

/phpinfolws.php

Disallow

/pinterest-fdf04.html

Disallow

/SjuT9fqG156pg21k.txt

Disallow

/yandex_6404c24fe9c6bfd1.html

Disallow

/ai.breat.fr/

Disallow

/comments/

Disallow

/erreurs.breat.fr/

Disallow

/includes/

Disallow

/medical.breat.fr/

Disallow

/nas.breat.fr/

Disallow

/static/

Disallow

/stash.breat.fr/

Disallow

/stats.breat.fr/

Disallow

/test.breat.fr/

Disallow

/*/config.php

Disallow

/*/default_index.html

ccbot

Rule

Path

Disallow

/

Back to top

Other Records

Field

Value

sitemap

https://breat.fr/sitemap.xml

Back to top

Comments

Algolia-Crawler-Verif: 3F0740239616609F

Back to top