leberry.fr
robots.txt

Robots Exclusion Standard data for leberry.fr

Resource Scan

Scan Details

Site Domain leberry.fr
Base Domain leberry.fr
Scan Status Ok
Last Scan 2024-05-31T04:05:05+00:00
Next Scan 2024-06-07T04:05:05+00:00
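
The two timestamps above are exactly seven days apart, which suggests a weekly rescan. A minimal sketch of that arithmetic follows. The seven-day interval is inferred from the two values and is not stated in the report, and the name next_scan is hypothetical.

```python
from datetime import datetime, timedelta

# Assumption: the rescan interval is 7 days, inferred from the
# "Last Scan" and "Next Scan" values in the report.
SCAN_INTERVAL = timedelta(days=7)

def next_scan(last_scan_iso: str) -> str:
    """Return the ISO 8601 timestamp one scan interval after last_scan_iso."""
    last_scan = datetime.fromisoformat(last_scan_iso)
    return (last_scan + SCAN_INTERVAL).isoformat()

print(next_scan("2024-05-31T04:05:05+00:00"))
```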

Last Scan

Scanned 2024-05-31T04:05:05+00:00
URL https://leberry.fr/robots.txt
Redirect https://www.leberry.fr/robots.txt
Redirect Domain www.leberry.fr
Redirect Base leberry.fr
Domain IPs 104.18.3.216
Redirect IPs 104.18.2.216, 104.18.3.216, 2606:4700::6812:2d8, 2606:4700::6812:3d8
Response IP 104.18.3.216
Found Yes
Hash 08907736b6aafd9ad8c15573674938a14fa3a13ceb33adc46e188720ef6007d8
SimHash 191f15a661f2
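
The Hash value is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. Assuming it is SHA-256 over the raw response body, a change-detection check could be sketched as below. The names body_digest and has_changed are hypothetical.

```python
import hashlib

def body_digest(body: bytes) -> str:
    """Hex SHA-256 digest of a robots.txt response body.

    Assumption: the report's Hash field is computed this way; the
    report itself does not confirm the algorithm or the input.
    """
    return hashlib.sha256(body).hexdigest()

def has_changed(body: bytes, previous_digest: str) -> bool:
    """True when the body no longer matches a previously stored digest."""
    return body_digest(body) != previous_digest.lower()
```

Storing the digest from each scan and comparing it on the next one is a cheap way to notice that the file was edited without diffing its contents.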

Groups

*

Rule Path
Disallow /captcha.png
Disallow /json-rpc
Disallow /ajax
Disallow /idalgo
Disallow /place-publique
Disallow /*.php
Disallow /wp-
Disallow /recherche
Disallow /archivev2
Disallow /widgetRss
Disallow /*?widgetRss
Disallow /*GCF_
Disallow /region/
Disallow /loisirs/agenda/image
Disallow /abonnement/integrale-mt-1690
Disallow /abonnement/integrale-pc-1690
Disallow /abonnement/integrale-rc-1690
Disallow /abonnement/integrale-yr-1690
Disallow /abonnement/integrale-er-1690
Disallow /abonnement/integrale-jc-1690
Disallow /abonnement/integrale-ev-1690
Disallow /abonnement/integrale-br-1690
Disallow /abonnement/integrale-annuel-mt-1690
Disallow /abonnement/integrale-annuel-pc-1690
Disallow /abonnement/integrale-annuel-rc-1690
Disallow /abonnement/integrale-annuel-yr-1690
Disallow /abonnement/integrale-annuel-er-1690
Disallow /abonnement/integrale-annuel-jc-1690
Disallow /abonnement/integrale-annuel-ev-1690
Disallow /abonnement/integrale-annuel-br-1690
Disallow /front/

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /
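
Several rules in the * group use the * wildcard (for example /*.php and /*?widgetRss), which Python's standard urllib.robotparser does not interpret. The sketch below shows one way to evaluate the groups listed above: * matches any run of characters, a trailing $ anchors the end of the path, and everything else is a prefix match. It is an illustration only. The rule list is a subset of the report (most /abonnement/ entries are omitted), the function names are hypothetical, and the user agent is matched by exact lowercase name, which is a simplification of how real crawlers pick a group.

```python
import re

# Subset of the groups from the report; most /abonnement/ entries omitted.
GROUPS = {
    "*": [
        "/captcha.png", "/json-rpc", "/ajax", "/idalgo", "/place-publique",
        "/*.php", "/wp-", "/recherche", "/archivev2", "/widgetRss",
        "/*?widgetRss", "/*GCF_", "/region/", "/loisirs/agenda/image",
        "/abonnement/integrale-mt-1690", "/front/",
    ],
    "gptbot": ["/"],
    "ccbot": ["/"],
    "chatgpt-user": ["/"],
}

def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(user_agent: str, path: str) -> bool:
    """True if path matches a Disallow rule in the group chosen for user_agent.

    A crawler with its own group uses only that group; any other
    crawler falls back to the "*" group.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    # re.match anchors at the start of the path, giving prefix semantics.
    return any(pattern_to_regex(rule).match(path) for rule in rules)
```

Under these assumptions, is_disallowed("GPTBot", "/") is true because the gptbot group blocks the whole site, while a crawler without its own group is only blocked from paths matched by the * rules, such as any URL containing .php.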