lejdc.fr
robots.txt

Robots Exclusion Standard data for lejdc.fr

Resource Scan

Scan Details

Site Domain lejdc.fr
Base Domain lejdc.fr
Scan Status Ok
Last Scan2024-05-25T08:46:51+00:00
Next Scan 2024-06-01T08:46:51+00:00

Last Scan

Scanned2024-05-25T08:46:51+00:00
URL https://lejdc.fr/robots.txt
Redirect https://www.lejdc.fr/robots.txt
Redirect Domain www.lejdc.fr
Redirect Base lejdc.fr
Domain IPs 104.18.28.242, 104.18.29.242
Redirect IPs 104.18.28.242, 104.18.29.242, 2606:4700::6812:1cf2, 2606:4700::6812:1df2
Response IP 104.18.29.242
Found Yes
Hash 2ac4b5aba5185f0fc3d4f60369e05ae1a93b41bf6262328e01b5d3289d66b7c0
SimHash 191f15a661f2

Groups

*

Rule Path
Disallow /captcha.png
Disallow /json-rpc
Disallow /ajax
Disallow /idalgo
Disallow /place-publique
Disallow /*.php
Disallow /wp-
Disallow /recherche
Disallow /archivev2
Disallow /widgetRss
Disallow /*?widgetRss
Disallow /*GCF_
Disallow /region/
Disallow /loisirs/agenda/image
Disallow /abonnement/integrale-mt-1690
Disallow /abonnement/integrale-pc-1690
Disallow /abonnement/integrale-rc-1690
Disallow /abonnement/integrale-yr-1690
Disallow /abonnement/integrale-er-1690
Disallow /abonnement/integrale-jc-1690
Disallow /abonnement/integrale-ev-1690
Disallow /abonnement/integrale-br-1690
Disallow /abonnement/integrale-annuel-mt-1690
Disallow /abonnement/integrale-annuel-pc-1690
Disallow /abonnement/integrale-annuel-rc-1690
Disallow /abonnement/integrale-annuel-yr-1690
Disallow /abonnement/integrale-annuel-er-1690
Disallow /abonnement/integrale-annuel-jc-1690
Disallow /abonnement/integrale-annuel-ev-1690
Disallow /abonnement/integrale-annuel-br-1690
Disallow /front/

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /