petit-bulletin.fr
robots.txt

Robots Exclusion Standard data for petit-bulletin.fr

Resource Scan

Scan Details

Site Domain petit-bulletin.fr
Base Domain petit-bulletin.fr
Scan Status Ok
Last Scan 2024-11-09T01:20:28+00:00
Next Scan 2024-11-16T01:20:28+00:00

Last Scan

Scanned 2024-11-09T01:20:28+00:00
URL https://petit-bulletin.fr/robots.txt
Redirect https://www.petit-bulletin.fr/robots.txt
Redirect Domain www.petit-bulletin.fr
Redirect Base petit-bulletin.fr
Domain IPs 62.4.7.2
Redirect IPs 62.4.7.2
Response IP 62.4.7.2
Found Yes
Hash 17209e9a77d60f3d33e0c54e348d589a3768cf5c033218283403e3ede1c52a2b
SimHash c126e703c510

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /newsletter/
Disallow /images/
Disallow /backoffice/
Disallow /mon-espace-blog/
Disallow /modules/
Disallow /transfert/
Disallow /classes/
Disallow /pbv1/
Disallow /stats/
Disallow /clients/
Disallow /print.php
Disallow /envoyer.php
Disallow /prod/
Disallow /test/
Disallow /stats/
Disallow /sondages/
Disallow /publicite/
Disallow /polemiques/
Disallow /culturelyon/
Disallow /captcha/
Disallow /cache/
Disallow /*recherche-article-*

baiduspider
yisouspider
petalbot
bytespider
sogou web spider
sogou inst spider
amazonbot
gptbot
yahoo! slurp

Rule Path
Disallow /
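
To see how the three groups above would apply to a given crawler, the following is a minimal sketch using Python's standard `urllib.robotparser`. It rebuilds a robots.txt from the groups as listed in this report, so it is a reconstruction and not the file that was actually scanned (the one invalid line noted under Warnings is not reproduced). The helper names `build_robots_txt` and `allowed`, the user agent `ExampleBot`, and the path `/agenda/` are hypothetical and used only for illustration. Note also that the standard-library parser matches rules by plain path prefix, so the wildcard rule `/*recherche-article-*` is not evaluated as a pattern here.

```python
from urllib.robotparser import RobotFileParser

# Groups as listed in the scan report: (user agents, disallowed paths).
# An empty path means nothing is disallowed for that group.
GROUPS = [
    (["mediapartners-google"], [""]),
    (["*"], [
        "/newsletter/", "/images/", "/backoffice/", "/mon-espace-blog/",
        "/modules/", "/transfert/", "/classes/", "/pbv1/", "/stats/",
        "/clients/", "/print.php", "/envoyer.php", "/prod/", "/test/",
        "/stats/", "/sondages/", "/publicite/", "/polemiques/",
        "/culturelyon/", "/captcha/", "/cache/", "/*recherche-article-*",
    ]),
    (["baiduspider", "yisouspider", "petalbot", "bytespider",
      "sogou web spider", "sogou inst spider", "amazonbot", "gptbot",
      "yahoo! slurp"], ["/"]),
]


def build_robots_txt(groups):
    """Render the groups back into robots.txt syntax (a reconstruction)."""
    blocks = []
    for agents, paths in groups:
        lines = [f"User-agent: {agent}" for agent in agents]
        lines += [f"Disallow: {path}" for path in paths]
        blocks.append("\n".join(lines))
    return "\n\n".join(blocks) + "\n"


# Parse the reconstructed text locally; no network request is made.
parser = RobotFileParser()
parser.parse(build_robots_txt(GROUPS).splitlines())


def allowed(agent, path):
    """Return True if `agent` may fetch `path` under the reconstructed rules."""
    return parser.can_fetch(agent, "https://www.petit-bulletin.fr" + path)


if __name__ == "__main__":
    # "ExampleBot" and "/agenda/" are hypothetical, chosen for illustration.
    for agent, path in [
        ("gptbot", "/"),
        ("mediapartners-google", "/stats/"),
        ("ExampleBot", "/stats/"),
        ("ExampleBot", "/agenda/"),
    ]:
        print(agent, path, allowed(agent, path))
```

Under these assumptions, a crawler named in the last group is refused everything, `mediapartners-google` is allowed everywhere because its only rule is an empty `Disallow`, and any other crawler falls back to the `*` group.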

Warnings

  • 1 invalid line.