charleroi.be
robots.txt

Robots Exclusion Standard data for charleroi.be

Resource Scan

Scan Details

Site Domain charleroi.be
Base Domain charleroi.be
Scan Status Ok
Last Scan2025-04-20T09:49:34+00:00
Next Scan 2025-04-27T09:49:34+00:00

Last Scan

Scanned2025-04-20T09:49:34+00:00
URL https://charleroi.be/robots.txt
Redirect https://www.charleroi.be/robots.txt
Redirect Domain www.charleroi.be
Redirect Base charleroi.be
Domain IPs 193.104.37.46, 2a00:b060::c168:252e
Redirect IPs 193.104.37.46, 2a00:b060::c168:252e
Response IP 193.104.37.46
Found Yes
Hash 5ca53fcf4d3cd42a57aceb57f254b414003a66d7fac3e8acc65a785993d42a57
SimHash c1501b743db3

Groups

*

Rule Path
Disallow /assets/
Allow /assets/images/
Disallow /cpresources/
Disallow /imager/
Disallow /vendor/
Disallow /.env

Other Records

Field Value
sitemap https://www.charleroi.be/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://www.charleroi.be/
  • live - don't allow web crawlers to index cpresources/ or vendor/