wayahead.be
robots.txt

Robots Exclusion Standard data for wayahead.be

Resource Scan

Scan Details

Site Domain wayahead.be
Base Domain wayahead.be
Scan Status Ok
Last Scan2025-03-08T10:35:24+00:00
Next Scan 2025-04-07T10:35:24+00:00

Last Scan

Scanned2025-03-08T10:35:24+00:00
URL https://wayahead.be/robots.txt
Domain IPs 2a00:1c98:1000:1213:0:2:d335:2a2e, 5.134.4.210
Response IP 5.134.4.210
Found Yes
Hash 1417500bc53b2a43f8affdf058e76bf0f0780be4c79ceac5a3b028987ec41ae1
SimHash 41701d763793

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://wayahead.be/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://wayahead.be/
  • live - don't allow web crawlers to index cpresources/ or vendor/