lighthouse.by
robots.txt

Robots Exclusion Standard data for lighthouse.by

Resource Scan

Scan Details

Site Domain lighthouse.by
Base Domain lighthouse.by
Scan Status Ok
Last Scan2025-10-31T18:41:07+00:00
Next Scan 2025-11-30T18:41:07+00:00

Last Scan

Scanned2025-10-31T18:41:07+00:00
URL https://lighthouse.by/robots.txt
Domain IPs 2a0a:7d80:1:7::83, 93.125.99.88
Response IP 93.125.99.88
Found Yes
Hash cfe430888d3bef1a61a0fc1f5bc255cb0fcc42cdfc8830e60234a80cced27daf
SimHash 2960d2e083b7

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Allow /wp-content/uploads/
Allow /wp-content/cache/autoptimize/
Allow /wp-includes/css/
Allow /wp-includes/js/
Disallow /wp-content/plugins/
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/cache
Disallow /trackback
Disallow */trackback
Disallow */*/trackback
Disallow */*/feed/*/
Disallow */feed
Disallow /tag
Disallow /files/
Disallow *feed*
Disallow *fuxn*
Disallow *.php*
Disallow *rss*

Other Records

Field Value
sitemap https://lighthouse.by/sitemap.xml