lum-int.io
robots.txt
Robots Exclusion Standard data for lum-int.io
Resource Scan
Scan Details
| Site Domain | lum-int.io |
| Base Domain | lum-int.io |
| Scan Status | Ok |
| Last Scan | 2026-02-22T16:02:35+00:00 |
| Next Scan | 2026-03-01T16:02:35+00:00 |
Last Scan
| Scanned | 2026-02-22T16:02:35+00:00 |
| URL | https://www.lum-int.io/robots.txt |
| Redirect | https://brightdata.com/robots.txt |
| Redirect Domain | brightdata.com |
| Redirect Base | brightdata.com |
| Domain IPs | 104.21.21.126, 172.67.198.159 |
| Redirect IPs | 104.18.24.60, 104.18.25.60 |
| Response IP | 104.18.25.60 |
| Found | Yes |
| Hash | baa07aa08e92005eee292d70102169997bcc527cbf3a6433ec0c580a74a8a4af |
| SimHash | e935a2dbee93 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /lum/ |
| Disallow | /www/*.html |
| Disallow | /use-cases/fintech |
| Disallow | /products/datasets2/ |
| Disallow | /events/* |
| Disallow | /wp-stage/* |
| Disallow | /www/* |
| Disallow | /svc/* |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 5 |
Other Records
| Field | Value |
|---|---|
| sitemap | https://brightdata.com/sitemap_index.xml |
Warnings
- `host` is not a known field.