lum-int.io
robots.txt

Robots Exclusion Standard data for lum-int.io

Resource Scan

Scan Details

Site Domain lum-int.io
Base Domain lum-int.io
Scan Status Ok
Last Scan 2026-02-22T16:02:35+00:00
Next Scan 2026-03-01T16:02:35+00:00

Last Scan

Scanned 2026-02-22T16:02:35+00:00
URL https://www.lum-int.io/robots.txt
Redirect https://brightdata.com/robots.txt
Redirect Domain brightdata.com
Redirect Base brightdata.com
Domain IPs 104.21.21.126, 172.67.198.159
Redirect IPs 104.18.24.60, 104.18.25.60
Response IP 104.18.25.60
Found Yes
Hash baa07aa08e92005eee292d70102169997bcc527cbf3a6433ec0c580a74a8a4af
SimHash e935a2dbee93

Groups

*

Rule Path
Disallow /lum/
Disallow /www/*.html
Disallow /use-cases/fintech
Disallow /products/datasets2/
Disallow /events/*
Disallow /wp-stage/*
Disallow /www/*
Disallow /svc/*
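
The rules in this group use `*` wildcards, so a plain prefix comparison is not enough to evaluate them. The following is a minimal sketch of how a crawler might test a path against them, assuming the matching semantics of RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end, and the longest matching pattern wins). The helper names `pattern_to_regex`, `matching_rule`, and `is_allowed` are hypothetical and not part of the scan data.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/lum/",
    "/www/*.html",
    "/use-cases/fintech",
    "/products/datasets2/",
    "/events/*",
    "/wp-stage/*",
    "/www/*",
    "/svc/*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex matched from the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def matching_rule(path: str):
    """Return the longest Disallow pattern that matches path, or None."""
    hits = [p for p in DISALLOW if pattern_to_regex(p).match(path)]
    return max(hits, key=len, default=None)


def is_allowed(path: str) -> bool:
    """A path is allowed when no Disallow pattern matches it (the group has no Allow rules)."""
    return matching_rule(path) is None


print(matching_rule("/www/index.html"))   # /www/*.html
print(is_allowed("/products/datasets/"))  # True
print(is_allowed("/events"))              # True, the rule requires the trailing slash
```

Because the group contains only Disallow rules, the longest-match step affects only which rule is reported, not the allow/deny outcome.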

Other Records

Field Value
crawl-delay 5
sitemap https://brightdata.com/sitemap_index.xml
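
As an illustration of how a client could read these two records, the sketch below feeds a reconstructed fragment to Python's standard `urllib.robotparser`. The fragment is an assumption assembled from the values in this report (one Disallow rule, the crawl delay, and the sitemap URL); the live file may contain additional lines, such as the unrecognized `host` field.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the records in this report; not a copy of the live file.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /lum/",
    "Crawl-delay: 5",
    "Sitemap: https://brightdata.com/sitemap_index.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)  # parse in-memory lines, no network access

delay = parser.crawl_delay("*")  # seconds to wait between requests
sitemaps = parser.site_maps()    # list of sitemap URLs (Python 3.8+)

print(delay)     # 5
print(sitemaps)  # ['https://brightdata.com/sitemap_index.xml']
```

`crawl-delay` is not part of RFC 9309, so whether a given crawler honors the 5-second value depends on that crawler.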

Warnings

  • `host` is not a known field.