luminatinet.com
robots.txt

Robots Exclusion Standard data for luminatinet.com

Resource Scan

Scan Details

Site Domain luminatinet.com
Base Domain luminatinet.com
Scan Status Ok
Last Scan2024-11-15T17:02:47+00:00
Next Scan 2024-11-22T17:02:47+00:00

Last Scan

Scanned2024-11-15T17:02:47+00:00
URL https://luminatinet.com/robots.txt
Redirect https://brightdata.com/robots.txt
Redirect Domain brightdata.com
Redirect Base brightdata.com
Domain IPs 3.90.158.189, 3.92.97.199
Redirect IPs 104.18.24.60, 104.18.25.60
Response IP 104.18.25.60
Found Yes
Hash 844cfab5f4963e2c36d7a829323fde396b48783989f08fb956851eb39448b86c
SimHash 8815a2dbee93

Groups

*

Rule Path
Disallow /lum/
Disallow /www/*.html
Disallow /use-cases/fintech
Disallow /products/datasets2/
Disallow /events/*
Disallow /wp-stage/*
Disallow /www/*
Disallow /svc/*

Other Records

Field Value
sitemap https://brightdata.com/sitemap_index.xml

Warnings

  • `host` is not a known field.