brightdata.com
robots.txt

Robots Exclusion Standard data for brightdata.com

Resource Scan

Scan Details

Site Domain brightdata.com
Base Domain brightdata.com
Scan Status Ok
Last Scan 2024-11-14T09:44:59+00:00
Next Scan 2024-11-21T09:44:59+00:00

Last Scan

Scanned 2024-11-14T09:44:59+00:00
URL https://brightdata.com/robots.txt
Domain IPs 104.18.24.60, 104.18.25.60
Response IP 104.18.25.60
Found Yes
Hash 844cfab5f4963e2c36d7a829323fde396b48783989f08fb956851eb39448b86c
SimHash 8815a2dbee93

Groups

*

Rule Path
Disallow /lum/
Disallow /www/*.html
Disallow /use-cases/fintech
Disallow /products/datasets2/
Disallow /events/*
Disallow /wp-stage/*
Disallow /www/*
Disallow /svc/*
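
The rules above can be checked against a URL path with a short script. What follows is a minimal sketch, not the scanner's own logic: it assumes the common robots.txt matching convention in which a rule matches from the start of the path, `*` matches any run of characters, and a trailing `$` anchors the end. The names `DISALLOW_RULES`, `rule_to_regex`, and `is_disallowed` are hypothetical helpers introduced for illustration. Because this group lists no `Allow` rules, the sketch does not implement Allow/Disallow precedence.

```python
import re

# Disallow rules copied from the "*" group listed above.
DISALLOW_RULES = [
    "/lum/",
    "/www/*.html",
    "/use-cases/fintech",
    "/products/datasets2/",
    "/events/*",
    "/wp-stage/*",
    "/www/*",
    "/svc/*",
]


def rule_to_regex(rule):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed convention: '*' matches any run of characters and a
    trailing '$' anchors the end of the path.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path, rules=DISALLOW_RULES):
    """Return True if any Disallow rule matches from the start of the path."""
    # re.Pattern.match only matches at the beginning of the string,
    # which gives the prefix semantics assumed above.
    return any(rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for candidate in ["/lum/login", "/www/page.html", "/events", "/products/datasets/"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, `/lum/login` and `/www/page.html` are reported as disallowed, while `/events` (no trailing slash) and `/products/datasets/` are not, since neither begins with any listed rule.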

Other Records

Field Value
sitemap https://brightdata.com/sitemap_index.xml

Warnings

  • `host` is not a known field.