firecrawl.dev
robots.txt

Robots Exclusion Standard data for firecrawl.dev

Resource Scan

Scan Details

Site Domain firecrawl.dev
Base Domain firecrawl.dev
Scan Status Ok
Last Scan2025-09-08T23:34:51+00:00
Next Scan 2025-10-08T23:34:51+00:00

Last Scan

Scanned2025-09-08T23:34:51+00:00
URL https://firecrawl.dev/robots.txt
Redirect https://www.firecrawl.dev/robots.txt
Redirect Domain www.firecrawl.dev
Redirect Base firecrawl.dev
Domain IPs 76.76.21.21
Redirect IPs 66.33.60.66, 76.76.21.241
Response IP 66.33.60.129
Found Yes
Hash 2108584f422b9bd2f5ef3ff3d11040d0077b550266f67501743bd754a0d41fd7
SimHash 4d1c6deb4d10

Groups

*

Rule Path
Disallow /_next/static/
Disallow /_next/static/css/
Disallow /logos
Disallow /api/
Disallow /assets
Disallow /assets-original
Disallow /fonts