planet.com
robots.txt
Robots Exclusion Standard data for planet.com
Resource Scan
Scan Details
Site Domain | planet.com |
Base Domain | planet.com |
Scan Status | Ok |
Last Scan | 2025-08-27T19:50:07+00:00 |
Next Scan | 2025-09-26T19:50:07+00:00 |
Last Scan
Scanned | 2025-08-27T19:50:07+00:00 |
URL | https://planet.com/robots.txt |
Redirect | https://www.planet.com:443/robots.txt |
Redirect Domain | www.planet.com |
Redirect Base | planet.com |
Domain IPs | 34.120.196.216 |
Redirect IPs | 34.120.196.216 |
Response IP | 34.120.196.216 |
Found | Yes |
Hash | c8ff4189a517f2a3038d09d63bc3c72e236a0eb6c1e9c71e69e290e08e67e2c9 |
SimHash | 5c01917182a4 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /search |
Disallow | /search/* |
Disallow | /marketplace/* |
Disallow | /ignite25 |
Disallow | /ignite25/ |
Disallow | /products-v2a/ |
Disallow | /products-v2b/ |
Disallow | /admin |
Disallow | /api/internal |
Other Records
Field | Value |
---|---|
crawl-delay | 1 |
Other Records
Field | Value |
---|---|
sitemap | https://www.planet.com/sitemap-index.xml |
Warnings
- `host` is not a known field.
Comments