tooldata.io
robots.txt
Robots Exclusion Standard data for tooldata.io
Resource Scan
Scan Details
| Site Domain | tooldata.io |
| Base Domain | tooldata.io |
| Scan Status | Ok |
| Last Scan | 2025-10-18T06:49:33+00:00 |
| Next Scan | 2025-11-01T06:49:33+00:00 |
Last Scan
| Scanned | 2025-10-18T06:49:33+00:00 |
| URL | https://tooldata.io/robots.txt |
| Domain IPs | 209.151.153.205 |
| Response IP | 209.151.153.205 |
| Found | Yes |
| Hash | 3c3088f906af1fbc57d588dd8ee6858f6b5f25d7d6a01a6a6b8a52c4d523b755 |
| SimHash | 098e100287b4 |
Groups
gptbot
| Rule | Path |
|---|---|
| Allow | / |
| Allow | /blog/ |
| Allow | /social-listening |
| Allow | /inteligencia-artificial |
| Allow | /performance-marketing |
| Allow | /estrategia-digital |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 0.5 |
*
| Rule | Path |
|---|---|
| Allow | / |
| Disallow | /admin/ |
| Disallow | /private/ |
| Disallow | /.git/ |
| Disallow | /node_modules/ |
| Disallow | /src/ |
| Disallow | /*.json$ |
| Disallow | /*.config.* |
| Disallow | /api/internal/ |
| Disallow | /temp/ |
| Disallow | /.env |
| Allow | /ai-training-data/ |
| Allow | /knowledge-base/ |
| Allow | /structured-content/ |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 2 |
Other Records
| Field | Value |
|---|---|
| sitemap | https://tooldata.io/sitemap-index.xml |
| sitemap | https://tooldata.io/sitemap.xml |
| sitemap | https://tooldata.io/sitemap-ai.xml |
Warnings
- `host` is not a known field.
Comments