onlycrumbsremain.com
robots.txt
Robots Exclusion Standard data for onlycrumbsremain.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | onlycrumbsremain.com |
| Base Domain | onlycrumbsremain.com |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Server returned a client error. |
| Last Scan | 2026-02-16T05:32:13+00:00 |
| Next Scan | 2026-05-17T05:32:13+00:00 |
Last Successful Scan
| Field | Value |
|---|---|
| Scanned | 2025-07-21T10:49:53+00:00 |
| URL | https://onlycrumbsremain.com/robots.txt |
| Domain IPs | 104.26.10.34, 104.26.11.34, 172.67.74.131, 2606:4700:20::681a:a22, 2606:4700:20::681a:b22, 2606:4700:20::ac43:4a83 |
| Response IP | 104.26.11.34 |
| Found | Yes |
| Hash | 58e0253788e87e6c2c137eb83ac72b67fecf68fa231424acf4f639c88d311e73 |
| SimHash | 3f055d4505f1 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /cgi-bin |
| Disallow | /wp-login.php |
| Disallow | /xmlrpc.php |
| Disallow | /cdn-cgi/ |
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /*.doc$ |
| Disallow | /*.pdf$ |
| Disallow | /*.zip$ |
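The rules in the two groups above mix plain prefixes (`/cgi-bin`) with wildcard patterns (`/*.pdf$`). As a rough illustration of how such patterns are commonly interpreted, the sketch below translates each pattern into a regular expression and checks a path against the list. It assumes `*` matches any run of characters and a trailing `$` anchors the end of the path; the names `DISALLOW`, `pattern_to_regex`, and `is_disallowed` are hypothetical and not part of the scan. Because the scan lists no Allow rules, the sketch skips precedence handling and percent-encoding normalization.

```python
import re

# Disallow patterns copied from the two "*" groups listed in the scan.
DISALLOW = [
    "/cgi-bin",
    "/wp-login.php",
    "/xmlrpc.php",
    "/cdn-cgi/",
    "/*.doc$",
    "/*.pdf$",
    "/*.zip$",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the path. Without '$' the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape the literal pieces and join them with '.*' for each wildcard.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + regex + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches the given URL path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for candidate in ["/files/report.pdf", "/blog/post", "/wp-login.php?action=x"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, `/cdn-cgi` without the trailing slash is not matched by `/cdn-cgi/`, and a path such as `/files/report.pdf?x=1` is not matched by `/*.pdf$` because the `$` requires the path to end in `.pdf`.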
Other Records
| Field | Value |
|---|---|
| sitemap | https://onlycrumbsremain.com/sitemap_index.xml |