onlycrumbsremain.com
robots.txt

Robots Exclusion Standard data for onlycrumbsremain.com

Resource Scan

Scan Details

Site Domain onlycrumbsremain.com
Base Domain onlycrumbsremain.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2026-02-16T05:32:13+00:00
Next Scan 2026-05-17T05:32:13+00:00

Last Successful Scan

Scanned2025-07-21T10:49:53+00:00
URL https://onlycrumbsremain.com/robots.txt
Domain IPs 104.26.10.34, 104.26.11.34, 172.67.74.131, 2606:4700:20::681a:a22, 2606:4700:20::681a:b22, 2606:4700:20::ac43:4a83
Response IP 104.26.11.34
Found Yes
Hash 58e0253788e87e6c2c137eb83ac72b67fecf68fa231424acf4f639c88d311e73
SimHash 3f055d4505f1

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-login.php
Disallow /xmlrpc.php
Disallow /cdn-cgi/

*

Rule Path
Disallow /*.doc$
Disallow /*.pdf$
Disallow /*.zip$

Other Records

Field Value
sitemap https://onlycrumbsremain.com/sitemap_index.xml