sourdoughhome.com
robots.txt
Robots Exclusion Standard data for sourdoughhome.com
Resource Scan
Scan Details
| Site Domain | sourdoughhome.com |
| Base Domain | sourdoughhome.com |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Server returned a client error. |
| Last Scan | 2025-09-27T03:44:27+00:00 |
| Next Scan | 2025-11-26T03:44:27+00:00 |
Last Successful Scan
| Scanned | 2025-07-30T02:21:49+00:00 |
| URL | https://sourdoughhome.com/robots.txt |
| Domain IPs | 104.21.96.46, 172.67.173.3, 2606:4700:3034::ac43:ad03, 2606:4700:3037::6815:602e |
| Response IP | 172.67.173.3 |
| Found | Yes |
| Hash | 19fa8ac1fbe070165171800dd53a6c9632ebb9ce0d69131e54e61b929689917b |
| SimHash | c2510846e072 |
Groups
*
| Rule | Path |
|---|---|
| Allow | /wp-content/uploads/ |
| Disallow | /wp-content/plugins/ |
| Disallow | /wp-admin/ |
| Disallow | /includes |
| Disallow | /pics |
| Disallow | /styles |
| Disallow | /cgi-bin |
| Disallow | /buttons |
| Disallow | /downloads |
| Disallow | /movies |
| Disallow | /scripts |
| Disallow | /wip |
| Disallow | /Glutenfree |
| Disallow | /?blackhole |
| Disallow | /GlutenFree/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.sourdoughhome.com/sitemap_index.xml |