sourdoughhome.com
robots.txt

Robots Exclusion Standard data for sourdoughhome.com

Resource Scan

Scan Details

Site Domain sourdoughhome.com
Base Domain sourdoughhome.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-09-27T03:44:27+00:00
Next Scan 2025-11-26T03:44:27+00:00

Last Successful Scan

Scanned2025-07-30T02:21:49+00:00
URL https://sourdoughhome.com/robots.txt
Domain IPs 104.21.96.46, 172.67.173.3, 2606:4700:3034::ac43:ad03, 2606:4700:3037::6815:602e
Response IP 172.67.173.3
Found Yes
Hash 19fa8ac1fbe070165171800dd53a6c9632ebb9ce0d69131e54e61b929689917b
SimHash c2510846e072

Groups

bingbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

*

Rule Path
Allow /wp-content/uploads/
Disallow /wp-content/plugins/
Disallow /wp-admin/
Disallow /includes
Disallow /pics
Disallow /styles
Disallow /cgi-bin
Disallow /buttons
Disallow /downloads
Disallow /movies
Disallow /scripts
Disallow /wip
Disallow /Glutenfree
Disallow /?blackhole
Disallow /GlutenFree/

Other Records

Field Value
sitemap https://www.sourdoughhome.com/sitemap_index.xml