wearewoodruff.com
robots.txt

Robots Exclusion Standard data for wearewoodruff.com

Resource Scan

Scan Details

Site Domain wearewoodruff.com
Base Domain wearewoodruff.com
Scan Status Ok
Last Scan2025-02-19T05:18:31+00:00
Next Scan 2025-03-21T05:18:31+00:00

Last Scan

Scanned2025-02-19T05:18:31+00:00
URL https://wearewoodruff.com/robots.txt
Domain IPs 104.21.20.18, 172.67.190.209, 2606:4700:3030::6815:1412, 2606:4700:3033::ac43:bed1
Response IP 172.67.190.209
Found Yes
Hash 0f39c74989e85167ac737ddbca895bc87765e85700470e2a282d5e5af3327b7e
SimHash e8397843e9b2

Groups

*

Rule Path
Disallow /*add-to-cart%3D*
Disallow /feed/
Disallow */feed
Disallow */feed$
Disallow /feed/$
Disallow /comments/feed
Disallow /?feed=
Disallow /wp-feed

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://wearewoodruff.com/sitemaps.xml

Warnings

  • `​user-agent` is not a known field.