woodlanddeck.com
robots.txt

Robots Exclusion Standard data for woodlanddeck.com

Resource Scan

Scan Details

Site Domain woodlanddeck.com
Base Domain woodlanddeck.com
Scan Status Ok
Last Scan2025-10-28T21:23:03+00:00
Next Scan 2025-11-04T21:23:03+00:00

Last Scan

Scanned2025-10-28T21:23:03+00:00
URL https://woodlanddeck.com/robots.txt
Domain IPs 104.21.26.102, 172.67.135.219, 2606:4700:3033::ac43:87db, 2606:4700:3035::6815:1a66
Response IP 172.67.135.219
Found Yes
Hash b21403097bd45c7efb60e42a68a56ac4fc00c4ad45b04305f882132aeb2ab1d2
SimHash 60a0d8600ab0

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /cgi-bin
Disallow /wp-
Disallow /?s=
Disallow *%26s%3D
Disallow /search
Disallow /author/
Disallow *?attachment_id=
Disallow */feed
Disallow */rss
Disallow */embed
Allow /wp-content/uploads/
Allow /wp-content/themes/
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
Allow /wp-*.svg
Allow /wp-*.pdf

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://woodlanddeck.com/sitemap_index.xml