woodyworld.com
robots.txt

Robots Exclusion Standard data for woodyworld.com

Resource Scan

Scan Details

Site Domain woodyworld.com
Base Domain woodyworld.com
Scan Status Ok
Last Scan 2025-02-25T04:17:29+00:00
Next Scan 2025-03-27T04:17:29+00:00

Last Scan

Scanned 2025-02-25T04:17:29+00:00
URL https://woodyworld.com/robots.txt
Domain IPs 104.21.61.230, 172.67.216.11, 2606:4700:3033::6815:3de6, 2606:4700:3033::ac43:d80b
Response IP 104.21.61.230
Found Yes
Hash b590a820e5b70b538c3baa27220077ca232698f34b89b8c5466bee58730e6a17
SimHash b801317be6b8

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
Disallow *?*price*
Disallow *?*&*&*&*
Disallow *?*&*&*&*&*
Disallow *?*&*&*&*&*&*
Disallow *?*%5B0%5D*
Disallow *?*%5B1%5D*
Disallow *?*%5B2%5D*
Disallow */app/
Disallow */bin/
Disallow */dev/
Disallow */lib/
Disallow */phpserver/
Disallow */pkginfo/
Disallow */report/
Disallow */setup/
Disallow */update/
Disallow */var/
Disallow */vendor/
Disallow */index.php/
Disallow */catalog/product_compare/
Disallow */catalog/category/view/
Disallow */catalog/product/view/
Disallow */catalogsearch/
Disallow */control/
Disallow */contacts/
Disallow */contact/
Disallow */customer/
Disallow */customize/
Disallow */newsletter/
Disallow */review/
Disallow */sendfriend/
Disallow */wishlist/
Disallow */composer.json
Disallow */composer.lock
Disallow */CONTRIBUTING.md
Disallow */CONTRIBUTOR_LICENSE_AGREEMENT.html
Disallow */COPYING.txt
Disallow */Gruntfile.js
Disallow */LICENSE.txt
Disallow */LICENSE_AFL.txt
Disallow */nginx.conf.sample
Disallow */package.json
Disallow */php.ini.sample
Disallow */RELEASE_NOTES.txt
Disallow /*?*product_list_mode=
Disallow /*?*product_list_order=
Disallow /*?*product_list_limit=
Disallow /*?*product_list_dir=
Disallow /*?SID=
Disallow /*?p=*&
Disallow /*%26p%3D*%26
Disallow /*%26p%3D*
Disallow /*.php$
Disallow /*.CVS
Disallow /*.Zip$
Disallow /*.Svn$
Disallow /*.Idea$
Disallow /*.Sql$
Disallow /*.Tgz$
Disallow /*.cvs
Disallow /*.zip$
Disallow /*.svn$
Disallow /*.idea$
Disallow /*.sql$
Disallow /*.tgz$
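
The Disallow paths above rely on two pattern characters: `*` stands for any run of characters, and a trailing `$` anchors the pattern to the end of the URL. A minimal Python sketch of that matching follows, assuming the usual Robots Exclusion Protocol semantics (case-sensitive match from the start of the path plus query string). The helper names `rule_to_regex` and `is_disallowed` are hypothetical, `RULES` is a small subset copied from the list above, and the sketch skips Allow precedence (the scan lists only Disallow rules) and percent-encoding normalization.

```python
import re


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the first character.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, rules: list[str]) -> bool:
    """Return True if any Disallow pattern matches the path (with query)."""
    return any(rule_to_regex(rule).match(path) for rule in rules)


# A subset of the Disallow rules from the '*' group above.
RULES = ["/cpresources/", "*?*price*", "*/customer/", "/*.php$", "/*?p=*&", "/*.zip$"]

if __name__ == "__main__":
    for candidate in (
        "/cpresources/app.js",
        "/nl/shop?price=10-20",
        "/nl/customer/account/",
        "/index.php",
        "/index.php?x=1",
        "/backup.Zip",
    ):
        print(candidate, is_disallowed(candidate, RULES))
```

Because matching is case-sensitive, `/backup.Zip` is not caught by `/*.zip$` alone, which is presumably why the group lists both capitalized and lowercase variants of each extension.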

Other Records

Field Value
crawl-delay 5
sitemap https://woodyworld.com/nl/sitemaps-1-sitemap.xml
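
The `crawl-delay 5` record asks crawlers to leave at least five seconds between requests. The field is not part of the core Robots Exclusion Protocol and some crawlers ignore it, so the following is only a sketch of how a polite client might honor it. The class name `CrawlDelay` and its injectable `clock` and `sleep` parameters are hypothetical, chosen so the logic can be exercised without real waiting.

```python
import time


class CrawlDelay:
    """Enforce a minimum gap between consecutive requests to one host."""

    def __init__(self, delay_seconds, clock=time.monotonic, sleep=time.sleep):
        self._delay = delay_seconds
        self._clock = clock      # injectable for testing (assumption)
        self._sleep = sleep      # injectable for testing (assumption)
        self._last = None        # time of the previous request, if any

    def wait(self):
        """Block until at least `delay_seconds` have passed since the last call."""
        if self._last is not None:
            remaining = self._delay - (self._clock() - self._last)
            if remaining > 0:
                self._sleep(remaining)
        self._last = self._clock()


if __name__ == "__main__":
    limiter = CrawlDelay(5)  # value taken from the crawl-delay record above
    for url in ("https://woodyworld.com/robots.txt",
                "https://woodyworld.com/nl/sitemaps-1-sitemap.xml"):
        limiter.wait()
        print("would fetch", url)  # a real crawler would issue the request here
```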

Comments

  • robots.txt
  • live - don't allow web crawlers to index cpresources/ or vendor/
  • prevent excessive indexing
  • Block price filter
  • Block new navigation segments > 3 filters
  • Block new navigation segments > 2 same attribute filters
  • Directories
  • Paths (clean URLs)
  • Files
  • Do not index pages that are sorted or filtered.
  • Do not index session ID
  • Disallow paging with other filtering
  • CVS, SVN directory and dump files
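
The report splits the file into groups (rules per user agent), other records (`crawl-delay`, `sitemap`), and comments. A rough Python sketch of that split is shown below. The function name `parse_robots` is hypothetical, and `SAMPLE` is a short illustrative excerpt assembled from values in this report, not the actual file served by the site.

```python
def parse_robots(text: str) -> dict:
    """Split robots.txt text into rule groups, other records, and comments."""
    groups = {}            # user-agent -> list of (field, path)
    other = []             # (field, value) records that are not allow/disallow
    comments = []          # text after '#' on any line
    current_agents = []    # agents the following rules apply to
    last_was_agent = False

    for raw in text.splitlines():
        line, _, comment = raw.partition("#")
        if comment.strip():
            comments.append(comment.strip())
        line = line.strip()
        if ":" not in line:
            continue
        field, _, value = line.partition(":")
        field, value = field.strip().lower(), value.strip()

        if field == "user-agent":
            # Consecutive user-agent lines share one group of rules.
            if not last_was_agent:
                current_agents = []
            current_agents.append(value)
            groups.setdefault(value, [])
            last_was_agent = True
            continue

        last_was_agent = False
        if field in ("allow", "disallow"):
            for agent in current_agents:
                groups[agent].append((field, value))
        else:
            other.append((field, value))

    return {"groups": groups, "other": other, "comments": comments}


# Illustrative excerpt only (assumption): built from values shown in this report.
SAMPLE = """\
# Block price filter
User-agent: *
Disallow: /cpresources/
Disallow: *?*price*
Crawl-delay: 5
Sitemap: https://woodyworld.com/nl/sitemaps-1-sitemap.xml
"""

if __name__ == "__main__":
    parsed = parse_robots(SAMPLE)
    print(parsed["groups"])
    print(parsed["other"])
    print(parsed["comments"])
```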