greencultured.co
robots.txt

Robots Exclusion Standard data for greencultured.co

Resource Scan

Scan Details

Site Domain greencultured.co
Base Domain greencultured.co
Scan Status Ok
Last Scan2026-02-27T04:32:31+00:00
Next Scan 2026-03-29T04:32:31+00:00

Last Scan

Scanned2026-02-27T04:32:31+00:00
URL https://greencultured.co/robots.txt
Domain IPs 172.66.40.68, 172.66.43.188, 2606:4700:3108::ac42:2844, 2606:4700:3108::ac42:2bbc
Response IP 172.66.40.68
Found Yes
Hash 08d3b74dc3b3c3f0066b20b0fe9a1f9dca0e2b1847e3753b4cc126f8525eba0f
SimHash 62a2c9006bf4

Groups

*

Rule Path
Allow /wp-content/
Allow /wp-includes/
Allow /wp-admin/admin-ajax.php
Disallow /wp-admin/
Disallow /ld-groups/
Disallow /wp-content/uploads/wc-logs/
Disallow /wp-content/uploads/course-certificates/
Disallow /*?wc-ajax=
Disallow /*redirect_to%3D
Disallow /*utm_%3D
Disallow /*fbclid%3D
Disallow /*gclid%3D
Disallow /*mc_cid%3D
Disallow /*orderby%3D
Disallow /*min_price%3D
Disallow /*max_price%3D
Disallow /*wishlist-action

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://greencultured.co/sitemap_index.xml

Comments

  • Allow full rendering (CSS, JS, images)
  • Disallow private and unnecessary areas
  • Disallow crawl-wasting query strings
  • Set crawl-delay for ALL bots (most respect it except Google)
  • Sitemap declarations (double-included for compatibility)