thecoursegallery.com
robots.txt

Robots Exclusion Standard data for thecoursegallery.com

Resource Scan

Scan Details

Site Domain thecoursegallery.com
Base Domain thecoursegallery.com
Scan Status Ok
Last Scan2025-12-15T01:11:58+00:00
Next Scan 2026-01-14T01:11:58+00:00

Last Scan

Scanned2025-12-15T01:11:58+00:00
URL https://thecoursegallery.com/robots.txt
Domain IPs 104.21.71.40, 172.67.143.22, 2606:4700:3031::6815:4728, 2606:4700:3037::ac43:8f16
Response IP 104.21.71.40
Found Yes
Hash c11f574e5c2a5dba4e3553902931ce1b0e404ae06ac8a183f059bb552900fa85
SimHash e5296882e710

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /cart/
Disallow /checkout/
Disallow /my-account/
Disallow /*?add-to-cart=*
Disallow /*?remove_item=*
Disallow /?s=
Disallow /search/

Other Records

Field Value
sitemap https://thecoursegallery.com/sitemap_index.xml

Comments

  • Block WooCommerce dynamic pages to save crawl budget
  • Block internal search results to prevent spam indexing
  • Sitemap Location