kungfudirect.com
robots.txt

Robots Exclusion Standard data for kungfudirect.com

Resource Scan

Scan Details

Site Domain kungfudirect.com
Base Domain kungfudirect.com
Scan Status Ok
Last Scan 2025-11-07T12:17:30+00:00
Next Scan 2025-11-08T12:17:30+00:00

Last Scan

Scanned 2025-11-07T12:17:30+00:00
URL https://kungfudirect.com/robots.txt
Domain IPs 192.124.249.175, 2a02:fe80:1010::25:10
Response IP 192.124.249.175
Found Yes
Hash 6685237f65f757ae469535306aae493951ffcf5bd404bcbf4254c3e4f0ce752b
SimHash 209c0843e4d2

Groups

*

Rule Path
Disallow /admin/
Disallow /cart/
Disallow /checkout/
Disallow /account/
Disallow /order/
Disallow /wishlist/
Disallow /compare/
Disallow /search/
Disallow /cgi-bin/
Disallow /tmp/
Disallow /logs/
Disallow /*?route=
Disallow /*?tracking=
Disallow /*?currency=
Disallow /*?sort=
Disallow /*?order=
Disallow /*?limit=
Disallow /*?page=
Disallow /*?filter=
Disallow /*?variant=
Disallow /*?search=
Disallow /*?manufacturer_id=
Disallow /*?tag=
Disallow /*?utm_source=
Disallow /*?utm_medium=
Disallow /*?utm_campaign=
Disallow /*?utm_term=
Disallow /*?utm_content=
Disallow /*?gclid=
Disallow /*?fbclid=
Disallow /*?preview_theme_id*
Disallow /*?preview_script_id*
Disallow /*?sessionid=
Allow /*.css$
Allow /*.js$
Allow /*.woff$
Allow /*.woff2$
Allow /*.png$
Allow /*.jpg$
Allow /*.jpeg$
Allow /*.svg$
Allow /*.gif$
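
The wildcard rules in this group can be hard to read at a glance. The following is a minimal sketch of how a crawler that follows the longest-match convention of the Robots Exclusion Protocol might evaluate them. It is not the site's or any crawler's actual implementation: the function names `pattern_to_regex` and `is_allowed` are hypothetical, and `RULES` is an abbreviated subset of the group above.

```python
import re

# Abbreviated subset of the "*" group listed above (illustrative only).
RULES = [
    ("disallow", "/cart/"),
    ("disallow", "/*?sort="),
    ("allow", "/*.css$"),
    ("allow", "/*.png$"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start.

    `*` matches any run of characters; a trailing `$` anchors the end of the URL.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(rules, path):
    """Return True if `path` may be fetched under `rules`.

    The longest matching pattern wins; on a tie, Allow wins.
    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]

print(is_allowed(RULES, "/cart/view"))           # blocked by /cart/
print(is_allowed(RULES, "/catalog?sort=price"))  # blocked by /*?sort=
print(is_allowed(RULES, "/cart/style.css"))      # /*.css$ is longer than /cart/
```

Under this reading, patterns such as `/*?sort=` match only when the parameter comes directly after the `?`. A URL like `/catalog?color=red&sort=price` would not match that rule, because `sort=` is preceded by `&` rather than `?`.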

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

gptbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

ccbot

Rule Path
Allow /

claudebot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

google-extended

Rule Path
Allow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

pinterest

Rule Path
Disallow /
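
A crawler normally obeys only one group: the one whose user-agent token matches its own, falling back to `*` when none matches. The sketch below illustrates that selection step under the assumption of a case-insensitive exact match on the product token. Real crawlers may match differently, the helper name `select_group` is hypothetical, and `GROUPS` is an abbreviated copy of the data above.

```python
# Abbreviated copy of the groups listed above (illustrative only).
GROUPS = {
    "*": [("disallow", "/admin/"), ("disallow", "/cart/")],
    "googlebot": [("allow", "/")],
    "gptbot": [("allow", "/")],
    "ahrefsbot": [("disallow", "/")],
}

def select_group(groups, product_token):
    """Pick the rule group for a crawler.

    Uses the group named after the crawler's product token (compared
    case-insensitively) and falls back to the "*" group otherwise.
    """
    return groups.get(product_token.lower(), groups.get("*", []))

print(select_group(GROUPS, "Googlebot"))    # its own group: Allow /
print(select_group(GROUPS, "DuckDuckBot"))  # no own group: falls back to "*"
```

One consequence of this selection rule: a crawler with its own `Allow /` group, such as googlebot or gptbot here, would not inherit the `Disallow` rules from the `*` group, so paths like `/admin/` and `/cart/` are not excluded for it.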

Other Records

Field Value
sitemap https://www.kungfudirect.com/sitemap.xml

Comments

  • KungFuDirect Robots.txt
  • Optimized for Google SEO + AI discovery (ChatGPT, Bing Copilot, Perplexity)
  • --- MAIN SITEMAP ---
  • GENERAL RULES FOR ALL CRAWLERS
  • Block system and user/account sections
  • Allow important informational pages (policies, FAQ, service)
  • System and temporary folders
  • Common URL parameters (duplicate/filtered pages)
  • Allow required assets for proper rendering
  • MAJOR SEARCH ENGINES
  • AI CRAWLERS (OPEN FOR DISCOVERY)
  • These bots power ChatGPT, Perplexity, Claude, etc.
  • BLOCK KNOWN AGGRESSIVE OR NON-USEFUL BOTS
  • END OF FILE