wake.tech
robots.txt

Robots Exclusion Standard data for wake.tech

Resource Scan

Scan Details

Site Domain wake.tech
Base Domain wake.tech
Scan Status Ok
Last Scan2025-09-16T08:40:19+00:00
Next Scan 2025-10-16T08:40:19+00:00

Last Scan

Scanned2025-09-16T08:40:19+00:00
URL https://wake.tech/robots.txt
Domain IPs 104.26.4.215, 104.26.5.215, 172.67.69.80, 2606:4700:20::681a:4d7, 2606:4700:20::681a:5d7, 2606:4700:20::ac43:4550
Response IP 104.26.5.215
Found Yes
Hash 57e08e0686e493b60c6c9ea6d1fd824466c5b60761d1587435213d5547cb1724
SimHash 685e9d2a8e0b

Groups

*

Rule Path
Disallow /readme.html
Disallow /license.txt
Disallow /wp-config.php
Disallow /xmlrpc.php
Disallow /*.php$
Disallow /?s=*
Disallow /search/
Disallow /trackback/
Disallow /feed/
Disallow /comments/feed/
Disallow /*?replytocom=*
Allow /*?utm_*
Allow /*?fbclid=*
Allow /*?gclid=*
Allow /wp-admin/admin-ajax.php
Allow /wp-content/
Allow /wp-includes/
Allow /*.css
Allow /*.js
Allow /*.png
Allow /*.jpg
Allow /*.jpeg
Allow /*.gif
Allow /*.webp
Allow /*.svg

claude-user
chatgpt-user
perplexity-user
gemini-deep-research

Rule Path
Disallow
Allow /

Other Records

Field Value
sitemap https://wake.tech/sitemap_index.xml

Comments

  • Bloqueia arquivos sensíveis e irrelevantes
  • URLs dinâmicas e irrelevantes para indexação
  • Permite parâmetros de campanhas
  • Permite acesso a recursos essenciais para renderização
  • Regras específicas para agentes de IA