netjan.com
robots.txt

Robots Exclusion Standard data for netjan.com

Resource Scan

Scan Details

Site Domain netjan.com
Base Domain netjan.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-09-26T21:04:45+00:00
Next Scan 2025-10-10T21:04:45+00:00

Last Successful Scan

Scanned2025-09-11T19:31:08+00:00
URL https://netjan.com/robots.txt
Domain IPs 185.146.167.195, 2a07:7800::195
Response IP 185.146.167.195
Found Yes
Hash c67437c5ccdefe9bb6e17e47b4fcfd17f4bfd340cc828265c51d7243847db8e9
SimHash 6b6abe45b776

Groups

*

Rule Path
Allow /

googlebot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

chatgpt-user

Rule Path
Allow /

openai-searchbot

Rule Path
Allow /

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

claudebot

Rule Path
Allow /

facebookbot

Rule Path
Allow /
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /uncategorized/
Disallow */trackback/
Disallow /*/*?s=*
Disallow */feed/
Disallow */comments/feed/

Other Records

Field Value
crawl-delay 2

Other Records

Field Value
sitemap https://netjan.com/sitemap_index.xml
sitemap https://netjan.com/sitemap.xml

Comments

  • robots.txt Otimizado para NetJan.comb - Maximo Acesso para IAs
  • === CONFIGURACAO GERAL ===
  • === DIRETRIZES ESPECIFICAS PARA IAs PRINCIPAIS ===
  • Google (Pesquisa + Bard/Gemini)
  • OpenAI (ChatGPT, GPT-4)
  • Microsoft (Bing + Copilot)
  • Anthropic (Claude)
  • Meta (Facebook + AI)
  • === BLOQUEIOS NECESSARIOS ===
  • WordPress Admin (manter segurança)
  • Diretorios tecnicos (ajustados para subdiretório)
  • URLs dinamicas problematicas
  • === SITEMAPS (CRITICO PARA IAs) ===
  • === CONFIGURACOES AVANÇADAS ===