canalcienciascriminais.jusbrasil.com.br
robots.txt

Robots Exclusion Standard data for canalcienciascriminais.jusbrasil.com.br

Resource Scan

Scan Details

Site Domain canalcienciascriminais.jusbrasil.com.br
Base Domain jusbrasil.com.br
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-04-29T16:37:42+00:00
Next Scan 2024-07-28T16:37:42+00:00

Last Successful Scan

Scanned2023-03-30T05:03:02+00:00
URL https://canalcienciascriminais.jusbrasil.com.br/robots.txt
Domain IPs 104.18.208.80, 104.18.209.80
Response IP 104.18.208.80
Found Yes
Hash 33bfad0edda3c0533f18660fbdbcde1aa8d75071f7d4609d3ef313eacbb9e64c
SimHash 6223da529410

Groups

*

Rule Path
Disallow */api/
Disallow /ads.txt
Disallow /ajax
Allow */busca?q=*&p=1$
Disallow */busca?q=*&p=*
Disallow */busca?*&l=*
Disallow */busca?*&idtopico=*
Disallow */busca?*&tribunal=*
Disallow */busca?*&o=*
Disallow */busca?*&c=*
Disallow */busca?*&dateFrom=*
Disallow */busca?*&dateTo=*
Disallow /jurisprudencia/busca?*&hasDecisionFacts=*
Disallow /jurisprudencia/busca?*&jurisType=*
Disallow /jurisprudencia/busca?*&orgaojulgador=*
Disallow /doutrina/busca?*&tipoDoutrina=*
Disallow /doutrina/busca?*&areasDireito=*
Disallow /doutrina/busca?*&idDoutrina=*
Disallow /diarios/busca?*&journalType=*
Disallow /diarios/busca?*&journalUfIdentifiers=*
Disallow */comentarios/*
Disallow /diarios/documentos/*/andamento-do-processo-n-*
Disallow /diarios/documentos/*/20*/*
Disallow /advogados/*p%3D
Disallow /advogados/*r%3D
Disallow /advogados/*rand%3D
Disallow /diarios/busca
Disallow /legislacao/busca
Disallow /topicos/busca
Disallow /perfil/busca
Disallow /consulta-processual/busca
Disallow /consulta-processual/goto/
Allow /diarios/busca?q=escreva%2Baqui%2Bo%2Bque%2Bdeseja%2Bpesquisar&idtopico=T28300130
Disallow */busca?q=*%28Preso%29
Disallow /processos/consulta
Disallow /processos/nome/*/*/*/
Allow /processos/nome/*/*/$
Disallow /processos/nome/*/*/*
Disallow */graphql
Disallow /box/event
Disallow /convert
Disallow /events
Disallow /participate
Disallow /cdn-cgi/
Disallow /seguidos
Disallow /recomendacoes
Disallow /seguidores
Disallow /livros
Disallow */editar$
Disallow /editar-perfil
Disallow /login
Disallow /cadastro
Disallow *?next_url=*
Disallow *%26next_url%3D*
Disallow *?cameFrom=*
Disallow *%26cameFrom%3D*
Disallow /doutrina/secao/*/agradecimento-*
Disallow /doutrina/secao/*/agradecimentos-*
Disallow /doutrina/secao/*/apresentacao-*
Disallow /doutrina/secao/*/assinatura-*
Disallow /doutrina/secao/*/bibliografia-*
Disallow /doutrina/secao/*/conclusoes-*
Disallow /doutrina/secao/*/creditos-*
Disallow /doutrina/secao/*/dedicatoria-*
Disallow /doutrina/secao/*/epigrafe-*
Disallow /doutrina/secao/*/expediente-*
Disallow /doutrina/secao/*/ficha-catalografica-*
Disallow /doutrina/secao/*/nota-*
Disallow /doutrina/secao/*/nota-previa-*
Disallow /doutrina/secao/*/perfil-dos-autores-*
Disallow /doutrina/secao/*/pesquisa-de-satisfacao-*
Disallow /doutrina/secao/*/pre-textuais-*
Disallow /doutrina/secao/*/pre-textual-*
Disallow /doutrina/secao/*/prefacio-*
Disallow /doutrina/secao/*/primeiras-paginas-*
Disallow /doutrina/secao/*/secao-interativa-*
Disallow /legislacao/*/editar
Disallow /legislacao/*/deletar
Disallow /jurisprudencia/busca?q=*&mandatoryPrecedent=*
Disallow /jurisprudencia/busca?q=*&popularPrecedent=*
Disallow /jurisprudencia/busca?q=*&matchExactTerm=*
Disallow /jurisprudencia/busca?q=*&notContainingTerm=*
Disallow /jurisprudencia/busca?q=*&matchAnyWordTerm=*
Disallow /jurisprudencia/busca?q=*&matchAllWordsTerm=*
Disallow /jurisprudencia/busca?q=*&admissibilityDecision=*
Disallow /jurisprudencia/busca?q=*&meritDecision=*
Disallow /error-pages/*

mediapartners-google

Rule Path
Allow /

bdcbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.jusbrasil.com.br/sitemap-index/static/sitemap-index.xml

Comments

  • Disable search pagination
  • Specific Artifact SERP
  • Allow specific SERP that's linked from DJE MG - see "pesquisa avancada" in https://www.tjmg.jus.br/portal-tjmg/dje/
  • Disable specific search
  • https://jusbrasil.slack.com/archives/CQ64A8AAG/p1662488814569839
  • Lawsuits
  • Should be allowed
  • https://www.jusbrasil.com.br/processos/nome/45622652/francisco-costa-peixoto-guimaraes
  • https://www.jusbrasil.com.br/processos/nome/45622652/francisco-costa-peixoto-guimaraes/
  • Shouldn't be allowed:
  • https://www.jusbrasil.com.br/processos/nome/45622652/francisco-costa-peixoto-guimaraes/artigos
  • https://www.jusbrasil.com.br/processos/nome/45622652/francisco-costa-peixoto-guimaraes/artigos/
  • Disable GraphQL client-side
  • Events
  • https://support.cloudflare.com/hc/en-us/articles/200169806-Troubleshooting-crawl-errors#h_40DxOK4QOfQeoBqAG4wp9a
  • Disable user profile routes
  • Disable editing routes
  • Disable other unnecessary routes
  • Disable specific doctrine sections - https://github.com/jusbrasil/doctrine-parser/blob/main/resources/non-indexable-titles.txt
  • Disable unnecessary legis routes
  • Disable jurisprudence advanced filter params