cml.pr.gov.br
robots.txt

Robots Exclusion Standard data for cml.pr.gov.br

Resource Scan

Scan Details

Site Domain cml.pr.gov.br
Base Domain cml.pr.gov.br
Scan Status Ok
Last Scan 2024-06-17T20:54:48+00:00
Next Scan 2024-07-17T20:54:48+00:00

Last Scan

Scanned 2024-06-17T20:54:48+00:00
URL https://cml.pr.gov.br/robots.txt
Domain IPs 200.155.62.205, 2002::c89b:3ecd
Response IP 200.155.62.205
Found Yes
Hash f92fb65d2083d22b1d0459611f3c7fff68c525731f1b986a30f8813117355c2c
SimHash cf7a74018439
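
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not say which algorithm or which bytes were hashed. As a minimal sketch, assuming the digest is SHA-256 over the raw response body, a comparable fingerprint could be computed as follows. The function name `robots_digest` and the sample body are hypothetical and are not taken from the scan.

```python
import hashlib


def robots_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the report's Hash field is SHA-256 over the raw bytes.
    The source does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file from cml.pr.gov.br.
sample = b"User-agent: *\nDisallow: /WEB-INF/\n"
print(robots_digest(sample))
```

Comparing digests between two scans is a cheap way to detect that the file changed at all; it says nothing about how much it changed, which is presumably what the separate SimHash field is for.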

Groups

*

Rule Path
Allow /cml/site/*.xhtml
Allow /cml/site/*.jspx
Allow /cml/site/*.jsp
Allow /cml/site/index.xhtml
Allow /cml/site/ini.jspx
Allow /cml/site/noticiadetalha.xhtml
Allow /cml/site/historia.xhtml
Allow /cml/site/mesaexecutiva.xhtml
Allow /cml/site/comissaodeetica.xhtml
Allow /cml/site/legislaturaoutras.xhtml
Allow /cml/site/galeriapresidentes.xhtml
Allow /cml/site/galeriamulheres.xhtml
Allow /cml/site/conhecalondrina.xhtml
Allow /cml/site/vereadores.xhtml
Allow /cml/site/comissoes.xhtml
Allow /cml/site/reppartidaria.xhtml
Allow /cml/site/reporgaos.xhtml
Allow /cml/site/comissoesinquerito.xhtml
Allow /cml/site/pesquisarepre.xhtml
Allow /cml/site/pautapri.xhtml
Allow /cml/site/reuniaocomissao.xhtml
Allow /cml/site/legempauta.xhtml
Allow /cml/site/aovivo.xhtml
Allow /cml/site/anteriores.xhtml
Allow /cml/site/salaimagem.xhtml
Allow /cml/site/aconteceu.xhtml
Allow /cml/site/simbolos.xhtml
Allow /nossoshinos/index.htm
Allow /cml/site/livros.xhtml
Allow /cml/site/pesquisaleis.xhtml
Allow /cml/site/pesquisaproj.xhtml
Allow /cml/site/pesquisapi.xhtml
Allow /cml/site/pesquisareq.xhtml
Allow /cml/site/pesquisain.xhtml
Allow /cml/site/videos.xhtml
Allow /cml/site/pesquisanoticias.xhtml
Allow /cml/site/pesquisaatas.xhtml
Allow /cml/site/pesquisaagenda.xhtml
Allow /cml/site/transparencia.xhtml
Allow /cml/site/linhadireta.xhtml
Allow /cml/site/contatovereadores.xhtml
Allow /cml/site/contatodepartamentos.xhtml
Disallow /*.js$
Disallow /*.js$
Disallow /*.gif$
Disallow /*.jpg$
Disallow /*.jpeg$
Disallow /*.png$
Disallow /*.bmp$
Disallow /META-INF/
Disallow /WEB-INF/
Disallow /cml/META-INF/
Disallow /cml/WEB-INF/
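
The `*` group above mixes Allow and Disallow rules that use the `*` wildcard and the `$` end anchor. The sketch below shows one way such rules could be evaluated, assuming the longest-match semantics described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The helper names `pattern_to_regex` and `is_allowed` are hypothetical, and `RULES` is only a small subset of the table. Individual crawlers may resolve conflicts differently.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is treated as a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(rules, path: str) -> bool:
    """Apply longest-match semantics; paths with no matching rule are allowed."""
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longer pattern is more specific; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                best_allow = kind == "allow"
    return best_allow


# Subset of the '*' group listed above.
RULES = [
    ("allow", "/cml/site/*.xhtml"),
    ("allow", "/cml/site/index.xhtml"),
    ("disallow", "/*.js$"),
    ("disallow", "/*.png$"),
    ("disallow", "/cml/WEB-INF/"),
]

print(is_allowed(RULES, "/cml/site/index.xhtml"))  # True
print(is_allowed(RULES, "/cml/site/js/menu.js"))   # False
print(is_allowed(RULES, "/cml/WEB-INF/web.xml"))   # False
```

Under these semantics the per-page Allow lines are already covered by the `/cml/site/*.xhtml` pattern, so they do not change the outcome for those pages.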

amazonbot

Rule Path
Disallow /

a6-indexer

Rule Path
Disallow /

alphaseobot

Rule Path
Disallow /

alphaseobot-sa

Rule Path
Disallow /

applebot

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

bingbot/2.0

Rule Path
Disallow /

blackboard safeassign

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

liebaofast

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

mauibot (crawler.feedback+wc@gmail.com)

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

mqqbrowser

Rule Path
Disallow /

nimbostratus-bot/v1.3.2

Rule Path
Disallow /

qwant-news

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sputnikbot/2.3

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

tinytestbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

ucbrowser

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /
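
Each named group above applies only to the crawler it names; every other crawler falls back to the `*` group. The sketch below illustrates that selection step, assuming a plain case-insensitive comparison of the whole group name against the crawler's product token. The function `select_group` and the `GROUPS` subset are hypothetical. How a real parser treats names that carry a version or spaces, such as `bingbot/2.0` or `seekport crawler`, is implementation-specific and is not modelled here.

```python
def select_group(groups, product_token: str):
    """Pick the group name that applies to a crawler.

    Assumption: exact, case-insensitive match on the full group name,
    with fallback to '*' when no named group matches.
    """
    token = product_token.lower()
    for name in groups:
        if name != "*" and name.lower() == token:
            return name
    return "*" if "*" in groups else None


# Subset of the group names listed in this report.
GROUPS = ["*", "amazonbot", "applebot", "bingbot/2.0", "yandexbot"]

print(select_group(GROUPS, "Applebot"))   # applebot
print(select_group(GROUPS, "Googlebot"))  # *
```

With this strict comparison a crawler identifying itself as `bingbot` would not match the `bingbot/2.0` entry and would fall through to `*`; a parser that truncates the name at the first non-token character would behave differently.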

Other Records

Field Value
sitemap https://www1.cml.pr.gov.br/cml/site/sitemap.txt

Comments

  • Disallow: /site/estilos/
  • Disallow: /site/gifs/
  • Disallow: /site/imagens/
  • Disallow: /site/includes/
  • Disallow: /site/js/
  • Disallow: /cml/site/imagens/
  • Disallow: /cml/site/estilos/
  • Disallow: /cml/site/leidetalhe.xhtml
  • Disallow: /cml/site/projetodetalhe.xhtml
  • Disallow: leidetalhe.xhtml
  • Disallow: projetodetalhe.xhtml
  • Disallow: /estilos/
  • Disallow: /gifs/
  • Disallow: /includes/
  • Disallow: /js/
  • Block bots
  • Block bots
  • RDH, 08.19.19: I really don't want to block Applebot, but for now, I am. It is crawling us too much
  • RDH, 05.13.20: I really don't want to block bing, but for now, I am. It is also already in htaccess rules
  • RDH, 06.30.21: Very temporary to get some relief.
  • User-Agent: Googlebot
  • Disallow: /