connect.net.br
robots.txt

Robots Exclusion Standard data for connect.net.br

Resource Scan

Scan Details

Site Domain connect.net.br
Base Domain connect.net.br
Scan Status Ok
Last Scan 2025-12-05T19:06:59+00:00
Next Scan 2026-01-04T19:06:59+00:00

Last Scan

Scanned 2025-12-05T19:06:59+00:00
URL https://connect.net.br/robots.txt
Redirect https://www.connect.net.br/robots.txt
Redirect Domain www.connect.net.br
Redirect Base connect.net.br
Domain IPs 104.26.0.21, 104.26.1.21, 172.67.68.231, 2606:4700:20::681a:115, 2606:4700:20::681a:15, 2606:4700:20::ac43:44e7
Redirect IPs 104.26.0.21, 104.26.1.21, 172.67.68.231, 2606:4700:20::681a:115, 2606:4700:20::681a:15, 2606:4700:20::ac43:44e7
Response IP 104.26.0.21
Found Yes
Hash 1c8aa6ae82865e3cfdc4d8e11052b1b16b731009b0d8c57cae4991caf6cde73c
SimHash 68105a824633

Groups

*

Rule Path
Allow /
Disallow /admin/
Disallow /login/
Disallow /carrinho/
Disallow /cgi-bin

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

gurujibot

Rule Path
Disallow /

hl_ftien_spider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /
Disallow /*/*/*/trackback/$
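
To illustrate how the groups above might be evaluated, here is a minimal Python sketch. It assumes longest-match semantics as described in RFC 9309, with `*` as a wildcard and a trailing `$` as an end anchor. The rules are transcribed from this scan. The function names, the exact-token user-agent lookup, and the sample crawler name "Googlebot" are assumptions made for illustration and do not come from the scanned file. Real crawlers may match user-agent tokens more loosely.

```python
import re

# Crawlers that the scan lists with a single "Disallow /" rule.
BLOCKED_EVERYWHERE = [
    "msiecrawler", "webcopier", "httrack", "microsoft.url.control", "libwww",
    "baiduspider", "gurujibot", "hl_ftien_spider", "sogou spider", "yeti",
]

# Groups transcribed from the scan: user-agent token -> list of (rule, path).
GROUPS = {
    "*": [
        ("allow", "/"),
        ("disallow", "/admin/"),
        ("disallow", "/login/"),
        ("disallow", "/carrinho/"),
        ("disallow", "/cgi-bin"),
    ],
    "yodaobot": [("disallow", "/"), ("disallow", "/*/*/*/trackback/$")],
}
GROUPS.update({name: [("disallow", "/")] for name in BLOCKED_EVERYWHERE})


def pattern_to_regex(path):
    """Translate a robots.txt path: '*' matches any run, a trailing '$' anchors the end."""
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent, url_path):
    """Return True if url_path may be fetched (hypothetical helper, simplified matching)."""
    # Assumption: exact, case-insensitive token lookup, falling back to the "*" group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, best_kind = -1, "allow"  # no matching rule means the path is allowed
    for kind, path in rules:
        if pattern_to_regex(path).match(url_path):
            # The longest matching pattern wins; Allow wins a tie.
            if len(path) > best_len or (len(path) == best_len and kind == "allow"):
                best_len, best_kind = len(path), kind
    return best_kind == "allow"


if __name__ == "__main__":
    for agent, path in [("Googlebot", "/"), ("Googlebot", "/admin/users"),
                        ("baiduspider", "/")]:
        print(agent, path, is_allowed(agent, path))
```

Longest-match is used here rather than first-match because the `*` group lists `Allow /` before its `Disallow` rules. A first-match evaluator would stop at `Allow /` and never reach the more specific paths.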

Other Records

Field Value
sitemap https://connect.net.br/sitemap.xml
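
As a hedged sketch of how a `sitemap` record like the one above can be read programmatically, the following uses Python's standard `urllib.robotparser` (its `site_maps()` method requires Python 3.8 or later). The sample text is a shortened, assumed reconstruction for illustration, not the verbatim file, and `sitemaps_from` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Assumed, shortened reconstruction of the file; only the sitemap line matters here.
SAMPLE = """\
User-agent: *
Allow: /
Disallow: /admin/

Sitemap: https://connect.net.br/sitemap.xml
"""


def sitemaps_from(robots_text):
    """Return the Sitemap URLs declared in a robots.txt body (empty list if none)."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    # site_maps() returns None when no Sitemap field is present.
    return parser.site_maps() or []


if __name__ == "__main__":
    print(sitemaps_from(SAMPLE))
```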

Comments

  • Allow access to all bots
  • Block specific bots
  • Block URLs with specific patterns
  • Sitemap location