jornaldocomercio.com.br
robots.txt

Robots Exclusion Standard data for jornaldocomercio.com.br

Resource Scan

Scan Details

Site Domain jornaldocomercio.com.br
Base Domain jornaldocomercio.com.br
Scan Status Ok
Last Scan2026-02-26T03:00:58+00:00
Next Scan 2026-03-05T03:00:58+00:00

Last Scan

Scanned2026-02-26T03:00:58+00:00
URL https://jornaldocomercio.com.br/robots.txt
Redirect https://www.jornaldocomercio.com/robots.txt
Redirect Domain www.jornaldocomercio.com
Redirect Base jornaldocomercio.com
Domain IPs 104.21.52.56, 172.67.195.241, 2606:4700:3030::6815:3438, 2606:4700:3036::ac43:c3f1
Redirect IPs 104.26.8.35, 104.26.9.35, 172.67.72.107, 2606:4700:20::681a:823, 2606:4700:20::681a:923, 2606:4700:20::ac43:486b
Response IP 172.67.72.107
Found Yes
Hash 79d3234909523b705024af282387350f8ae549a747ac123e9f56aa73b79b1a61
SimHash 48847ab7a913

Groups

slurp

Rule Path
Disallow /

baidoospider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

slurp

Rule Path
Disallow /

baidoospider

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

*

Rule Path
Allow /
Disallow /_conteudos/
Disallow /*.json
Disallow /*.php
Disallow /webparts/
Disallow /search/
Disallow /tags/
Disallow /autor/
Disallow /site/*
Allow /site/noticia.php

Other Records

Field Value
sitemap https://www.jornaldocomercio.com/sitemap.xml