tvjornal.com.br
robots.txt

Robots Exclusion Standard data for tvjornal.com.br

Resource Scan

Scan Details

Site Domain tvjornal.com.br
Base Domain tvjornal.com.br
Scan Status Ok
Last Scan 2024-06-23T21:28:50+00:00
Next Scan 2024-06-30T21:28:50+00:00

Last Scan

Scanned 2024-06-23T21:28:50+00:00
URL https://tvjornal.com.br/robots.txt
Domain IPs 104.21.21.41, 172.67.196.97, 2606:4700:3032::6815:1529, 2606:4700:3035::ac43:c461
Response IP 172.67.196.97
Found Yes
Hash ee38d978ebff8578a84ae1770cef3b4a08f8e8c471eb2ed78e3a296f4dd6ad3d
SimHash ea485ea3d9f3

Groups

*

Rule Path
Disallow /_temp/
Disallow /src/
Disallow /cdn/
Disallow /assets/
Disallow /*.pdf$
Disallow /*.json$
Disallow /search/*
Allow /*.jpg
Allow /*.JPG
Allow /*.jpeg
Allow /*.JPEG
Allow /*.png
Allow /*.PNG
Allow /*.gif
Allow /*.GIF
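
The wildcard rules in the `*` group can be hard to read at a glance, so here is a minimal sketch of how a crawler might evaluate them. It assumes the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie); individual crawlers may behave differently. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and not part of the scan data.

```python
import re

# Rules copied from the "*" group above, as (kind, path pattern) pairs.
RULES = [
    ("disallow", "/_temp/"),
    ("disallow", "/src/"),
    ("disallow", "/cdn/"),
    ("disallow", "/assets/"),
    ("disallow", "/*.pdf$"),
    ("disallow", "/*.json$"),
    ("disallow", "/search/*"),
    ("allow", "/*.jpg"),
    ("allow", "/*.JPG"),
    ("allow", "/*.jpeg"),
    ("allow", "/*.JPEG"),
    ("allow", "/*.png"),
    ("allow", "/*.PNG"),
    ("allow", "/*.gif"),
    ("allow", "/*.GIF"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under the assumed precedence."""
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

Under these assumptions, `/imagens/foto.jpg` is allowed, `/noticias/report.pdf` is blocked, and `/assets/logo.png` is also blocked because `/assets/` is a longer pattern than `/*.png`.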

facebot

Rule Path
Allow /imagens/

facebookexternalhit

Rule Path
Allow /imagens/

googlebot-news

Rule Path
Allow *

yandex

Rule Path
Disallow /

slurp

Rule Path
Disallow /

baidoospider

Rule Path
Disallow /
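
Which of the groups above applies depends on the crawler's user-agent product token. The sketch below assumes the simple selection rule from RFC 9309: compare the token case-insensitively against each group name and fall back to `*` when nothing matches. Real crawlers may apply extra fallbacks of their own; `GROUP_NAMES` and `select_group` are hypothetical names used only for illustration.

```python
# Group names exactly as listed in the scan above.
GROUP_NAMES = [
    "*",
    "facebot",
    "facebookexternalhit",
    "googlebot-news",
    "yandex",
    "slurp",
    "baidoospider",
]


def select_group(product_token, group_names=GROUP_NAMES):
    """Return the group name a crawler with this product token would use."""
    token = product_token.strip().lower()
    for name in group_names:
        # Skip the catch-all group; it is only the fallback.
        if name != "*" and name.lower() == token:
            return name
    return "*"
```

With this rule, `select_group("Yandex")` returns `"yandex"`, while a crawler identifying as `Baiduspider` does not match the `baidoospider` group as recorded and falls back to `*`.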

Comments

  • Sitemap: https://jc.ne10.uol.com.br/sitemap.xml