radiojornal.com.br
robots.txt

Robots Exclusion Standard data for radiojornal.com.br

Resource Scan

Scan Details

Site Domain radiojornal.com.br
Base Domain radiojornal.com.br
Scan Status Ok
Last Scan2024-06-09T19:34:06+00:00
Next Scan 2024-06-16T19:34:06+00:00

Last Scan

Scanned2024-06-09T19:34:06+00:00
URL https://radiojornal.com.br/robots.txt
Domain IPs 104.21.48.111, 172.67.185.124, 2606:4700:3034::ac43:b97c, 2606:4700:3035::6815:306f
Response IP 104.21.48.111
Found Yes
Hash 578f0fc698c89cd5ca233ab147315050cbf7d140079ad0114a29ab7e5a22970c
SimHash e848daa3f9b3

Groups

*

Rule Path
Disallow /_temp/
Disallow /src/
Disallow /cdn/
Disallow /assets/
Disallow /*.pdf$
Disallow /*.json$
Disallow /search/*
Allow /*.jpg
Allow /*.JPG
Allow /*.jpeg
Allow /*.JPEG
Allow /*.png
Allow /*.PNG
Allow /*.gif
Allow /*.GIF

facebot

Rule Path
Allow /imagens/

facebookexternalhit

Rule Path
Allow /imagens/

googlebot-news

Rule Path
Allow *

yandex

Rule Path
Disallow /

slurp

Rule Path
Disallow /

baidoospider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://radiojornal.com.br/sitemap/1.xml
sitemap https://radiojornal.com.br/sitemap/map/day/sitemap.xml
sitemap https://radiojornal.com.br/sitemap/map/day/sitemap-news.xml