trionoticias.com.br
robots.txt

Robots Exclusion Standard data for trionoticias.com.br

Resource Scan

Scan Details

Site Domain trionoticias.com.br
Base Domain trionoticias.com.br
Scan Status Ok
Last Scan2024-09-28T12:41:06+00:00
Next Scan 2024-10-05T12:41:06+00:00

Last Scan

Scanned2024-09-28T12:41:06+00:00
URL https://trionoticias.com.br/robots.txt
Domain IPs 187.1.142.117, 2804:10:8021::142:117
Response IP 187.1.142.117
Found Yes
Hash 65c88db6e9a17e38ed282df912321f544849563ed7f0f164521da82bae18a30c
SimHash 49147dd7d3e1

Groups

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

adsbot-google-mobile-apps

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

google favicon

Rule Path
Allow *

feedfetcher-google

Rule Path
Allow *

bingbot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

facebot

Rule Path
Allow *

facebookexternalhit

Rule Path
Allow *

ia_archiver

Rule Path
Allow *

twitterbot

Rule Path
Allow *

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ct76

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

*

Rule Path
Disallow /utils/

Other Records

Field Value
sitemap https://trionoticias.com.br/sitemap/sitemap.xml

Comments

  • Trio Notícias