interacao.r7.com
robots.txt

Robots Exclusion Standard data for interacao.r7.com

Resource Scan

Scan Details

Site Domain interacao.r7.com
Base Domain r7.com
Scan Status Ok
Last Scan2025-08-31T17:44:35+00:00
Next Scan 2025-09-30T17:44:35+00:00

Last Scan

Scanned2025-08-31T17:44:35+00:00
URL https://interacao.r7.com/robots.txt
Domain IPs 104.89.120.142
Response IP 104.103.144.12
Found Yes
Hash 515b10a0661aa37adffeec6a049634dc322aff816ab9c678d6f369b7cb41d3c1
SimHash 181d1956ffb1

Groups

proximic

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

twitterbot

Rule Path
Allow

semrushbot-sa

Rule Path
Disallow /

starkbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

voluumdsp-content-bot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

*

Rule Path
Allow /
Disallow /embeds/
Disallow /especiais
Disallow /index2.html
Disallow /teste-oper/*
Disallow /teste-sup/*

Other Records

Field Value
sitemap https://interacao.r7.com/indice_interacao_sitemaps.xml