canaltech.com.br
robots.txt

Robots Exclusion Standard data for canaltech.com.br

Resource Scan

Scan Details

Site Domain canaltech.com.br
Base Domain canaltech.com.br
Scan Status Ok
Last Scan 2024-11-13T19:19:18+00:00
Next Scan 2024-11-20T19:19:18+00:00

Last Scan

Scanned 2024-11-13T19:19:18+00:00
URL https://canaltech.com.br/robots.txt
Domain IPs 186.195.65.65
Response IP 186.195.65.65
Found Yes
Hash 32438db1f3cc9c1014667e8d9fa23a843dea004040285afa3d09a63ad95eb905
SimHash 4919682de913

Groups

*

Rule Path
Disallow /api/
Disallow /rss/
Allow /rss/google-assistente/
Disallow /empresa/*/produtos/
Disallow /deeplink/

siteauditbot

Rule Path
Disallow /rss/

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /
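How these groups combine for a given crawler can be sketched in Python. This is a minimal illustration, not the scanner's or any crawler's actual logic. It assumes the matching rules described in RFC 9309: a crawler uses its own group if one exists and otherwise falls back to `*`, the longest matching path pattern wins, Allow wins a tie, `*` matches any run of characters, and a trailing `$` anchors the end of the path. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical, and the user-agent lookup is simplified to an exact, case-insensitive token match.

```python
import re

# Groups copied from the scan above (user-agent token -> ordered rules).
GROUPS = {
    "*": [
        ("disallow", "/api/"),
        ("disallow", "/rss/"),
        ("allow", "/rss/google-assistente/"),
        ("disallow", "/empresa/*/produtos/"),
        ("disallow", "/deeplink/"),
    ],
    "siteauditbot": [("disallow", "/rss/")],
    "gptbot": [("disallow", "/")],
    "chatgpt-user": [("disallow", "/")],
    "anthropic-ai": [("disallow", "/")],
    "yandexbot": [("disallow", "/")],
}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*' and a trailing '$' becomes an end-of-path anchor;
    every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if `path` may be fetched by `user_agent` under GROUPS.

    Simplification (assumption): the user agent is looked up as an exact
    lowercase token; real crawlers match their product token more loosely.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longest pattern wins; on equal length, Allow beats Disallow.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed
```

Under these assumptions, a crawler without its own group (for example one identifying as `Googlebot`) is blocked from `/rss/` in general but may still fetch `/rss/google-assistente/`, because the Allow pattern is longer than the Disallow pattern. A crawler with its own group, such as `siteauditbot`, is governed only by that group, so the `/api/` rule in the `*` group does not apply to it.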

Other Records

Field Value
sitemap https://static.canaltech.com.br/smap/geral.xml