portalt5.com.br
robots.txt

Robots Exclusion Standard data for portalt5.com.br

Resource Scan

Scan Details

Site Domain portalt5.com.br
Base Domain portalt5.com.br
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a server error.
Last Scan 2026-01-30T10:42:32+00:00
Next Scan 2026-02-13T10:42:32+00:00

Last Successful Scan

Scanned 2026-01-15T10:35:32+00:00
URL https://portalt5.com.br/robots.txt
Redirect https://www.portalt5.com.br/robots.txt
Redirect Domain www.portalt5.com.br
Redirect Base portalt5.com.br
Domain IPs 104.26.12.109, 104.26.13.109, 172.67.74.237, 2606:4700:20::681a:c6d, 2606:4700:20::681a:d6d, 2606:4700:20::ac43:4aed
Redirect IPs 104.26.12.109, 104.26.13.109, 172.67.74.237, 2606:4700:20::681a:c6d, 2606:4700:20::681a:d6d, 2606:4700:20::ac43:4aed
Response IP 104.26.13.109
Found Yes
Hash 222a5527da9f8dd4e1275963bc705c2bc0248a892a2feaa8bd8647ca6f81c46a
SimHash 231819225f76

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.portalt5.com.br/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://www.portalt5.com.br/
  • live - don't allow web crawlers to index cpresources/ or vendor/
  • Disallow ChatGPT bot, as there's no benefit to allowing it to index your site
  • Disallow Google Bard and Vertex AI bots, as there's no benefit to allowing it to index your site
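
The groups and sitemap record listed above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser (site_maps() needs Python 3.8 or later). The ROBOTS_TXT text is a reconstruction from the parsed groups in this report, not the verbatim file, so its casing, ordering, and spacing are assumptions. The sample paths (/noticias/, /vendor/autoload.php) and the crawler name ExampleBot are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records sections of this report.
# The original file's exact text (casing, order, comments) is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cpresources/
Disallow: /vendor/
Disallow: /.env
Disallow: /cache/

User-agent: GPTBot
Disallow: /

User-agent: Google-Extended
Disallow: /

Sitemap: https://www.portalt5.com.br/sitemaps-1-sitemap.xml
"""

BASE = "https://www.portalt5.com.br"


def build_parser(text):
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser, user_agent, path):
    """Return True if user_agent may fetch BASE + path under the parsed rules."""
    return parser.can_fetch(user_agent, BASE + path)


parser = build_parser(ROBOTS_TXT)

if __name__ == "__main__":
    # ExampleBot is a hypothetical crawler that falls under the "*" group.
    for agent, path in [
        ("ExampleBot", "/noticias/"),
        ("ExampleBot", "/vendor/autoload.php"),
        ("GPTBot", "/noticias/"),
        ("Google-Extended", "/"),
    ]:
        print(agent, path, is_allowed(parser, agent, path))
    print(parser.site_maps())
```

Parsing from a string rather than fetching the live URL keeps the check reproducible, which matters here because the most recent scan of the resource failed with a server error.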