dicadagrecia.com.br
robots.txt

Robots Exclusion Standard data for dicadagrecia.com.br

Resource Scan

Scan Details

Site Domain dicadagrecia.com.br
Base Domain dicadagrecia.com.br
Scan Status Ok
Last Scan2026-03-09T05:20:43+00:00
Next Scan 2026-03-16T05:20:43+00:00

Last Scan

Scanned2026-03-09T05:20:43+00:00
URL https://dicadagrecia.com.br/robots.txt
Domain IPs 104.21.47.193, 172.67.172.36, 2606:4700:3031::ac43:ac24, 2606:4700:3033::6815:2fc1
Response IP 172.67.172.36
Found Yes
Hash e88351d381dff2552d35ac30968c532bfe281894ae5682a69b63dcb51d13e8d0
SimHash 5b3a09c0a69a

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-login.php
Disallow /xmlrpc.php
Disallow /?s=
Disallow /search
Disallow /trackback/
Disallow /readme.html
Disallow /license.txt
Disallow /cdn-cgi/
Disallow /private/

ccbot
perplexitybot
anthropic-ai
claudebot
omgilibot
diffbot
applebot-extended

Rule Path
Disallow /

amazonbot
petalbot
semrushbot
semrushbot-sa
bytespider
seobilitybot
ahrefsbot
blexbot
dotbot
linkdexbot
scrapy
yandexbot
baiduspider
sogou
mj12bot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://network.grupodicas.com/news-sitemap.xml
sitemap https://network.grupodicas.com/sitemap_index.xml

Comments

  • robots.txt – SEOX / Publisher - 2026-01
  • ------------------------------------------------------------------
  • BLOQUEIO GERAL & ROTAS ADMINISTRATIVAS
  • ------------------------------------------------------------------
  • ------------------------------------------------------------------
  • AI CRAWLERS & LLM SCRAPERS
  • ------------------------------------------------------------------
  • ------------------------------------------------------------------
  • SEO / SCRAPING BOTS AGRESSIVOS
  • ------------------------------------------------------------------
  • ------------------------------------------------------------------
  • SITEMAPS
  • ------------------------------------------------------------------