queresapostar.pt
robots.txt

Robots Exclusion Standard data for queresapostar.pt

Resource Scan

Scan Details

Site Domain queresapostar.pt
Base Domain queresapostar.pt
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-12-21T01:05:42+00:00
Next Scan 2026-01-04T01:05:42+00:00

Last Successful Scan

Scanned2025-11-12T04:51:19+00:00
URL https://queresapostar.pt/robots.txt
Domain IPs 104.21.74.121, 172.67.158.91, 2606:4700:3032::ac43:9e5b, 2606:4700:3033::6815:4a79
Response IP 104.21.74.121
Found Yes
Hash 60aaf6bf5028430a1eb90a08b81ac04449adeb9cdf211e562164e0c2f5af9b00
SimHash 6b6e984084f2

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/

facebookbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://queresapostar.pt/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK
  • BLOQUEIO DE CRAWLERS DE IA
  • ---------------------------
  • Impede o acesso dos crawlers de IA da Meta (Facebook, Instagram, Threads)
  • (Opcional) Impede outros crawlers de IA conhecidos:
  • OpenAI (ChatGPT)
  • Google AI (ex: Google-Extended usado para treino de Bard/Gemini)
  • Amazon (Alexa e modelos internos)
  • Common Crawl (utilizado para treinar mĂșltiplas IAs, incluindo LLaMA, GPT e outros)
  • Perplexity AI
  • ---------------------------
  • FIM DO BLOQUEIO DE CRAWLERS DE IA