panini.com.br
robots.txt

Robots Exclusion Standard data for panini.com.br

Resource Scan

Scan Details

Site Domain panini.com.br
Base Domain panini.com.br
Scan Status Ok
Last Scan 2025-12-28T12:31:43+00:00
Next Scan 2026-01-27T12:31:43+00:00

Last Scan

Scanned 2025-12-28T12:31:43+00:00
URL https://panini.com.br/robots.txt
Domain IPs 151.101.1.124, 151.101.129.124, 151.101.193.124, 151.101.65.124
Response IP 151.101.65.124
Found Yes
Hash 4b76b2cb3dc069784f9c9312b5992e9237c2fe4d1ccd3a892af342e2e8b6c1fb
SimHash 5537b84b8c10
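
The Hash field above is 64 hexadecimal digits, which is consistent with a SHA-256 digest, although the report does not name the algorithm. A minimal sketch, assuming the value is SHA-256 over the raw response body, of how a later fetch could be compared against it (`sha256_hex` and `matches_recorded` are hypothetical helper names, not part of the scanner):

```python
import hashlib

# Value copied from the Hash field of this report.
RECORDED_HASH = "4b76b2cb3dc069784f9c9312b5992e9237c2fe4d1ccd3a892af342e2e8b6c1fb"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded_hex: str = RECORDED_HASH) -> bool:
    """True if the body hashes to the recorded value (assumes SHA-256 over raw bytes)."""
    return sha256_hex(body) == recorded_hex.lower()
```

Any change to the file, including whitespace or line endings, produces a different digest, so a mismatch only says that the bytes differ, not how much.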

Groups

*

Rule Path
Disallow /index.php/
Disallow /checkout/
Disallow /app/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow /*/banner/ajax/
Disallow /*/checkout/cart/
Disallow /*/review/
Disallow /*/customer/
Disallow /*/ajaxrequest/
Disallow /*/sendfriend/
Disallow /*/page_cache/
Disallow /*/elasticsuite/
Disallow /*/catalogsearch/
Disallow /pub/media/openpay/attachments/
Allow /*.css
Allow /*.js
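
Several rules in this group use the `*` wildcard and the `$` end anchor. The sketch below shows one way such patterns can be evaluated, following the precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). It is an illustration only: `pattern_to_regex` and `is_allowed` are hypothetical helpers, percent-encoding normalisation is skipped, only a subset of the rules is listed, the test paths are made up, and real crawlers may behave differently.

```python
import re
from typing import Iterable, Tuple

# A subset of the rules listed for the "*" group above.
RULES = [
    ("Disallow", "/checkout/"),
    ("Disallow", "/lib/"),
    ("Disallow", "/*.php$"),
    ("Disallow", "/*/checkout/cart/"),
    ("Allow", "/*.css"),
    ("Allow", "/*.js"),
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape literal pieces, then join them with ".*" where "*" appeared.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + regex + ("$" if anchored else ""))


def is_allowed(path: str, rules: Iterable[Tuple[str, str]]) -> bool:
    """Longest matching pattern wins; Allow wins a tie; no match means allowed."""
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty pattern matches nothing in this sketch
        if pattern_to_regex(pattern).match(path):
            is_allow = kind.lower() == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


print(is_allowed("/index.php", RULES))          # False: matches /*.php$
print(is_allowed("/index.php?x=1", RULES))      # True: "$" requires the path to end in .php
print(is_allowed("/pt/checkout/cart/", RULES))  # False: matches /*/checkout/cart/
print(is_allowed("/lib/site.css", RULES))       # True: /*.css is longer than /lib/
```

The last case shows why the two Allow lines matter: without them, stylesheets and scripts under a disallowed directory would be blocked as well.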

gptbot

Rule Path
Disallow

oai-searchbot

Rule Path
Disallow

chatgpt-user

Rule Path
Disallow

openai-searchbot

Rule Path
Disallow

operator

Rule Path
Disallow

ccbot

Rule Path
Disallow

perplexitybot

Rule Path
Disallow

perplexity-user

Rule Path
Disallow

google-extended

Rule Path
Disallow

googleother

Rule Path
Disallow

bingbot

Rule Path
Disallow

claude-web

Rule Path
Disallow

claudebot

Rule Path
Disallow

anthropic-ai

Rule Path
Disallow
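
Each named bot group above shows a Disallow rule with no path. If that is a faithful rendering of the file, an empty Disallow value places no restriction on the named agent, which is the opposite of `Disallow: /`. A small sketch with Python's standard `urllib.robotparser`, using a made-up two-group excerpt rather than the real file, illustrates the difference (`build_parser` and the agent name `SomeOtherBot` are hypothetical):

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt modelled on the groups listed above; not the actual file.
SAMPLE = """\
User-agent: gptbot
Disallow:

User-agent: *
Disallow: /checkout/
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(SAMPLE)
url = "https://panini.com.br/checkout/"
print(parser.can_fetch("gptbot", url))        # True: empty Disallow restricts nothing
print(parser.can_fetch("SomeOtherBot", url))  # False: falls back to the "*" group
```

Note that a bot with its own group ignores the `*` group entirely, so under this reading the named bots would not inherit any of the path restrictions listed for `*`. The standard-library parser does not implement `*` or `$` wildcards, so it is only suitable for plain prefix rules like the one used here.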

Other Records

Field Value
sitemap https://panini.com.br/sitemap.xml
sitemap https://panini.com.br/media/sitemap.xml
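
The sitemap records can be read back out of a robots.txt body with the standard library. A minimal sketch, assuming Python 3.8 or later (where `RobotFileParser.site_maps()` is available) and using a reconstructed two-line excerpt rather than the real file; `list_sitemaps` is a hypothetical helper:

```python
from typing import List
from urllib.robotparser import RobotFileParser

# Reconstructed from the two sitemap records listed above.
SITEMAP_LINES = """\
Sitemap: https://panini.com.br/sitemap.xml
Sitemap: https://panini.com.br/media/sitemap.xml
"""


def list_sitemaps(text: str) -> List[str]:
    """Return the sitemap URLs declared in robots.txt text, or an empty list."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser.site_maps() or []  # site_maps() returns None when none are declared


for sitemap_url in list_sitemaps(SITEMAP_LINES):
    print(sitemap_url)
```

Sitemap records are independent of user-agent groups, so they apply regardless of which group a crawler matches.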

Comments

  • OpenAI Bots
  • Common Crawl Bot
  • Perplexity AI Bots
  • Google AI Bots
  • Microsoft/Bing AI Bots
  • Claude (Anthropic)