mercadocomum.com
robots.txt
Robots Exclusion Standard data for mercadocomum.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | mercadocomum.com |
Base Domain | mercadocomum.com |
Scan Status | Ok |
Last Scan | 2025-10-03T10:02:55+00:00 |
Next Scan | 2025-10-10T10:02:55+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-10-03T10:02:55+00:00 |
URL | https://mercadocomum.com/robots.txt |
Domain IPs | 104.21.40.170, 172.67.155.21, 2606:4700:3033::6815:28aa, 2606:4700:3033::ac43:9b15 |
Response IP | 172.67.155.21 |
Found | Yes |
Hash | 89f705ce3714eab30ed148431d77e8d182f07f1c24391c0dca4998082e95a2a4 |
SimHash | 631c5951c5f4 |
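The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest, although the report does not name the algorithm. Assuming it is a SHA-256 of the raw response body (an unverified assumption), a minimal Python sketch for fingerprinting a locally fetched copy and comparing it with the reported value might look like this. The names `content_hash` and `matches_report` are hypothetical helpers, not part of the scanner.

```python
import hashlib

# Hash value as listed in the scan report above.
REPORTED_HASH = "89f705ce3714eab30ed148431d77e8d182f07f1c24391c0dca4998082e95a2a4"


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the report hashes the raw bytes with SHA-256. The report
    itself does not state the algorithm or any normalization step.
    """
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a locally computed digest with the reported one."""
    return content_hash(body) == reported.lower()
```

A mismatch would not necessarily mean the file changed: the scanner may hash a normalized form of the content, which this sketch does not attempt to reproduce.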
Groups
*
Rule | Path |
---|---|
Allow | / |
googlebot
google-extended
googleother-image
googleother
googleother-video
bingbot
amazonbot
anthropic-ai
applebot
applebot-extended
ccbot
chatgpt
claude-web
claudebot
facebookbot
gptbot
icc-crawler
meta-externalagent
meta-externalfetcher
oai-searchbot
perplexity-user
perplexitybot
semrushbot-ocob
semrushbot-swa
slurp
oai-searchbot
chatgpt-user
perplexitybot
firecrawlagent
andibot
exabot
phindbot
youbot
Rule | Path |
---|---|
Allow | / |
Allow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.mercadocomum.com/sitemap_index.xml |
sitemap | https://www.mercadocomum.com/sitemap_index.xml |
sitemap | https://www.mercadocomum.com/sitemap.xml |
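The groups and sitemap records above can be checked programmatically with Python's standard `urllib.robotparser`. The sketch below is a rough reconstruction, not the site's actual file: the exact robots.txt text is an assumption inferred from the report, only a few of the listed user agents are included for brevity, and the repeated `Allow` and `sitemap` rows are collapsed to one each. The path `/any/path` and the agent name `some-unlisted-bot` are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the scanned file, based on the report above.
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: googlebot
User-agent: bingbot
User-agent: gptbot
User-agent: claudebot
Allow: /

Sitemap: https://www.mercadocomum.com/sitemap_index.xml
Sitemap: https://www.mercadocomum.com/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# A named agent and an unlisted agent are both allowed, because every
# group in the report carries a single "Allow: /" rule.
print(parser.can_fetch("gptbot", "https://mercadocomum.com/any/path"))
print(parser.can_fetch("some-unlisted-bot", "https://mercadocomum.com/any/path"))

# Sitemap URLs declared in the file (site_maps() requires Python 3.8+).
print(parser.site_maps())
```

Since the report shows no `Disallow` rules, the named groups behave the same as the `*` group under this reconstruction; listing the agents explicitly changes nothing about what they may fetch.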