treasurebox.com.br
robots.txt
Robots Exclusion Standard data for treasurebox.com.br
Resource Scan
Scan Details
Site Domain | treasurebox.com.br |
Base Domain | treasurebox.com.br |
Scan Status | Ok |
Last Scan | 2024-10-24T08:33:09+00:00 |
Next Scan | 2024-11-23T08:33:09+00:00 |
Last Scan
Scanned | 2024-10-24T08:33:09+00:00 |
URL | https://www.treasurebox.com.br/robots.txt |
Domain IPs | 13.33.88.16, 13.33.88.44, 13.33.88.66, 13.33.88.82 |
Response IP | 13.33.88.16 |
Found | Yes |
Hash | 53a76d4a8a1ec9966b4a2f507bcb9cd2ecca28a30664d7a40bf8faacf5a54dce |
SimHash | 6256dc24c413 |
Groups
*
Rule | Path |
---|---|
Disallow | /conta/* |
Disallow | /carrinho/* |
Disallow | /buscar |
Disallow | /documentacao |
Disallow | /api/produto/calcular_frete |
Disallow | /*fq%3D |
Disallow | /compre_junto/* |
Disallow | /_events/* |
Disallow | /tracking/convertion |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
Other Records
Field | Value |
---|---|
sitemap | https://www.treasurebox.com.br/sitemap.xml |