bretasatacarejo.com.br
robots.txt

Robots Exclusion Standard data for bretasatacarejo.com.br

Resource Scan

Scan Details

Site Domain bretasatacarejo.com.br
Base Domain bretasatacarejo.com.br
Scan Status Ok
Last Scan 2024-09-17T04:29:36+00:00
Next Scan 2024-10-17T04:29:36+00:00

Last Scan

Scanned 2024-09-17T04:29:36+00:00
URL https://www.bretasatacarejo.com.br/robots.txt
Domain IPs 13.227.254.17, 13.227.254.20, 13.227.254.5, 13.227.254.90, 2600:9000:200a:2200:17:55e7:d5c0:93a1, 2600:9000:200a:5c00:17:55e7:d5c0:93a1, 2600:9000:200a:8200:17:55e7:d5c0:93a1, 2600:9000:200a:8400:17:55e7:d5c0:93a1, 2600:9000:200a:8800:17:55e7:d5c0:93a1, 2600:9000:200a:e800:17:55e7:d5c0:93a1, 2600:9000:200a:ee00:17:55e7:d5c0:93a1, 2600:9000:200a:f600:17:55e7:d5c0:93a1
Response IP 13.227.254.5
Found Yes
Hash 2b1cf081cc3c01c88505da1118cf8428139659c6d4acbceb49c6a700e3ca5267
SimHash a8108d16caf1
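
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not name the algorithm, so the following is only a sketch under the assumption that the scanner hashes the raw response body with SHA-256. The function names sha256_hex and matches_recorded are illustrative and not part of any scanner API.

```python
import hashlib

# Hash value copied from the scan report above.
RECORDED_HASH = "2b1cf081cc3c01c88505da1118cf8428139659c6d4acbceb49c6a700e3ca5267"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Compare a fetched robots.txt body against the recorded hash.

    Assumption: the scanner hashed the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == recorded.lower()
```

A mismatch would not necessarily mean the file has changed since the scan; it could also mean the scanner uses a different algorithm or normalizes the body before hashing.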

Groups

*

Rule Path
Allow /
Allow /*.js$
Allow /*.css$
Allow /busqueda/
Disallow /account/*
Disallow /login/*
Disallow /checkout/*
Disallow /quick-view/*
Disallow /espiar/*
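
The rules in the * group can be evaluated with a small matcher. This is a minimal sketch that assumes the crawler follows the longest-match precedence described in RFC 9309 (the most specific pattern wins, and Allow wins a tie), with * matching any run of characters and a trailing $ anchoring the end of the path. Crawlers that apply rules in file order would behave differently, since Allow / is listed first. The names RULES, pattern_to_regex and is_allowed are illustrative.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("allow", "/"),
    ("allow", "/*.js$"),
    ("allow", "/*.css$"),
    ("allow", "/busqueda/"),
    ("disallow", "/account/*"),
    ("disallow", "/login/*"),
    ("disallow", "/checkout/*"),
    ("disallow", "/quick-view/*"),
    ("disallow", "/espiar/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any sequence of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under longest-match precedence."""
    best_len = -1
    best_allow = True  # a path matched by no rule is allowed
    for directive, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            allow = directive == "allow"
            longer = len(pattern) > best_len
            tie_prefers_allow = len(pattern) == best_len and allow
            if longer or tie_prefers_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions, is_allowed("/busqueda/?q=arroz") is True and is_allowed("/checkout/cart") is False. A path such as /account/app.js is also disallowed, because /account/* is a longer pattern than /*.js$.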

Other Records

Field Value
sitemap https://www.bretasatacarejo.com.br/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.