dia.com.br
robots.txt

Robots Exclusion Standard data for dia.com.br

Resource Scan

Scan Details

Site Domain dia.com.br
Base Domain dia.com.br
Scan Status Ok
Last Scan2024-06-26T00:58:49+00:00
Next Scan 2024-07-26T00:58:49+00:00

Last Scan

Scanned2024-06-26T00:58:49+00:00
URL https://dia.com.br/robots.txt
Redirect https://www.dia.com.br/robots.txt
Redirect Domain www.dia.com.br
Redirect Base dia.com.br
Domain IPs 3.165.102.117, 3.165.102.51, 3.165.102.53, 3.165.102.96
Redirect IPs 18.155.68.112, 18.155.68.127, 18.155.68.27, 18.155.68.73
Response IP 18.155.68.127
Found Yes
Hash 4da778fb7fe48a2c924605bb5347b597cc2cbcee3db704254eeaba58fa21c67b
SimHash 41108e714791

Groups

*

Rule Path
Allow *
Disallow /app/*
Disallow /diaexpress/suporte/

Other Records

Field Value
sitemap https://www.dia.com.br/sitemap-index.xml

Warnings

  • `host` is not a known field.