newman.cl
robots.txt
Robots Exclusion Standard data for newman.cl
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | newman.cl |
Base Domain | newman.cl |
Scan Status | Ok |
Last Scan | 2024-10-17T04:34:04+00:00 |
Next Scan | 2024-11-16T04:34:04+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-17T04:34:04+00:00 |
URL | https://www.newman.cl/robots.txt |
Domain IPs | 108.157.254.112, 108.157.254.114, 108.157.254.74, 108.157.254.8, 2600:9000:2753:1600:0:dec9:a600:93a1, 2600:9000:2753:200:0:dec9:a600:93a1, 2600:9000:2753:2200:0:dec9:a600:93a1, 2600:9000:2753:2800:0:dec9:a600:93a1, 2600:9000:2753:8200:0:dec9:a600:93a1, 2600:9000:2753:b200:0:dec9:a600:93a1, 2600:9000:2753:bc00:0:dec9:a600:93a1, 2600:9000:2753:e800:0:dec9:a600:93a1 |
Response IP | 108.157.254.114 |
Found | Yes |
Hash | f600adcf47f0a36816d4e0a4e6c6f631622622a385d0c4d5f75ba2b322ec4330 |
SimHash | f438cf074dd0 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /img/* |
Disallow | /account/* |
Disallow | /login/* |
Disallow | /checkout/* |
Disallow | /busca/* |
Disallow | /quick-view/* |
Disallow | /espiar/* |
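As a rough illustration of how a crawler might apply the rules above, the following sketch checks a URL path against the seven Disallow patterns. It assumes the common wildcard convention in which `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `pattern_to_regex` and `is_disallowed` and the sample paths are hypothetical and do not come from the scan.

```python
import re

# Disallow patterns copied from the "*" group in the table above.
DISALLOW = [
    "/img/*",
    "/account/*",
    "/login/*",
    "/checkout/*",
    "/busca/*",
    "/quick-view/*",
    "/espiar/*",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path, patterns=DISALLOW):
    """Return True if any Disallow pattern matches from the start of path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    # Hypothetical paths, chosen only to exercise the rules.
    for sample in ["/img/logo.png", "/quick-view/123", "/productos/zapatos"]:
        print(sample, is_disallowed(sample))
```

Because this group contains only Disallow rules, the sketch skips the longest-match tie-breaking between Allow and Disallow that a full parser would need.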
Other Records
Field | Value |
---|---|
sitemap | https://www.newmanchile.cl/sitemap.xml |
Warnings
- `noindex` is not a known field.
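A warning of this kind could be produced by comparing each field name in the file against a list of recognized fields. The sketch below is one minimal way to do that. The `KNOWN_FIELDS` set is an assumption and not necessarily the list the scanner uses, and the sample file body is hypothetical because the scan does not show the original `noindex` line or its value.

```python
# Assumed set of recognized field names; the scanner's actual list is not
# given in the report.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay", "host"}


def unknown_fields(robots_txt, known=KNOWN_FIELDS):
    """Return unrecognized field names in order of first appearance."""
    unknown = []
    for line in robots_txt.splitlines():
        # Drop comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in known and field not in unknown:
            unknown.append(field)
    return unknown


if __name__ == "__main__":
    # Hypothetical robots.txt body; the Noindex value is invented for the demo.
    sample = (
        "User-agent: *\n"
        "Disallow: /img/*\n"
        "Noindex: /busca/\n"
        "Sitemap: https://www.newmanchile.cl/sitemap.xml\n"
    )
    for name in unknown_fields(sample):
        print(f"`{name}` is not a known field.")
```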