newman.cl
robots.txt

Robots Exclusion Standard data for newman.cl

Archived Snapshots

Resource Scan

Scan Details

Site Domain	newman.cl
Base Domain	newman.cl
Scan Status	Ok
Last Scan	2024-10-17T04:34:04+00:00
Next Scan	2024-11-16T04:34:04+00:00

Last Scan

Scanned	2024-10-17T04:34:04+00:00
URL	https://www.newman.cl/robots.txt
Domain IPs	108.157.254.112, 108.157.254.114, 108.157.254.74, 108.157.254.8, 2600:9000:2753:1600:0:dec9:a600:93a1, 2600:9000:2753:200:0:dec9:a600:93a1, 2600:9000:2753:2200:0:dec9:a600:93a1, 2600:9000:2753:2800:0:dec9:a600:93a1, 2600:9000:2753:8200:0:dec9:a600:93a1, 2600:9000:2753:b200:0:dec9:a600:93a1, 2600:9000:2753:bc00:0:dec9:a600:93a1, 2600:9000:2753:e800:0:dec9:a600:93a1
Response IP	108.157.254.114
Found	Yes
Hash	f600adcf47f0a36816d4e0a4e6c6f631622622a385d0c4d5f75ba2b322ec4330
SimHash	f438cf074dd0

Groups

*

Rule	Path
Disallow	/img/*
Disallow	/account/*
Disallow	/login/*
Disallow	/checkout/*
Disallow	/busca/*
Disallow	/quick-view/*
Disallow	/espiar/*

Rule

Path

Disallow

/img/*

Disallow

/account/*

Disallow

/login/*

Disallow

/checkout/*

Disallow

/busca/*

Disallow

/quick-view/*

Disallow

/espiar/*

Back to top

Other Records

Field	Value
sitemap	https://www.newmanchile.cl/sitemap.xml

Field

Value

sitemap

https://www.newmanchile.cl/sitemap.xml

Back to top

Comments

Disallow all crawlers access to certain pages.

Back to top

Warnings

`noindex` is not a known field.

Back to top

newman.clrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

Comments

Warnings

newman.cl
robots.txt