cerezamayorista.com
robots.txt

Robots Exclusion Standard data for cerezamayorista.com

Resource Scan

Scan Details

Site Domain cerezamayorista.com
Base Domain cerezamayorista.com
Scan Status Ok
Last Scan2024-11-15T21:24:58+00:00
Next Scan 2024-11-29T21:24:58+00:00

Last Scan

Scanned2024-11-15T21:24:58+00:00
URL https://cerezamayorista.com/robots.txt
Redirect https://www.cerezamayorista.com/robots.txt
Redirect Domain www.cerezamayorista.com
Redirect Base cerezamayorista.com
Domain IPs 18.161.97.126, 18.161.97.15, 18.161.97.23, 18.161.97.47
Redirect IPs 13.227.254.121, 13.227.254.45, 13.227.254.5, 13.227.254.66, 2600:9000:200a:1800:c:2027:c700:93a1, 2600:9000:200a:1e00:c:2027:c700:93a1, 2600:9000:200a:2200:c:2027:c700:93a1, 2600:9000:200a:3400:c:2027:c700:93a1, 2600:9000:200a:7400:c:2027:c700:93a1, 2600:9000:200a:9200:c:2027:c700:93a1, 2600:9000:200a:9c00:c:2027:c700:93a1, 2600:9000:200a:e00:c:2027:c700:93a1
Response IP 13.227.254.66
Found Yes
Hash 7b6960447b94b0fa17550da7688d19ab412322d736d99e791949881c0d5a23b2
SimHash e638ed064dd0

Groups

*

Rule Path
Disallow /img/*
Disallow /account/*
Disallow /login/*
Disallow /checkout/*
Disallow /busca/*
Disallow /quick-view/*
Disallow /espiar/*
Disallow /buscapagina/*

Other Records

Field Value
sitemap https://cerezamayorista.com/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.