pierrecardin.com.br
robots.txt

Robots Exclusion Standard data for pierrecardin.com.br

Resource Scan

Scan Details

Site Domain pierrecardin.com.br
Base Domain pierrecardin.com.br
Scan Status Ok
Last Scan2024-11-12T20:04:14+00:00
Next Scan 2024-12-12T20:04:14+00:00

Last Scan

Scanned2024-11-12T20:04:14+00:00
URL https://www.pierrecardin.com.br/robots.txt
Domain IPs 13.33.88.32, 13.33.88.79, 13.33.88.85, 13.33.88.89, 2600:9000:223b:1400:8:5984:1980:93a1, 2600:9000:223b:3400:8:5984:1980:93a1, 2600:9000:223b:4c00:8:5984:1980:93a1, 2600:9000:223b:5800:8:5984:1980:93a1, 2600:9000:223b:7800:8:5984:1980:93a1, 2600:9000:223b:9200:8:5984:1980:93a1, 2600:9000:223b:d800:8:5984:1980:93a1, 2600:9000:223b:f400:8:5984:1980:93a1
Response IP 13.33.88.89
Found Yes
Hash 5556aae4058bc5eb6e47222cbf91bf3df78fa6c35f267dd4c0fd807ce33c0704
SimHash 6c18ef474ff5

Groups

*

Rule Path
Disallow /img/*
Disallow /account/*
Disallow /login/*
Disallow /logout/*
Disallow /checkout/*
Disallow /quick-view/*
Disallow /espiar/*
Disallow /esqueci-minha-senha/*
Disallow /meu-cadastro/*
Disallow /gerenciar-enderecos/*
Disallow /checkPassword/*
Disallow /nova-senha/*
Disallow /meus-pedidos/*
Disallow /busca/*
Disallow /admin/*
Allow /.js$
Allow /.css$
Allow /files

Other Records

Field Value
sitemap https://www.pierrecardin.com.br/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.