pantufas.com.br
robots.txt

Robots Exclusion Standard data for pantufas.com.br

Resource Scan

Scan Details

Site Domain pantufas.com.br
Base Domain pantufas.com.br
Scan Status Ok
Last Scan2026-03-14T11:48:31+00:00
Next Scan 2026-04-13T11:48:31+00:00

Last Scan

Scanned2026-03-14T11:48:31+00:00
URL https://www.pantufas.com.br/robots.txt
Domain IPs 2600:9000:271a:1a00:11:a883:e200:93a1, 2600:9000:271a:2e00:11:a883:e200:93a1, 2600:9000:271a:4000:11:a883:e200:93a1, 2600:9000:271a:8e00:11:a883:e200:93a1, 2600:9000:271a:aa00:11:a883:e200:93a1, 2600:9000:271a:be00:11:a883:e200:93a1, 2600:9000:271a:d000:11:a883:e200:93a1, 2600:9000:271a:e00:11:a883:e200:93a1, 3.165.75.32, 3.165.75.4, 3.165.75.86, 3.165.75.99
Response IP 3.165.75.32
Found Yes
Hash c19c62540b643ce9870274a64c6eccd6da0b604855d44fca0246f6262d0733e6
SimHash e430cd474dd0

Groups

*

Rule Path
Disallow /img/*
Disallow /account/*
Disallow /login/*
Disallow /checkout/*
Disallow /busca/*
Disallow /quick-view/*
Disallow /espiar/*

Other Records

Field Value
sitemap http://www.pantufas.com.br/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.

Warnings

  • `noindex` is not a known field.