trousseau.com.br
robots.txt

Robots Exclusion Standard data for trousseau.com.br

Resource Scan

Scan Details

Site Domain trousseau.com.br
Base Domain trousseau.com.br
Scan Status Ok
Last Scan 2025-07-24T23:29:11+00:00
Next Scan 2025-08-07T23:29:11+00:00

Last Scan

Scanned 2025-07-24T23:29:11+00:00
URL https://trousseau.com.br/robots.txt
Redirect https://www.trousseau.com.br/robots.txt
Redirect Domain www.trousseau.com.br
Redirect Base trousseau.com.br
Domain IPs 35.215.93.182
Redirect IPs 2600:9000:271a:2e00:e:aed4:4c40:93a1, 2600:9000:271a:3200:e:aed4:4c40:93a1, 2600:9000:271a:3800:e:aed4:4c40:93a1, 2600:9000:271a:4200:e:aed4:4c40:93a1, 2600:9000:271a:4800:e:aed4:4c40:93a1, 2600:9000:271a:7200:e:aed4:4c40:93a1, 2600:9000:271a:ac00:e:aed4:4c40:93a1, 2600:9000:271a:c800:e:aed4:4c40:93a1, 3.165.75.16, 3.165.75.37, 3.165.75.63, 3.165.75.94
Response IP 3.165.75.94
Found Yes
Hash e7069cf356e5975b9c882a0937a27887c605ba2eab8671cd5a8335e87f3eb048
SimHash ee10ed52c0f1

Groups

*

Rule Path
Allow /
Allow /*.js$
Allow /*.css$
Disallow /login*
Disallow /checkout*
Disallow /Sistema*
Disallow /sistema*
Disallow /*?utm*
Disallow /*?PS*
Disallow /*?O=*
Disallow /*?ft=*
Disallow /*?search-term=*
Disallow /*?rr_id=*
Disallow /*map%3D*
Disallow /*amgiftreg
Disallow /*.aspx
Disallow /*.html$
Disallow /*catalog
Disallow /central-de-atendimento*
Disallow /teste*
Disallow /quick-view/*
Disallow /espiar/*
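
The rules above rely on the `*` wildcard and the `$` end anchor. The report does not say how the scanner evaluates them, so the following is only a minimal sketch assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The helper names `pattern_to_regex` and `is_allowed`, the shortened rule list, and the sample paths are all hypothetical.

```python
import re

# A subset of the rules listed in the "*" group above.
RULES = [
    ("allow", "/"),
    ("allow", "/*.js$"),
    ("disallow", "/login*"),
    ("disallow", "/checkout*"),
    ("disallow", "/*?utm*"),
    ("disallow", "/*.html$"),
    ("disallow", "/quick-view/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the
    pattern to the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: the longest matching pattern wins and Allow wins ties.
    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

Under these assumptions, a hypothetical path such as `/login` is blocked by `/login*`, while `/static/app.js` stays crawlable because `/*.js$` is longer than `/`. Crawlers that use a different precedence (for example, first match in file order) may reach different results.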

Other Records

Field Value
sitemap https://www.trousseau.com.br/pt/sitemap.xml
sitemap https://www.trousseau.com.br/en/sitemap.xml
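
The sitemap records can also be read programmatically. The sketch below uses Python's standard `urllib.robotparser` (its `site_maps()` method needs Python 3.8 or later). The `ROBOTS_TXT` string is a hand-written reconstruction from the fields in this report, not the file as served, and it includes only a few of the rules.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of part of the robots.txt described above.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Disallow: /login*
Sitemap: https://www.trousseau.com.br/pt/sitemap.xml
Sitemap: https://www.trousseau.com.br/en/sitemap.xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns None when no Sitemap lines are present.
sitemaps = parser.site_maps() or []
for url in sitemaps:
    print(url)
```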

Comments

  • Disallow all crawlers access to certain pages.