alls.com.br
robots.txt

Robots Exclusion Standard data for alls.com.br

Resource Scan

Scan Details

Site Domain alls.com.br
Base Domain alls.com.br
Scan Status Ok
Last Scan2024-06-22T18:50:54+00:00
Next Scan 2024-07-22T18:50:54+00:00

Last Scan

Scanned2024-06-22T18:50:54+00:00
URL https://www.alls.com.br/robots.txt
Domain IPs 2600:9000:2721:1200:6:e48a:f640:93a1, 2600:9000:2721:3600:6:e48a:f640:93a1, 2600:9000:2721:4600:6:e48a:f640:93a1, 2600:9000:2721:7000:6:e48a:f640:93a1, 2600:9000:2721:a800:6:e48a:f640:93a1, 2600:9000:2721:d200:6:e48a:f640:93a1, 2600:9000:2721:e00:6:e48a:f640:93a1, 2600:9000:2721:ee00:6:e48a:f640:93a1, 3.165.102.105, 3.165.102.6, 3.165.102.70, 3.165.102.71
Response IP 3.165.102.6
Found Yes
Hash b40ff8b816b18ca8ea289024ed08e8e3f9b5dcb7ac5f89d757526a42fa72e140
SimHash f430cd574dd0

Groups

*

Rule Path
Disallow /img/*
Disallow /account/*
Disallow /login/*
Disallow /checkout/*
Disallow /busca/*
Disallow /quick-view/*
Disallow /espiar/*

Other Records

Field Value
sitemap https://www.allislove.com.br/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.

Warnings

  • `noindex` is not a known field.