amway.com.br
robots.txt

Robots Exclusion Standard data for amway.com.br

Resource Scan

Scan Details

Site Domain amway.com.br
Base Domain amway.com.br
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-06-09T06:57:04+00:00
Next Scan 2024-08-08T06:57:04+00:00

Last Successful Scan

Scanned 2023-07-23T04:05:14+00:00
URL https://www.amway.com.br/robots.txt
Domain IPs 54.192.150.44, 54.192.150.65, 54.192.150.8, 54.192.150.96
Response IP 18.66.147.72
Found Yes
Hash 0f2af8f51099b5bd2cdf6300b8a59e36e0cb50699b4d9cbe08b575648784864f
SimHash 6844d79cefec

Groups

*

Rule Path
Allow /
Disallow /pt/cart
Disallow /pt/checkout
Disallow /pt/my-account
Disallow /pt/lojavirtual
Disallow /pt/search
Disallow /pt/register
Disallow /pt/login
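
The `*` group combines a blanket `Allow /` with several narrower `Disallow` paths, so the outcome for a URL depends on which rule takes precedence. The sketch below assumes the longest-match rule described in RFC 9309 (the most specific matching path wins, and `Allow` wins a tie). The function name `is_allowed` is hypothetical, and wildcard characters (`*`, `$`) are not handled because none appear in these rules. Parsers that apply the first matching rule instead would reach a different result, since `Allow /` is listed first.

```python
# Rules for the "*" group, copied from the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/pt/cart"),
    ("disallow", "/pt/checkout"),
    ("disallow", "/pt/my-account"),
    ("disallow", "/pt/lojavirtual"),
    ("disallow", "/pt/search"),
    ("disallow", "/pt/register"),
    ("disallow", "/pt/login"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: plain prefix matching, no wildcard support.
    """
    best_len = -1
    best_allow = True  # a path matched by no rule is allowed by default
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        allow = kind == "allow"
        # A longer prefix is more specific; on equal length, allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len = len(prefix)
            best_allow = allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/pt/produtos", "/pt/cart/items", "/pt/login"]:
        print(p, is_allowed(p))
```

Because matching is by prefix, a path such as `/pt/cartoon` would also be treated as disallowed under this sketch.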

cazoodlebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /
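
A crawler applies only one of the groups listed above: the one whose name matches its own product token, falling back to `*` when none matches. The sketch below is a simplified illustration that assumes a case-insensitive exact match on the group name. The names `GROUPS` and `select_group` are hypothetical, and real parsers differ in how they treat a version suffix such as the one in `dotbot/1.0`.

```python
# Groups and rules as reported in the scan above.
STAR_RULES = [
    ("allow", "/"),
    ("disallow", "/pt/cart"),
    ("disallow", "/pt/checkout"),
    ("disallow", "/pt/my-account"),
    ("disallow", "/pt/lojavirtual"),
    ("disallow", "/pt/search"),
    ("disallow", "/pt/register"),
    ("disallow", "/pt/login"),
]

GROUPS = {
    "*": STAR_RULES,
    "cazoodlebot": [("disallow", "/")],
    "mj12bot": [("disallow", "/")],
    "dotbot/1.0": [("disallow", "/")],
    "gigabot": [("disallow", "/")],
    "googlebot": [("allow", "/")],
    "googlebot-image": [("allow", "/")],
    "mediapartners-google": [("allow", "/")],
    "adsbot-google": [("allow", "/")],
}


def select_group(product_token, groups=GROUPS):
    """Return the rule list for a crawler's product token.

    Assumption: exact, case-insensitive match; otherwise the "*" group.
    """
    return groups.get(product_token.lower(), groups["*"])


if __name__ == "__main__":
    print(select_group("MJ12bot"))          # the full-site disallow group
    print(select_group("bingbot") is STAR_RULES)  # falls back to "*"
```

Under this exact-match assumption, a crawler that identifies itself as plain `dotbot` falls through to the `*` group rather than the `dotbot/1.0` group, so the block intended for it may not take effect with every parser.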

Other Records

Field Value
sitemap https://www.amway.com.br/sitemap.xml

Comments

  • For all robots #Last modify: 2022-JUN
  • Block access to specific groups of pages
  • Request-rate: 1/10 # maximum rate is one page every 10 seconds
  • Crawl-delay: 10 # 10 seconds between page requests
  • Visit-time: 0400-0845 # only visit between 04:00 and 08:45 UTC
  • Allow search crawlers to discover the sitemap
  • Block CazoodleBot as it does not present correct accept content headers
  • Block MJ12bot as it is just noise
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot