miriade.com
robots.txt

Robots Exclusion Standard data for miriade.com

Resource Scan

Scan Details

Site Domain miriade.com
Base Domain miriade.com
Scan Status Ok
Last Scan2024-10-17T05:37:14+00:00
Next Scan 2024-11-16T05:37:14+00:00

Last Scan

Scanned2024-10-17T05:37:14+00:00
URL https://miriade.com/robots.txt
Redirect https://www.miriade.com/robots.txt
Redirect Domain www.miriade.com
Redirect Base miriade.com
Domain IPs 35.214.224.4
Redirect IPs 13.227.254.112, 13.227.254.124, 13.227.254.18, 13.227.254.45, 2600:9000:2014:1400:12:3f39:a300:93a1, 2600:9000:2014:1800:12:3f39:a300:93a1, 2600:9000:2014:4000:12:3f39:a300:93a1, 2600:9000:2014:5000:12:3f39:a300:93a1, 2600:9000:2014:a000:12:3f39:a300:93a1, 2600:9000:2014:a00:12:3f39:a300:93a1, 2600:9000:2014:ee00:12:3f39:a300:93a1, 2600:9000:2014:f800:12:3f39:a300:93a1
Response IP 13.227.254.18
Found Yes
Hash 87b9c6977b2cf8cc430b5ec2586f13c70856c7dc14a0924778d615437fd92130
SimHash a4344fc76dd0

Groups

googlebot

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

*

Rule Path
Disallow /img/*
Disallow /account/*
Disallow /login/*
Disallow /checkout/*
Disallow /search/*
Disallow /quick-view/*
Disallow /espiar/*
Disallow /*?map*
Disallow *?utm*
Disallow /*?page*

Other Records

Field Value
sitemap https://www.miriade.com/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.
  • Disallow: /es/*
  • Disallow: /en/*
  • Disallow: /de/*
  • Noindex: /es/*
  • Noindex: /en/*
  • Noindex: /de/*