truora.com
robots.txt

Robots Exclusion Standard data for truora.com

Resource Scan

Scan Details

Site Domain truora.com
Base Domain truora.com
Scan Status Ok
Last Scan2024-11-10T04:24:41+00:00
Next Scan 2024-12-10T04:24:41+00:00

Last Scan

Scanned2024-11-10T04:24:41+00:00
URL https://truora.com/robots.txt
Redirect https://www.truora.com/robots.txt
Redirect Domain www.truora.com
Redirect Base truora.com
Domain IPs 13.35.238.119, 13.35.238.54, 13.35.238.56, 13.35.238.95
Redirect IPs 199.60.103.225, 199.60.103.31, 2606:2c40::c73c:671f, 2606:2c40::c73c:67e1
Response IP 199.60.103.31
Found Yes
Hash 1c9b69309764aa9612f5ce07d0d11d2a166c7d51f98a56d0fc79623bc1cf5358
SimHash baa1cc64d4bb

Groups

*

Rule Path
Disallow /tag/*
Disallow /en/tag/*
Disallow /pt/tag/*
Disallow /es/tag/*
Disallow /page/*
Disallow /author/*
Disallow /en/author/*
Disallow /pt/author/*
Disallow /es/author/*
Disallow /hs-search-results*
Disallow /page/*
Disallow /en/page/*
Disallow /es/page/*
Disallow /pt/page/*
Disallow /search?*
Disallow /news*
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*

Other Records

Field Value
sitemap https://www.truora.com/sitemap.xml

Comments

  • tags and pages
  • paginaciones
  • web querys

Warnings

  • 1 invalid line.