catycan.com
robots.txt

Robots Exclusion Standard data for catycan.com

Resource Scan

Scan Details

Site Domain catycan.com
Base Domain catycan.com
Scan Status Ok
Last Scan2024-06-10T06:11:44+00:00
Next Scan 2024-07-10T06:11:44+00:00

Last Scan

Scanned2024-06-10T06:11:44+00:00
URL https://catycan.com/robots.txt
Redirect https://www.catycan.com.ar/robots.txt
Redirect Domain www.catycan.com.ar
Redirect Base catycan.com.ar
Domain IPs 104.21.14.134
Redirect IPs 108.157.254.5, 108.157.254.57, 108.157.254.67, 108.157.254.91, 2600:9000:2753:3800:1c:e136:8c00:93a1, 2600:9000:2753:3c00:1c:e136:8c00:93a1, 2600:9000:2753:6600:1c:e136:8c00:93a1, 2600:9000:2753:8c00:1c:e136:8c00:93a1, 2600:9000:2753:9800:1c:e136:8c00:93a1, 2600:9000:2753:b000:1c:e136:8c00:93a1, 2600:9000:2753:c200:1c:e136:8c00:93a1, 2600:9000:2753:e400:1c:e136:8c00:93a1
Response IP 108.157.254.67
Found Yes
Hash d7cc896fa2007cb07342d82219a17e7511bbe66c102deabfa4859b02e151f72b
SimHash e418cd574dd0

Groups

*

Rule Path
Disallow /img/*
Disallow /account/*
Disallow /login/*
Disallow /checkout/*
Disallow /busca/*
Disallow /quick-view/*
Disallow /espiar/*

Other Records

Field Value
sitemap https://www.natural-life.com.ar/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.

Warnings

  • `noindex` is not a known field.