vans.com.co
robots.txt

Robots Exclusion Standard data for vans.com.co

Resource Scan

Scan Details

Site Domain vans.com.co
Base Domain vans.com.co
Scan Status Ok
Last Scan2024-11-13T09:03:14+00:00
Next Scan 2024-12-13T09:03:14+00:00

Last Scan

Scanned2024-11-13T09:03:14+00:00
URL https://www.vans.com.co/robots.txt
Domain IPs 18.155.68.101, 18.155.68.23, 18.155.68.8, 18.155.68.80, 2600:9000:23d2:1a00:3:e4a4:e700:93a1, 2600:9000:23d2:3200:3:e4a4:e700:93a1, 2600:9000:23d2:3c00:3:e4a4:e700:93a1, 2600:9000:23d2:5e00:3:e4a4:e700:93a1, 2600:9000:23d2:8600:3:e4a4:e700:93a1, 2600:9000:23d2:8800:3:e4a4:e700:93a1, 2600:9000:23d2:9800:3:e4a4:e700:93a1, 2600:9000:23d2:a600:3:e4a4:e700:93a1
Response IP 18.155.68.23
Found Yes
Hash de216f1b5b18ff26889a5f6586ea00e454e8c6385a242973452045a77a2df760
SimHash 2c31698e6951

Groups

*

Rule Path
Disallow /img/*
Disallow /account*
Disallow /login*
Disallow /checkout*
Disallow /busca*
Disallow /quick-view*
Disallow /buscavazia*
Disallow /*?*map*
Disallow /*?*fq*
Disallow /*?*productClusterIds*
Disallow /*?*productClusterSearchableIds*
Disallow /*?*specificationFilter*

Other Records

Field Value
sitemap https://www.vans.com.co/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.
  • Disallow URL Parameters