ca.vlex.com
robots.txt

Robots Exclusion Standard data for ca.vlex.com

Resource Scan

Scan Details

Site Domain ca.vlex.com
Base Domain vlex.com
Scan Status Ok
Last Scan 2024-11-01T23:46:55+00:00
Next Scan 2024-11-15T23:46:55+00:00

Last Scan

Scanned 2024-11-01T23:46:55+00:00
URL https://ca.vlex.com/robots.txt
Domain IPs 108.156.133.105, 108.156.133.39, 108.156.133.68, 108.156.133.76
Response IP 108.156.133.76
Found Yes
Hash 6a0919b80ddd36ad89e8e55fe33e7ede9af182f21c2c705d974712ddabec1fa4
SimHash b2de79094767
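
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. A minimal sketch, assuming SHA-256 over the raw response body, of how such a fingerprint could be used to detect changes between scans. The function names and the sample bodies are hypothetical and are not the file served by ca.vlex.com.

```python
import hashlib


def robots_digest(body: bytes) -> str:
    """Return a hex SHA-256 digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Compare a stored digest against a freshly fetched body."""
    return robots_digest(body) != previous_digest


# Hypothetical bodies from two consecutive scans.
first_scan = b"User-agent: *\nDisallow: /search\n"
second_scan = b"User-agent: *\nDisallow: /search\nDisallow: /checkout\n"

stored = robots_digest(first_scan)
print(len(stored))                       # 64 hex digits
print(has_changed(stored, first_scan))   # False
print(has_changed(stored, second_scan))  # True
```

An exact digest only says whether the bytes differ at all; a similarity hash such as the SimHash field would be the tool for judging how much they differ.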

Groups

mediapartners-google

Rule Path
Disallow (empty value)

*

Rule Path
Disallow /tour
Disallow /redirect
Disallow /stats
Disallow /search
Disallow /account/login_ip
Disallow /account/check_if_logged?callback=jsonp*
Disallow /pdf/
Disallow /*/search
Disallow /*/auto_complete_for_norma_citada
Disallow /*/auto_complete_for_voces_partes
Disallow /vid/1234$
Disallow /vid/*ix_resultado*
Disallow /async
Disallow /*st_dominio
Disallow /*textolibre
Disallow /voces
Disallow /vid/*/translate?mt=
Disallow /checkout
Disallow /vid/*.json
Disallow /vid/*/content
Disallow /tags/*?format=pdf
Disallow /vid?*ix_resultado*
Disallow /source/*/*/thesaurus/starting_with
Disallow /corporate
Disallow /help_center
Disallow /for_publishers
Disallow /librarian_center
Disallow /offline_trials
Disallow /promos
Disallow /countries
Disallow /customers
Disallow /publisher
Disallow /languages
Disallow /freetrial_30
Disallow /center-search*
Disallow /session_ip*
Disallow /session.json*
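
Several of the paths above use `*` and a trailing `$`. Under the common wildcard convention (also described in RFC 9309), `*` matches any run of characters and `$` anchors the end of the URL path; without `$`, a rule is a prefix match. A minimal sketch of that matching, using a handful of the rules listed above. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sketch ignores Allow rules and longest-match precedence.

```python
import re


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path rule into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the path's first character.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path: str, rules: list[str]) -> bool:
    """True if any non-empty rule matches the path (prefix semantics)."""
    return any(rule_to_regex(rule).match(path) for rule in rules if rule)


# A few rules copied from the '*' group.
RULES = ["/vid/1234$", "/*/search", "/vid/*.json", "/pdf/"]

print(is_disallowed("/vid/1234", RULES))       # True: exact match with '$'
print(is_disallowed("/vid/12345", RULES))      # False: '$' blocks the longer path
print(is_disallowed("/en/search?q=x", RULES))  # True: '*' spans 'en'
print(is_disallowed("/vid/99.json", RULES))    # True
print(is_disallowed("/pdf", RULES))            # False: rule requires the trailing '/'
```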

bingbot

Rule Path
Disallow /tags/*/page/*

msnbot

Rule Path
Disallow /tags/*/page/*

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

psbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

speedy

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

bloglines/3.1

Rule Path
Disallow /

jyxobot/1

Rule Path
Disallow /

cityreview

Rule Path
Disallow /

proximic

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /
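
A crawler that matches a named group follows only that group, not the `*` group, and an empty Disallow permits everything. A minimal sketch of how the groups above could be queried with Python's standard `urllib.robotparser`. The excerpt is hand-written to mirror a few of the listed groups and is not the full file as served; `examplebot` and the `allowed` helper are hypothetical. Note that this parser uses plain prefix matching and does not expand `*` in paths, so the sketch only queries rules where that does not matter.

```python
from urllib.robotparser import RobotFileParser

# Hand-written excerpt mirroring a few of the groups listed above
# (an assumption for illustration, not the full served file).
EXCERPT = """\
User-agent: mediapartners-google
Disallow:

User-agent: *
Disallow: /search
Disallow: /checkout

User-agent: bingbot
Disallow: /tags/*/page/*

User-agent: gptbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(EXCERPT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Ask the parser whether `agent` may fetch `path` on ca.vlex.com."""
    return parser.can_fetch(agent, "https://ca.vlex.com" + path)


print(allowed("gptbot", "/vid/1"))                   # False: blanket Disallow /
print(allowed("mediapartners-google", "/search"))    # True: empty Disallow
print(allowed("bingbot", "/search"))                 # True: bingbot has its own group
print(allowed("examplebot", "/search"))              # False: falls back to '*'
print(allowed("examplebot", "/vid/1"))               # True: no '*' rule matches
```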

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file