ar.vlex.com
robots.txt

Robots Exclusion Standard data for ar.vlex.com

Resource Scan

Scan Details

Site Domain	ar.vlex.com
Base Domain	vlex.com
Scan Status	Ok
Last Scan	2024-06-07T22:44:28+00:00
Next Scan	2024-06-21T22:44:28+00:00

Last Scan

Scanned	2024-06-07T22:44:28+00:00
URL	https://ar.vlex.com/robots.txt
Domain IPs	2600:9000:271a:1c00:18:e259:2dc0:93a1, 2600:9000:271a:2600:18:e259:2dc0:93a1, 2600:9000:271a:5600:18:e259:2dc0:93a1, 2600:9000:271a:9800:18:e259:2dc0:93a1, 2600:9000:271a:a000:18:e259:2dc0:93a1, 2600:9000:271a:b200:18:e259:2dc0:93a1, 2600:9000:271a:b800:18:e259:2dc0:93a1, 2600:9000:271a:fa00:18:e259:2dc0:93a1, 3.165.82.106, 3.165.82.33, 3.165.82.65, 3.165.82.78
Response IP	3.165.82.33
Found	Yes
Hash	6a0919b80ddd36ad89e8e55fe33e7ede9af182f21c2c705d974712ddabec1fa4
SimHash	b2de79094767

Groups

mediapartners-google

Rule	Path
Disallow

*

Rule	Path
Disallow	/tour
Disallow	/redirect
Disallow	/stats
Disallow	/search
Disallow	/account/login_ip
Disallow	/account/check_if_logged?callback=jsonp*
Disallow	/pdf/
Disallow	/*/search
Disallow	/*/auto_complete_for_norma_citada
Disallow	/*/auto_complete_for_voces_partes
Disallow	/vid/1234$
Disallow	/vid/*ix_resultado*
Disallow	/async
Disallow	/*st_dominio
Disallow	/*textolibre
Disallow	/voces
Disallow	/vid/*/translate?mt=
Disallow	/checkout
Disallow	/vid/*.json
Disallow	/vid/*/content
Disallow	/tags/*?format=pdf
Disallow	/vid?*ix_resultado*
Disallow	/source/*/*/thesaurus/starting_with
Disallow	/corporate
Disallow	/help_center
Disallow	/for_publishers
Disallow	/librarian_center
Disallow	/offline_trials
Disallow	/promos
Disallow	/countries
Disallow	/customers
Disallow	/publisher
Disallow	/languages
Disallow	/freetrial_30
Disallow	/center-search*
Disallow	/session_ip*
Disallow	/session.json*
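
Several paths in the group above use `*` and a trailing `$`. The scan does not say how each crawler interprets them, so the following Python sketch assumes the common convention (as described in RFC 9309) that `*` matches any run of characters and a final `$` anchors the end of the path. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and `SAMPLE_RULES` is only a handful of the rules listed above, not the full group.

```python
import re

# A few rules copied from the "*" group above (not the complete list).
SAMPLE_RULES = [
    "/vid/1234$",
    "/vid/*.json",
    "/tags/*?format=pdf",
    "/source/*/*/thesaurus/starting_with",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one Disallow path into a compiled regular expression.

    Assumption: '*' means "any sequence of characters" and a trailing '$'
    means "the path must end here"; everything else is literal.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces and join them with '.*' for each wildcard.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    if anchored:
        pattern += r"\Z"
    return re.compile(pattern)


def is_disallowed(path: str, rules: list[str]) -> bool:
    """Return True if any rule matches the path from its first character."""
    return any(rule_to_regex(rule).match(path) is not None for rule in rules)


if __name__ == "__main__":
    for candidate in ["/vid/1234", "/vid/12345", "/vid/42.json", "/tags/ley"]:
        print(candidate, is_disallowed(candidate, SAMPLE_RULES))
```

The sketch ignores `Allow` rules and longest-match precedence because the group above contains only `Disallow` entries.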

bingbot

Rule	Path
Disallow	/tags/*/page/*

msnbot

Rule	Path
Disallow	/tags/*/page/*

ubicrawler

Rule	Path
Disallow	/

doc

Rule	Path
Disallow	/

zao

Rule	Path
Disallow	/

sitecheck.internetseer.com

Rule	Path
Disallow	/

zealbot

Rule	Path
Disallow	/

msiecrawler

Rule	Path
Disallow	/

sitesnagger

Rule	Path
Disallow	/

webstripper

Rule	Path
Disallow	/

webcopier

Rule	Path
Disallow	/

fetch

Rule	Path
Disallow	/

offline explorer

Rule	Path
Disallow	/

teleport

Rule	Path
Disallow	/

teleportpro

Rule	Path
Disallow	/

webzip

Rule	Path
Disallow	/

linko

Rule	Path
Disallow	/

httrack

Rule	Path
Disallow	/

microsoft.url.control

Rule	Path
Disallow	/

xenu

Rule	Path
Disallow	/

larbin

Rule	Path
Disallow	/

libwww

Rule	Path
Disallow	/

zyborg

Rule	Path
Disallow	/

download ninja

Rule	Path
Disallow	/

wget

Rule	Path
Disallow	/

grub-client

Rule	Path
Disallow	/

k2spider

Rule	Path
Disallow	/

npbot

Rule	Path
Disallow	/

webreaper

Rule	Path
Disallow	/

psbot

Rule	Path
Disallow	/

exabot

Rule	Path
Disallow	/

speedy

Rule	Path
Disallow	/

dotbot

Rule	Path
Disallow	/

bloglines/3.1

Rule	Path
Disallow	/

jyxobot/1

Rule	Path
Disallow	/

cityreview

Rule	Path
Disallow	/

proximic

Rule	Path
Disallow	/

gptbot

Rule	Path
Disallow	/

chatgpt-user

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

facebookbot

Rule	Path
Disallow	/

omgilibot

Rule	Path
Disallow	/

anthropic-ai

Rule	Path
Disallow	/

cohere-ai

Rule	Path
Disallow	/

Comments

See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
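
As an illustration of how a crawler might consult these groups, the Python sketch below feeds a short excerpt to the standard-library `urllib.robotparser`. The excerpt is a reconstruction from the tables in this report, not the site's actual file, and the URLs and the `build_parser` helper are hypothetical. The standard-library parser matches paths by prefix and, to the best of my knowledge, does not interpret `*` or `$` inside a path, so the excerpt sticks to rules without wildcards.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt (assumption): three of the groups reported above,
# written in robots.txt syntax. The real file may differ in case and order.
ROBOTS_EXCERPT = """\
User-agent: gptbot
Disallow: /

User-agent: wget
Disallow: /

User-agent: *
Disallow: /tour
Disallow: /search
Disallow: /pdf/
Disallow: /checkout
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


if __name__ == "__main__":
    parser = build_parser(ROBOTS_EXCERPT)
    # Hypothetical URLs, used only to exercise the rules in the excerpt.
    checks = [
        ("gptbot", "https://ar.vlex.com/vid/1"),
        ("examplebot", "https://ar.vlex.com/search"),
        ("examplebot", "https://ar.vlex.com/vid/1"),
    ]
    for agent, url in checks:
        print(agent, url, parser.can_fetch(agent, url))
```

An agent with its own group (such as `gptbot`) is judged only by that group; any other agent falls back to the `*` group.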
