es.healthcare.airliquide.com
robots.txt

Robots Exclusion Standard data for es.healthcare.airliquide.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	es.healthcare.airliquide.com
Base Domain	airliquide.com
Scan Status	Ok
Last Scan	2024-09-23T00:58:49+00:00
Next Scan	2024-10-07T00:58:49+00:00

Last Scan

Scanned	2024-09-23T00:58:49+00:00
URL	https://es.healthcare.airliquide.com/robots.txt
Domain IPs	23.210.96.168, 2600:1413:b000:696::f52, 2600:1413:b000:697::f52
Response IP	23.210.96.168
Found	Yes
Hash	911b8e24cc0a44fd38752ad18dce9d88664ef082816757fde0ff5e3009b80c02
SimHash	3896ad0bc778

Groups

*

Rule	Path
Allow	/core/*.css$
Allow	/core/*.css?
Allow	/core/*.js$
Allow	/core/*.js?
Allow	/core/*.gif
Allow	/core/*.jpg
Allow	/core/*.jpeg
Allow	/core/*.png
Allow	/core/*.svg
Allow	/profiles/*.css$
Allow	/profiles/*.css?
Allow	/profiles/*.js$
Allow	/profiles/*.js?
Allow	/profiles/*.gif
Allow	/profiles/*.jpg
Allow	/profiles/*.jpeg
Allow	/profiles/*.png
Allow	/profiles/*.svg
Disallow	/core/
Disallow	/profiles/
Disallow	/README.txt
Disallow	/web.config
Disallow	/admin/
Disallow	/comment/reply/
Disallow	/filter/tips
Disallow	/node/add/
Disallow	/search/
Disallow	/user/register
Disallow	/user/password
Disallow	/user/login
Disallow	/user/logout
Disallow	/index.php/admin/
Disallow	/index.php/comment/reply/
Disallow	/index.php/filter/tips
Disallow	/index.php/node/add/
Disallow	/index.php/search/
Disallow	/index.php/user/password
Disallow	/index.php/user/register
Disallow	/index.php/user/login
Disallow	/index.php/user/logout
Disallow	/add-to-calendar/ics/
Disallow	/api/airliquide/download/file/
Disallow	/form/
Disallow	/opinion_survey_json/add/
Disallow	*/aggregate
Disallow	/page_action/
Disallow	/spa/
Disallow	/ajax/
Disallow	/jserrors/
Disallow	/metrics/
Disallow	/page_view_timing/
Disallow	/page_view_event/
Disallow	/sesion_trace/
Disallow	%3D
Disallow	/node
Allow	webp
Allow	page%3D
Allow	languageSelect%3D
Allow	thematic%5B0%5D%3D
Allow	jobId%3DP95FK026203F3VBQBV7LOQWKU-48853%26langCode%3Dfr_FR
Allow	jobId%3DPDMFK026203F3VBQBV7LOQW0B-250%26langCode%3Den_US
Allow	field_date_range_end_value%3D%26field_date_range_end_value_1%3D%26page%3D
Allow	period%5Bmin%5D%3D%26period%5Bmax%5D%3D%26text%3D%26page%3D
Allow	period%5Bmin%5D%3D%26period%5Bmax%5D%3D%26page%3D
Allow	.jpg
Allow	.jpeg
Allow	.png

Rule

Path

Allow

/core/*.css$

Allow

/core/*.css?

Allow

/core/*.js$

Allow

/core/*.js?

Allow

/core/*.gif

Allow

/core/*.jpg

Allow

/core/*.jpeg

Allow

/core/*.png

Allow

/core/*.svg

Allow

/profiles/*.css$

Allow

/profiles/*.css?

Allow

/profiles/*.js$

Allow

/profiles/*.js?

Allow

/profiles/*.gif

Allow

/profiles/*.jpg

Allow

/profiles/*.jpeg

Allow

/profiles/*.png

Allow

/profiles/*.svg

Disallow

/core/

Disallow

/profiles/

Disallow

/README.txt

Disallow

/web.config

Disallow

/admin/

Disallow

/comment/reply/

Disallow

/filter/tips

Disallow

/node/add/

Disallow

/search/

Disallow

/user/register

Disallow

/user/password

Disallow

/user/login

Disallow

/user/logout

Disallow

/index.php/admin/

Disallow

/index.php/comment/reply/

Disallow

/index.php/filter/tips

Disallow

/index.php/node/add/

Disallow

/index.php/search/

Disallow

/index.php/user/password

Disallow

/index.php/user/register

Disallow

/index.php/user/login

Disallow

/index.php/user/logout

Disallow

*/add-to-calendar/ics/*

Disallow

*/api/airliquide/download/file/*

Disallow

*/form/*

Disallow

*/opinion_survey_json/add/*

Disallow

*/aggregate

Disallow

*/page_action/*

Disallow

*/spa/*

Disallow

*/ajax/*

Disallow

*/jserrors/*

Disallow

*/metrics/*

Disallow

*/page_view_timing/*

Disallow

*/page_view_event/*

Disallow

*/sesion_trace/*

Disallow

*%3D*

Disallow

*/node*

Allow

*webp*

Allow

*page%3D*

Allow

*languageSelect%3D*

Allow

*thematic%5B0%5D%3D*

Allow

*jobId%3DP95FK026203F3VBQBV7LOQWKU-48853%26langCode%3Dfr_FR*

Allow

*jobId%3DPDMFK026203F3VBQBV7LOQW0B-250%26langCode%3Den_US*

Allow

*field_date_range_end_value%3D%26field_date_range_end_value_1%3D%26page%3D*

Allow

*period%5Bmin%5D%3D%26period%5Bmax%5D%3D%26text%3D%26page%3D*

Allow

*period%5Bmin%5D%3D%26period%5Bmax%5D%3D%26page%3D*

Allow

*.jpg*

Allow

*.jpeg*

Allow

*.png*

baiduspider

Rule	Path
Disallow	/

Rule

Path

Disallow

yahoo! slurp china

Rule	Path
Disallow	/

Rule

Path

Disallow

yandexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

yandex

Rule	Path
Disallow	/

Rule

Path

Disallow

archive.org_bot
semrushbot
yandeximages

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider-image

Rule	Path
Disallow	/

Rule

Path

Disallow

noxtrumbot
msnbot
slurp
ahrefsbot
msiecrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

webcopier

Rule	Path
Disallow	/

Rule

Path

Disallow

python-urllib

Rule	Path
Disallow	/

Rule

Path

Disallow

url_spider_pro

Rule	Path
Disallow	/

Rule

Path

Disallow

emailcollector

Rule	Path
Disallow	/

Rule

Path

Disallow

emailsiphon

Rule	Path
Disallow	/

Rule

Path

Disallow

webbandit

Rule	Path
Disallow	/

Rule

Path

Disallow

emailwolf

Rule	Path
Disallow	/

Rule

Path

Disallow

extractorpro

Rule	Path
Disallow	/

Rule

Path

Disallow

copyrightcheck

Rule	Path
Disallow	/

Rule

Path

Disallow

alexibot

Rule	Path
Disallow	/

Rule

Path

Disallow

web image collector

Rule	Path
Disallow	/

Rule

Path

Disallow

xenu's link sleuth 1.1c

Rule	Path
Disallow	/

Rule

Path

Disallow

xenu's

Rule	Path
Disallow	/

Rule

Path

Disallow

zeus

Rule	Path
Disallow	/

Rule

Path

Disallow

zeus link scout

Rule	Path
Disallow	/

Rule

Path

Disallow

erocrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

linkscan/8.1a unix

Rule	Path
Disallow	/

Rule

Path

Disallow

keyword density/0.9

Rule	Path
Disallow	/

Rule

Path

Disallow

webcopier v3.2a

Rule	Path
Disallow	/

Rule

Path

Disallow

webcapture 2.0

Rule	Path
Disallow	/

Rule

Path

Disallow

webcopier v.2.2

Rule	Path
Disallow	/

Rule

Path

Disallow

etaospider

Rule	Path
Disallow	/

Rule

Path

Disallow

black hole

Rule	Path
Disallow	/

Rule

Path

Disallow

xenu\\\'s link sleuth 1.1c

Rule	Path
Disallow	/

Rule

Path

Disallow

xenu\\\'s

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://es.healthcare.airliquide.com/sites/alh_es/files/pdf-sitemap.xml
sitemap	https://es.healthcare.airliquide.com/sitemap.xml

Field

Value

sitemap

https://es.healthcare.airliquide.com/sites/alh_es/files/pdf-sitemap.xml

sitemap

https://es.healthcare.airliquide.com/sitemap.xml

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html
CSS, JS, Images
Directories
Files
Paths (clean URLs)
Paths (no clean URLs)
Specific rules
Blocking all parameters
Except those
Tratamiento de User-agents especificos
XML sitemap

es.healthcare.airliquide.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

baiduspider

yahoo! slurp china

yandexbot

yandex

archive.org_botsemrushbotyandeximages

baiduspider-image

noxtrumbotmsnbotslurpahrefsbotmsiecrawler

webcopier

python-urllib

url_spider_pro

emailcollector

emailsiphon

webbandit

emailwolf

extractorpro

copyrightcheck

alexibot

web image collector

xenu's link sleuth 1.1c

xenu's

zeus

zeus link scout

erocrawler

linkscan/8.1a unix

keyword density/0.9

webcopier v3.2a

webcapture 2.0

webcopier v.2.2

etaospider

black hole

xenu\\\'s link sleuth 1.1c

xenu\\\'s

Other Records

Comments

es.healthcare.airliquide.com
robots.txt

archive.org_bot
semrushbot
yandeximages

noxtrumbot
msnbot
slurp
ahrefsbot
msiecrawler