lbstgroup.com
robots.txt

Robots Exclusion Standard data for lbstgroup.com

Resource Scan

Scan Details

Site Domain	lbstgroup.com
Base Domain	lbstgroup.com
Scan Status	Ok
Last Scan	2024-10-25T11:22:19+00:00
Next Scan	2024-11-24T11:22:19+00:00

Last Scan

Scanned	2024-10-25T11:22:19+00:00
URL	https://lbstgroup.com/robots.txt
Redirect	https://www.lbstgroup.com/robots.txt
Redirect Domain	www.lbstgroup.com
Redirect Base	lbstgroup.com
Domain IPs	47.251.1.227
Redirect IPs	47.251.1.227
Response IP	47.251.1.227
Found	Yes
Hash	1056e7c8f69b49a69b1c7dbd1a9a519ce073db097e1ec052d5be629e329c8d21
SimHash	3c92fd01c760
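
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a rough sketch under that assumption, a scanner could fingerprint the fetched robots.txt body and compare it with the previous scan to detect changes. The function names `fingerprint` and `has_changed` are hypothetical and not part of the scanning service.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the report's "Hash" field is SHA-256; the report itself
    does not state which algorithm it uses.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Return True when the body no longer matches the stored digest."""
    return fingerprint(body) != previous_hash.lower()
```

A digest like this only detects exact byte-for-byte changes; the separate SimHash field is presumably there to measure near-duplicate similarity, which this sketch does not attempt.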

Groups

webuup.com

Rule	Path
Disallow	/

maoytcurl

Rule	Path
Disallow	/

alphaseobot

Rule	Path
Disallow	/

alphaseobot-sa

Rule	Path
Disallow	/

seznambot

Rule	Path
Disallow	/

zoominfobot

Rule	Path
Disallow	/

megaindex.ru

Rule	Path
Disallow	/

megaindex.com

Rule	Path
Disallow	/

dotbot

Rule	Path
Disallow	/

bubing

Rule	Path
Disallow	/

spbot

Rule	Path
Disallow	/

coccocbot

Rule	Path
Disallow	/

seokicks-robot

Rule	Path
Disallow	/

netestate ne crawler (+http://www.website-datenbank.de/)

Rule	Path
Disallow	/

youdaobot

Rule	Path
Disallow	/

amazonbot

Rule	Path
Disallow	/

my-tiny-bot

Rule	Path
Disallow	/
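
Each group above pairs one user-agent token with a single `Disallow: /` rule, which asks that crawler to stay off the whole site. A minimal sketch of how such groups are evaluated, using Python's standard `urllib.robotparser`, follows. The `ROBOTS_FRAGMENT` text is an assumed reconstruction of two of the groups listed in this report, not the verbatim file, and `is_allowed` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of two groups from the report; the real file
# contains more groups and its exact text is not shown on this page.
ROBOTS_FRAGMENT = """\
User-agent: dotbot
Disallow: /

User-agent: amazonbot
Disallow: /
"""


def is_allowed(user_agent: str, url: str) -> bool:
    """Check whether user_agent may fetch url under ROBOTS_FRAGMENT."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_FRAGMENT.splitlines())
    return parser.can_fetch(user_agent, url)
```

With this fragment, `is_allowed("dotbot", "https://www.lbstgroup.com/")` is false, while an agent that matches no group is allowed by default. Note that `urllib.robotparser` does plain prefix matching and does not interpret the `*` and `$` wildcards used in the catch-all group.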

*

Rule	Path
Allow	/core/*.css$
Allow	/core/*.css?
Allow	/core/*.js$
Allow	/core/*.js?
Allow	/core/*.gif
Allow	/core/*.jpg
Allow	/core/*.jpeg
Allow	/core/*.png
Allow	/core/*.svg
Allow	/profiles/*.css$
Allow	/profiles/*.css?
Allow	/profiles/*.js$
Allow	/profiles/*.js?
Allow	/profiles/*.gif
Allow	/profiles/*.jpg
Allow	/profiles/*.jpeg
Allow	/profiles/*.png
Allow	/profiles/*.svg
Allow	/sitemap.xml
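
The Allow paths above rely on two wildcard conventions: `*` matches any run of characters, and a trailing `$` anchors the pattern to the end of the URL path. The `?` in patterns such as `/core/*.css?` is a literal question mark, so that rule covers stylesheet URLs carrying a query string. A small sketch of this matching, assuming the wildcard semantics described in RFC 9309, is shown here; `pattern_to_regex` and `path_matches` are illustrative names only.

```python
import re


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*', a trailing '$' anchors the end of the path, and
    every other character (including '?') is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def path_matches(pattern: str, path: str) -> bool:
    """Return True when path (including any query string) matches pattern."""
    return pattern_to_regex(pattern).match(path) is not None
```

Under these assumptions, `/core/*.css$` matches `/core/themes/x/style.css` but not `/core/themes/x/style.css?v=2`, which is instead covered by `/core/*.css?`. The sketch tests a single pattern and does not implement precedence between competing Allow and Disallow rules.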

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html
https://megaindex.com/crawler
http://www.opensiteexplorer.org/dotbot
http://moz.com/researchtools/ose/dotbot
http://openlinkprofiler.org/bot
SEOkicks-Robot
Block netEstate NE Crawler (+http://www.website-datenbank.de/)
CSS, JS, Images
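
The comments above note that robots.txt is only honored at the root of the host. As a brief illustration, the location a crawler would check for any page can be derived by keeping the scheme and host and replacing the rest of the URL; `robots_url` is a hypothetical helper name used only for this sketch.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the root-level robots.txt URL for the host serving page_url."""
    parts = urlsplit(page_url)
    # Keep scheme and host; drop the original path, query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))
```

For example, `robots_url("http://example.com/site/page.html")` gives `http://example.com/robots.txt`, which is why a file placed at `/site/robots.txt` is never consulted.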
