glorybeats.com
robots.txt

Robots Exclusion Standard data for glorybeats.com

Resource Scan

Scan Details

Site Domain	glorybeats.com
Base Domain	glorybeats.com
Scan Status	Ok
Last Scan	2024-10-20T20:01:17+00:00
Next Scan	2024-11-19T20:01:17+00:00
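
The two timestamps above imply a fixed rescan interval. A minimal sketch of the arithmetic follows, using only the values shown in the table. The variable names are illustrative, and the idea that the scanner schedules on a fixed interval is an inference from these two values, not something the report states.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset included).
last_scan = datetime.fromisoformat("2024-10-20T20:01:17+00:00")
next_scan = datetime.fromisoformat("2024-11-19T20:01:17+00:00")

# Difference between the scheduled next scan and the last completed scan.
interval = next_scan - last_scan
print(f"Rescan interval: {interval.days} days")
```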

Last Scan

Scanned	2024-10-20T20:01:17+00:00
URL	https://glorybeats.com/robots.txt
Domain IPs	91.240.20.30
Response IP	91.240.20.30
Found	Yes
Hash	8448a48763b823fb23408a3f9b6d83460a3c50de3a5964f9f1e84dbcb4687403
SimHash	487aff43c635

Groups

googlebot

Rule	Path
Allow	/

*

Rule	Path
Allow	/

Other Records

Field	Value
crawl-delay	60

a6-indexer

Rule	Path
Disallow	/

alphaseobot

Rule	Path
Disallow	/

alphaseobot-sa

Rule	Path
Disallow	/

applebot

Rule	Path
Disallow	/

aspiegelbot

Rule	Path
Disallow	/

bingbot/2.0

Rule	Path
Disallow	/

blackboard safeassign

Rule	Path
Disallow	/

blexbot/1.0

Rule	Path
Disallow	/

bytespider

Rule	Path
Disallow	/

crawler4j

Rule	Path
Disallow	/

dotbot

Rule	Path
Disallow	/

gigabot

Rule	Path
Disallow	/

liebaofast

Rule	Path
Disallow	/

mauibot

Rule	Path
Disallow	/

mauibot (crawler.feedback+wc@gmail.com)

Rule	Path
Disallow	/

megaindex.ru/2.0

Rule	Path
Disallow	/

mqqbrowser

Rule	Path
Disallow	/

nimbostratus-bot/v1.3.2

Rule	Path
Disallow	/

seekport crawler

Rule	Path
Disallow	/

semrushbot

Rule	Path
Disallow	/

semrushbot-sa

Rule	Path
Disallow	/

seznambot

Rule	Path
Disallow	/

sputnikbot/2.3

Rule	Path
Disallow	/

the knowledge ai

Rule	Path
Disallow	/

turnitinbot

Rule	Path
Disallow	/

ucbrowser

Rule	Path
Disallow	/

yacybot

Rule	Path
Disallow	/

yeti

Rule	Path
Disallow	/

yisouspider

Rule	Path
Disallow	/

Other Records

Field	Value
sitemap	http://glorybeats.com/sitemap.xml
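
To illustrate how the groups listed above might resolve for a given crawler, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_EXCERPT` text is rebuilt from a few of the scanned groups and is not the original file: ordering, comments, and the `host` line are not reproduced. `ExampleBot` is a hypothetical crawler name standing in for any agent without its own group.

```python
from urllib.robotparser import RobotFileParser

# Excerpt rebuilt from the scanned groups; NOT the original robots.txt.
ROBOTS_EXCERPT = """\
User-agent: googlebot
Allow: /

User-agent: *
Allow: /
Crawl-delay: 60

User-agent: applebot
Disallow: /

User-agent: semrushbot
Disallow: /

Sitemap: http://glorybeats.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

home = "https://glorybeats.com/"

# Named groups are checked first; the `*` group is the fallback.
googlebot_allowed = parser.can_fetch("Googlebot", home)
applebot_allowed = parser.can_fetch("Applebot", home)
other_allowed = parser.can_fetch("ExampleBot", home)  # hypothetical unlisted crawler
other_delay = parser.crawl_delay("ExampleBot")        # falls back to the `*` group

print("Googlebot allowed:", googlebot_allowed)
print("Applebot allowed:", applebot_allowed)
print("ExampleBot allowed:", other_allowed, "crawl-delay:", other_delay)
```

Note that `urllib.robotparser` compares only the part of a crawler's name before any `/`, so groups whose names carry a version suffix, such as `bingbot/2.0`, may be matched differently by different parsers.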

Comments

Block bots
RDH, 08.19.19: I really don't want to block Applebot, but for now, I am. It is crawling us too much
RDH, 05.13.20: I really don't want to block bing, but for now, I am. It is also already in htaccess rules

Warnings

`host` is not a known field.
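
A warning like the one above could be produced by comparing each field name against a list of recognized fields. The sketch below is only an illustration of that idea: `KNOWN_FIELDS` and `unknown_field_warnings` are hypothetical names, the accepted set is an assumption rather than the scanner's actual list, and the `Host` value in the sample is a placeholder because the report does not show the original one.

```python
from typing import List

# Assumed set of recognized fields; the scanner's real list is not shown in the report.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def unknown_field_warnings(robots_txt: str) -> List[str]:
    """Return one warning per distinct field name that is not in KNOWN_FIELDS."""
    warnings: List[str] = []
    for raw_line in robots_txt.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw_line.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        message = f"`{field}` is not a known field."
        if field not in KNOWN_FIELDS and message not in warnings:
            warnings.append(message)
    return warnings


# Placeholder input; the Host value is invented for illustration only.
sample = "User-agent: *\nAllow: /\nHost: example.invalid\n"
print(unknown_field_warnings(sample))
```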
