opac.ub.lmu.de
robots.txt

Robots Exclusion Standard data for opac.ub.lmu.de

Archived Snapshots

Resource Scan

Scan Details

Site Domain	opac.ub.lmu.de
Base Domain	lmu.de
Scan Status	Ok
Last Scan	2024-11-03T11:55:35+00:00
Next Scan	2024-12-03T11:55:35+00:00

Last Scan

Scanned	2024-11-03T11:55:35+00:00
URL	https://opac.ub.lmu.de/robots.txt
Domain IPs	141.84.147.15
Response IP	141.84.147.15
Found	Yes
Hash	28a09517cc9c0f6440077dc3a48a201481ebfea6c72083db1c17669c9af1b85c
SimHash	72b6d8398ef0

Groups

*

Rule	Path
Disallow	/AJAX
Disallow	/Alphabrowse
Disallow	/Browse
Disallow	/Search/Results
Disallow	/Primo
Disallow	/PrimoRecord
Disallow	/Cover
Disallow	/Cover/Show
Disallow	/Resource
Disallow	/AJAX/
Disallow	/AlphaBrowse/
Disallow	/Browse/
Disallow	/Search/Results/
Disallow	/Primo/
Disallow	/PrimoRecord/
Disallow	/Cover/
Disallow	/Cover/Show/
Disallow	/Resource/

Rule

Path

Disallow

/AJAX

Disallow

/Alphabrowse

Disallow

/Browse

Disallow

/Search/Results

Disallow

/Primo

Disallow

/PrimoRecord

Disallow

/Cover

Disallow

/Cover/Show

Disallow

/Resource

Disallow

/AJAX/

Disallow

/AlphaBrowse/

Disallow

/Browse/

Disallow

/Search/Results/

Disallow

/Primo/

Disallow

/PrimoRecord/

Disallow

/Cover/

Disallow

/Cover/Show/

Disallow

/Resource/

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

sogou web spider

Rule	Path
Disallow	/

Rule

Path

Disallow

sogou inst spider

Rule	Path
Disallow	/

Rule

Path

Disallow

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot/1.0

Rule	Path
Disallow	/

Rule

Path

Disallow

claude-web

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

diffbot

Rule	Path
Disallow	/

Rule

Path

Disallow

facebookbot

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

omgili

Rule	Path
Disallow	/

Rule

Path

Disallow

academicbotrtu

Rule	Path
Disallow	/

Rule

Path

Disallow

dataforseobot

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

petalbot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

test-bot

Rule	Path
Disallow	/

Rule

Path

Disallow

Comments

Block Bots
Claude
Common Crawl
ChatGPT user prompt research
Google AI training data crawl
OpenAI training data crawl

opac.ub.lmu.derobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

bytespider

sogou web spider

sogou inst spider

anthropic-ai

claudebot

claudebot/1.0

claude-web

ccbot

chatgpt-user

diffbot

facebookbot

google-extended

omgili

academicbotrtu

dataforseobot

gptbot

petalbot

semrushbot

test-bot

Comments

opac.ub.lmu.de
robots.txt