openjur.de
robots.txt

Robots Exclusion Standard data for openjur.de

Resource Scan

Scan Details

Site Domain openjur.de
Base Domain openjur.de
Scan Status Ok
Last Scan2024-06-15T08:35:25+00:00
Next Scan 2024-06-29T08:35:25+00:00

Last Scan

Scanned2024-06-15T08:35:25+00:00
URL https://openjur.de/robots.txt
Domain IPs 212.12.52.193, 2a00:14b0:4200:3400:193::1
Response IP 212.12.52.193
Found Yes
Hash 5a4fdbdf39fda2c390935f6504d7054f7cd11a8c96d9a42942c3eb54431d1fe3
SimHash 4324d44b8751

Groups

*

Rule Path
Disallow /u/*.pdf$
Disallow /u/*.print$
Disallow /a/*.html$
Disallow /a/*.pdf$
Disallow /u/*.xml$
Disallow /u/*.tex$
Disallow /u/*.json$
Disallow /u/*.ppdf$
Disallow /u/*.md$
Disallow /u/report/*.html$
Disallow /a/*.xml$
Disallow /edit/*/*.html$
Disallow /anlagen/
Disallow /js/
Disallow /css/
Disallow /img/
Disallow /gvp/
Disallow /suche/

ms search 4.0 robot
nutch
discobot
findestars
myonid
peekyou
pipl
rapleaf
snitch
spock
tweepz
wink
yasni
yoname
yourtraces
zoominfo
mj12bot
ia_archiver
semrushbot
dotbot
mauibot
ahrefsbot
blexbot
seekportbot
amazonbot
gptbot
turnitinbot

Rule Path
Disallow /

Warnings

  • 1 invalid line.