uni-giessen.de
robots.txt
Robots Exclusion Standard data for uni-giessen.de
Resource Scan
Scan Details
Site Domain | uni-giessen.de |
Base Domain | uni-giessen.de |
Scan Status | Ok |
Last Scan | 2024-08-29T20:47:47+00:00 |
Next Scan | 2024-09-28T20:47:47+00:00 |
Last Scan
Scanned | 2024-08-29T20:47:47+00:00 |
URL | https://uni-giessen.de/robots.txt |
Redirect | https://www.uni-giessen.de/robots.txt |
Redirect Domain | www.uni-giessen.de |
Redirect Base | uni-giessen.de |
Domain IPs | 134.176.3.22 |
Redirect IPs | 134.176.3.22 |
Response IP | 134.176.3.22 |
Found | Yes |
Hash | 5827e5186b2f8e5636aa2a7539c86a8322970e6972ec7794c0f91e1bbaeea7bd |
SimHash | 3b3827394ce1 |
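The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The scan does not name the algorithm or say which bytes were hashed, so treat both as assumptions. The sketch below shows how a fetched robots.txt body could be compared against the reported value under those assumptions. The helper names `sha256_hex` and `matches_reported` are hypothetical.

```python
import hashlib

# Value copied from the "Hash" row of the scan.
REPORTED_HASH = "5827e5186b2f8e5636aa2a7539c86a8322970e6972ec7794c0f91e1bbaeea7bd"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes) -> bool:
    """True if the body hashes to the value shown in the scan.

    Assumption: the scanner hashed the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == REPORTED_HASH
```

A mismatch would not necessarily mean the file changed. The scanner may have normalised the content before hashing, or used a different algorithm.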
Groups
*
Rule | Path |
---|---|
Disallow | |
afilias web mining tool
afiliaswebminingtool
aihitbot
askpeterbot
bdbrandprotect
bpimagewalker*
bpimagewalker
blexbot
bubing
catchbot
comodo ssl checker
comodosslchecker
comodo-certificates-spider
content crawler
contentcrawler
curl
dcpbot
discobot
docomo/2.0
dotbot
drupact
ec2linkfinder
edisterbot
erocrawler
exb language crawler
ezooms
findlinks
gonzo
gslfbot
htdig
huaweisymantecspider
icjobs
infohelfer
ips-agent
ipsagent
it2media-domain-crawler
java
kaloogabot
larbin
lb-spider
lex
libwww-perl
linkdex.com
lssrocketcrawler
ltx71 - (http://ltx71.com/)
ltx71
magpie-crawler
majesticseo
mail.ru_bot
mia
mj12bot
mlbot
msiecrawler
msnbot
netestate ne crawler
netestatenecrawler
nutch
obot
openindexspider
opidoobot
pagepeeker
picmole
pixray-seeker
psbot
qualidator*
reverseget
schrein
scooter
searchmetricsbot
search17
semrushbot
semrushbot-sa
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
semrushbot-bm
seznambot
sistrix
sistrix crawler
sistrix
slysearch
solomonobot
solomonolinkchecker
sosospider
spbot
spiderlytics
suggybot
surveybot
swebot
thunderstone
turnitinbot
tineye
unisterbot
unister
unister*
updownerbot
wbsearchbot
web image collector
webauto
webbandit
webbandit/3.50
webcopier
webinator
webenhancer
webmastercoffee
webmasterworld extractor
webmasterworldforumbot
webofantbot
websauger
website quester
webster pro
webstripper
webvac
webzip
webzip/4.0
wotbox
www-collector-e
x28-job-bot
xovi
xovibot
yandex
yacybot
yeti
yeti-mobile
youdaobot
xenu's link sleuth
xenu's
zeus
zeus 32297 webster pro v2.9 win32
zeus link scout
Rule | Path |
---|---|
Disallow | /*atct_album_view$ |
Disallow | /*folder_factories$ |
Disallow | /*folder_summary_view$ |
Disallow | /*login_form$ |
Disallow | /*mail_password_form$ |
Disallow | /%40%40search |
Disallow | /*search_rss$ |
Disallow | /*sendto_form$ |
Disallow | /*summary_view$ |
Disallow | /*thumbnail_view$ |
Disallow | /*view$ |
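The paths in the rule table above use two wildcard characters: `*` matches any run of characters, and a trailing `$` anchors the pattern to the end of the path. A pattern without `$` acts as a prefix match. The following is a minimal sketch of that matching, assuming the common wildcard interpretation described in RFC 9309. It is not the scanner's or any particular crawler's implementation, and the helper names `pattern_to_regex` and `is_disallowed` are hypothetical. Paths are compared as written, with no percent-decoding.

```python
import re

# Disallow patterns copied from the rule table above.
DISALLOW = [
    "/*atct_album_view$",
    "/*folder_factories$",
    "/*folder_summary_view$",
    "/*login_form$",
    "/*mail_password_form$",
    "/%40%40search",
    "/*search_rss$",
    "/*sendto_form$",
    "/*summary_view$",
    "/*thumbnail_view$",
    "/*view$",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    # '\Z' matches only at the very end of the string.
    return re.compile(regex + (r"\Z" if anchored else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_disallowed(path: str) -> bool:
    """True if any Disallow pattern matches from the start of the path."""
    return any(rule.match(path) for rule in RULES)
```

Under this reading, `is_disallowed("/fbz/login_form")` is true while `is_disallowed("/fbz/login_form/help")` is false, because the `$` anchor requires the path to end at `login_form`. Note that `/*view$` matches every path ending in `view`, so it also covers the more specific `..._view$` entries and would block a path such as `/studium/overview`.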
Other Records
Field | Value |
---|---|
sitemap | https://www.uni-giessen.de/sitemap.xml.gz |
Warnings
- 4 invalid lines.
Comments