beuth.de
robots.txt

Robots Exclusion Standard data for beuth.de

Resource Scan

Scan Details

Site Domain beuth.de
Base Domain beuth.de
Scan Status Ok
Last Scan2024-06-14T07:05:16+00:00
Next Scan 2024-07-14T07:05:16+00:00

Last Scan

Scanned2024-06-14T07:05:16+00:00
URL https://beuth.de/robots.txt
Redirect https://www.dinmedia.de/robots.txt
Redirect Domain www.dinmedia.de
Redirect Base dinmedia.de
Domain IPs 128.65.213.34
Redirect IPs 128.65.213.83
Response IP 128.65.213.83
Found Yes
Hash 7c854c7357762ad7452ce21fcd5d408dc7979655fe65f6ab459c1cb15815ca9f
SimHash 629e5652cc11

Groups

*

Rule Path
Disallow /beuth/owa/
Disallow /*%21suggest
Disallow /php/

siteimprovebot-crawler

Rule Path
Allow /

slurp
ahrefsbot
alphabot
baiduspider
baiduspider-render
buck
changedetection
cliqzbot
exabot
flamingo_searchengine
grobbot
jobboersebot
jobs.de-robot
linkdexbot
mail.ru_bot
mauibot
mediatoolkitbot
mega-index
megaindex.ru
mj12bot
mojeekbot
nutch
pinterestbot
pixray-seeker
rogerbot
safednsbot
seokicks
seokicks-robot
seznambot
smtbot
sogou spider
spbot
trendictionbot
tweetmemebot
vebidoobot
wonderbot
wotbox
yacybot
yak
yandexantivirus
yandeximages
yandexmobilebot
yeti

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.dinmedia.de/service-sitemap-dinmedia-de-sitemap_index.xml

Warnings

  • 1 invalid line.