dinmedia.de
robots.txt
Robots Exclusion Standard data for dinmedia.de
Resource Scan
Scan Details
Site Domain | dinmedia.de |
Base Domain | dinmedia.de |
Scan Status | Ok |
Last Scan | 2024-09-23T23:27:57+00:00 |
Next Scan | 2024-10-23T23:27:57+00:00 |
Last Scan
Scanned | 2024-09-23T23:27:57+00:00 |
URL | https://dinmedia.de/robots.txt |
Redirect | https://www.dinmedia.de//robots.txt |
Redirect Domain | www.dinmedia.de |
Redirect Base | dinmedia.de |
Domain IPs | 128.65.213.83 |
Redirect IPs | 128.65.213.83 |
Response IP | 128.65.213.83 |
Found | Yes |
Hash | 7c854c7357762ad7452ce21fcd5d408dc7979655fe65f6ab459c1cb15815ca9f |
SimHash | 629e5652cc11 |
Groups
*
Rule | Path |
---|---|
Disallow | /beuth/owa/ |
Disallow | /*%21suggest |
Disallow | /php/ |
slurp
ahrefsbot
alphabot
baiduspider
baiduspider-render
buck
changedetection
cliqzbot
exabot
flamingo_searchengine
grobbot
jobboersebot
jobs.de-robot
linkdexbot
mail.ru_bot
mauibot
mediatoolkitbot
mega-index
megaindex.ru
mj12bot
mojeekbot
nutch
pinterestbot
pixray-seeker
rogerbot
safednsbot
seokicks
seokicks-robot
seznambot
smtbot
sogou spider
spbot
trendictionbot
tweetmemebot
vebidoobot
wonderbot
wotbox
yacybot
yak
yandexantivirus
yandeximages
yandexmobilebot
yeti
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.dinmedia.de/service-sitemap-dinmedia-de-sitemap_index.xml |
Warnings
- 1 invalid line.