elternnachricht.de
robots.txt

Robots Exclusion Standard data for elternnachricht.de

Resource Scan

Scan Details

Site Domain elternnachricht.de
Base Domain elternnachricht.de
Scan Status Ok
Last Scan 2024-10-18T05:22:07+00:00
Next Scan 2024-11-17T05:22:07+00:00

Last Scan

Scanned 2024-10-18T05:22:07+00:00
URL https://elternnachricht.de/robots.txt
Redirect https://www.elternnachricht.de/robots.txt
Redirect Domain www.elternnachricht.de
Redirect Base elternnachricht.de
Domain IPs 23.88.92.106
Redirect IPs 23.88.92.106
Response IP 23.88.92.106
Found Yes
Hash 4cd8e2b4d0c97ad9e5778e9551a17a3866efcbe959ad5628d64874f428ddca7c
SimHash 721c40f5af98

Groups

*

Rule Path
Disallow /anmelden
Disallow /anhaenge
Disallow /bestaetigung
Disallow /feedback
Disallow /bestellung
Disallow /bestellung-erfolgreich
Disallow /admin
Disallow /secure
Disallow /*.docx$
Disallow /*.xslx$
Disallow /fehlzeiten/*
Disallow /videokonferenz/*

ahrefsbot
wellknownbot
livelapbot
wpbot
seobilitybot
telegrambot
mail.ru_bot
petalbot
aasa-bot
indeedbot
twitterbot
pinterestbot
trendictionbot
mj12bot
seznambot
blexbot
dotbot
deusu
cliqzbot
baiduspider
baiduspider-video
baiduspider-image
betabot
domainappender
sogou spider
sogou web spider
nutch
spiderbot
spiderbot/nutch-1.7
yandex
youdaobot
duckduckgo-favicons-bot
megaindex
yandexbot
msnbot-media
jobs.de-robot
exabot
haosouspider
yacybot
proximic
grapeshotcrawler
oncrawl
safednsbot
ia_archiver
ia_archiver/1.6
semrushbot
semrushbot-sa
sistrix
alphabot
seokicks
seokicks-robot
wget
httrack
dataforseobot
gigabot
searchmetricsbot
siteanalyzerbot
scrapy
tracemyfile

Rule Path
Disallow /
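As an illustration of how a crawler might evaluate the two groups above, here is a minimal Python sketch. It is not the site's or any crawler's actual implementation. The names `STAR_GROUP`, `BLOCKED_AGENTS`, `pattern_to_regex` and `is_allowed` are hypothetical, only a handful of the listed user agents are included, and agent matching is simplified to an exact, case-insensitive comparison. The sketch assumes the common wildcard semantics in which `*` matches any run of characters and a trailing `$` anchors the end of the path. Python's built-in `urllib.robotparser` is not used because it does not interpret these wildcards.

```python
import re

# Paths copied from the "*" group above (all are Disallow rules).
STAR_GROUP = [
    "/anmelden", "/anhaenge", "/bestaetigung", "/feedback",
    "/bestellung", "/bestellung-erfolgreich", "/admin", "/secure",
    "/*.docx$", "/*.xslx$", "/fehlzeiten/*", "/videokonferenz/*",
]

# A small sample of the user agents in the second group ("Disallow /").
# Assumption: exact, case-insensitive name matching is good enough here.
BLOCKED_AGENTS = {"ahrefsbot", "semrushbot", "mj12bot", "wget", "scrapy"}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path.
    Without '$' the pattern acts as a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    if agent.lower() in BLOCKED_AGENTS:
        return False  # second group: everything is disallowed
    # re.match anchors at the start of the path, giving prefix semantics.
    return not any(pattern_to_regex(p).match(path) for p in STAR_GROUP)


if __name__ == "__main__":
    for agent, path in [
        ("googlebot", "/"),
        ("googlebot", "/anmelden"),
        ("googlebot", "/docs/file.docx"),
        ("googlebot", "/report.xlsx"),
        ("AhrefsBot", "/"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Because both groups contain only Disallow rules, the sketch does not need the longest-match tie-breaking between Allow and Disallow. Note that the rule is listed as `/*.xslx$`, so under these semantics a path ending in `.xlsx` is not matched by it.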

Other Records

Field Value
sitemap https://www.elternnachricht.de/sitemaps/sitemap.xml