wr.de
robots.txt
Robots Exclusion Standard data for wr.de
Resource Scan
Scan Details
Site Domain | wr.de |
Base Domain | wr.de |
Scan Status | Ok |
Last Scan | 2024-11-11T11:14:51+00:00 |
Next Scan | 2024-11-18T11:14:51+00:00 |
Last Scan
Scanned | 2024-11-11T11:14:51+00:00 |
URL | https://wr.de/robots.txt |
Redirect | https://www.wr.de:443/robots.txt |
Redirect Domain | www.wr.de |
Redirect Base | wr.de |
Domain IPs | 18.185.81.127, 18.196.221.37, 3.72.121.83 |
Redirect IPs | 13.35.210.11, 13.35.210.52, 13.35.210.75, 13.35.210.89, 2600:9000:2078:1800:19:b6f5:1200:93a1, 2600:9000:2078:5000:19:b6f5:1200:93a1, 2600:9000:2078:600:19:b6f5:1200:93a1, 2600:9000:2078:6e00:19:b6f5:1200:93a1, 2600:9000:2078:9c00:19:b6f5:1200:93a1, 2600:9000:2078:b200:19:b6f5:1200:93a1, 2600:9000:2078:b800:19:b6f5:1200:93a1, 2600:9000:2078:e00:19:b6f5:1200:93a1 |
Response IP | 13.35.210.75 |
Found | Yes |
Hash | 99617a18f2029fd19a3ac4da8bdf8b4db8fbc52cb1ef413dbe11b5a31ffc3382 |
SimHash | 581b9052c621 |
Groups
*
Rule | Path |
---|---|
Allow | /static/*/client.js |
Allow | /static/*/main.css |
Allow | /static/*/favicon.png |
Disallow | /stats/* |
Disallow | /*?config* |
Disallow | /*.xmli* |
Disallow | /*?service=Ajax |
Disallow | /*?service=ajax |
Disallow | /config/* |
Disallow | /test/* |
Disallow | /Test/* |
Disallow | /template/* |
Disallow | /*?*token=* |
Disallow | /*?*eventId=* |
Disallow | /static/* |
Disallow | /migration_import_no_section/* |
Disallow | /secure/ |
Disallow | /socialmedia/* |
Disallow | *reader_id%3DREADER_ID* |
Disallow | /suche/* |
Disallow | /*?widgetid= |
Disallow | /newsletter-result/ |
Disallow | *tpcc%3D* |
Disallow | /resources/ |
Disallow | /bin/ |
Disallow | /downloads/ |
Disallow | /service/newsletter-adconsent |
Disallow | /pagespeed_static/ |
Disallow | /resources/img/*icon*pagespeed |
semrushbot-sa
ahrefsbot
backlinkcrawler
linkchecker
dataforseobot
deepcrawl
majestic
majestic12
mj12bot
onpagebot
optimizer
rytebot
semrushbot
semrushbot-si
seobility
seodiver
seokicks
seokicks-robot
sistrix
openindexspider
openindexspider
sistrix optimizer
sistrix
sistrix crawler
siteauditbot
Rule | Path |
---|---|
Disallow | / |
amazonbot
anthropic-ai
applebot-extended
archive.org_bot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
googleother
gptbot
ia_archiver
img2dataset
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
youbot
meta-externalagent
imagesiftbot
Rule | Path |
---|---|
Disallow | / |
arquivo-web-crawler
arquivo.pt
barkrowler
blexbot
browsertrix
brozzler
builtwith
cincraw
coccocbot
contao/crawler
dmbot
domainstatsbot
dotbot
dotbot
fluid
haosouspider
happywing
harsilbot
hatena antenna
heritrix
imagesiftbot
kazbtbot
kraken
linkdebot
linkfluence yak bot
mail.ru_bot
metajobbot
monsidobot
netestate
ogdwctcxcrawler
petalbot
researchbot
riddler
sentibot
rogerbot
semanticbot
semanticscholarbot
sirdatabot
spbot
special_archiver
splitsignalbot
tag-crawler
testcrawler
thinkers-bot
toplistbot
uipbot/1.0
urlsuma
user-agent
vsusearchspider
weborama-fetcher
wiseguys robot
wpbot
yeti
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.wr.de/sitemaps/news.xml |
Comments