Robots Exclusion Standard data for newsclick.de
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | newsclick.de |
Base Domain | newsclick.de |
Scan Status | Ok |
Last Scan | 2024-11-10T21:48:02+00:00 |
Next Scan | 2024-12-10T21:48:02+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-10T21:48:02+00:00 |
URL | http://www.newsclick.de/robots.txt |
Redirect | https://www.braunschweiger-zeitung.de/robots.txt |
Redirect Domain | www.braunschweiger-zeitung.de |
Redirect Base | braunschweiger-zeitung.de |
Domain IPs | 62.116.130.8 |
Redirect IPs | 13.35.238.25, 13.35.238.32, 13.35.238.40, 13.35.238.65, 2600:9000:2085:1600:5:7e4d:58c0:93a1, 2600:9000:2085:3600:5:7e4d:58c0:93a1, 2600:9000:2085:3a00:5:7e4d:58c0:93a1, 2600:9000:2085:7600:5:7e4d:58c0:93a1, 2600:9000:2085:7e00:5:7e4d:58c0:93a1, 2600:9000:2085:8c00:5:7e4d:58c0:93a1, 2600:9000:2085:9600:5:7e4d:58c0:93a1, 2600:9000:2085:9a00:5:7e4d:58c0:93a1 |
Response IP | 13.35.238.25 |
Found | Yes |
Hash | 7890011d2b9c557383ac6170f990f46d5abc38c25597d238af1d3f842eec65ac |
SimHash | 580b8052c621 |
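The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not say which algorithm the scanner uses or which bytes it hashes, so the following is only a minimal sketch under the assumption that the digest is SHA-256 over the raw response body. The function name `body_fingerprint` and the sample body are hypothetical.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256 over the raw bytes. The scanner's real
    algorithm and input normalization are not stated in the report.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the actual file served by the site.
sample = b"User-agent: *\nDisallow: /secure/\n"
digest = body_fingerprint(sample)
print(len(digest))  # SHA-256 hex digests are always 64 characters
```

Comparing such a digest between two scans would show whether the file changed byte for byte. A similarity hash such as the SimHash field presumably serves the complementary purpose of detecting near-identical files.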
Groups
*
Rule | Path |
---|---|
Allow | /static/*/client.js |
Allow | /static/*/main.css |
Allow | /static/*/favicon.png |
Disallow | /stats/* |
Disallow | /*?config* |
Disallow | /*.xmli* |
Disallow | /*?service=Ajax |
Disallow | /*?service=ajax |
Disallow | /config/* |
Disallow | /test/* |
Disallow | /Test/* |
Disallow | /template/* |
Disallow | /*?*token=* |
Disallow | /*?*eventId=* |
Disallow | /static/* |
Disallow | /migration_import_no_section/* |
Disallow | /secure/ |
Disallow | /socialmedia/* |
Disallow | *reader_id%3DREADER_ID* |
Disallow | /suche/* |
Disallow | /*?widgetid= |
Disallow | /newsletter-result/ |
Disallow | *tpcc%3D* |
Disallow | /resources/ |
Disallow | /bin/ |
Disallow | /downloads/ |
Disallow | /service/newsletter-adconsent |
Disallow | /pagespeed_static/ |
Disallow | /resources/img/*icon*pagespeed |
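The `*` group mixes Allow and Disallow rules that overlap, for example `Allow /static/*/client.js` next to `Disallow /static/*`. The sketch below illustrates how such rules could be evaluated, assuming the precedence described in RFC 9309: the longest matching pattern wins, and Allow wins a tie. It uses only a subset of the rules listed above; the helper names `pattern_to_regex` and `is_allowed` are hypothetical, and individual crawlers may resolve conflicts differently.

```python
import re

# A subset of the rules listed for the "*" group above.
RULES = [
    ("allow", "/static/*/client.js"),
    ("allow", "/static/*/main.css"),
    ("disallow", "/static/*"),
    ("disallow", "/*?*token=*"),
    ("disallow", "/secure/"),
    ("disallow", "/suche/*"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored_end else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Longest matching pattern wins; Allow wins ties (assumed precedence)."""
    best_len = -1
    allowed = True  # paths matched by no rule are allowed by default
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as prefix matching requires.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


print(is_allowed("/static/abc123/client.js"))
print(is_allowed("/static/abc123/other.js"))
```

Under these assumptions the first path is allowed because the Allow pattern is longer than `/static/*`, while the second is blocked because only the Disallow pattern matches.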
semrushbot-sa
ahrefsbot
backlinkcrawler
linkchecker
dataforseobot
deepcrawl
majestic
majestic12
mj12bot
onpagebot
optimizer
rytebot
semrushbot
semrushbot-si
seobility
seodiver
seokicks
seokicks-robot
sistrix
openindexspider
openindexspider
sistrix optimizer
sistrix
sistrix crawler
siteauditbot
Rule | Path |
---|---|
Disallow | / |
amazonbot
anthropic-ai
applebot-extended
archive.org_bot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
googleother
gptbot
ia_archiver
img2dataset
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
youbot
meta-externalagent
imagesiftbot
Rule | Path |
---|---|
Disallow | / |
arquivo-web-crawler
arquivo.pt
barkrowler
blexbot
browsertrix
brozzler
builtwith
cincraw
coccocbot
contao/crawler
dmbot
domainstatsbot
dotbot
dotbot
fluid
haosouspider
happywing
harsilbot
hatena antenna
heritrix
imagesiftbot
kazbtbot
kraken
linkdebot
linkfluence yak bot
mail.ru_bot
metajobbot
monsidobot
netestate
ogdwctcxcrawler
petalbot
researchbot
riddler
sentibot
rogerbot
semanticbot
semanticscholarbot
sirdatabot
spbot
special_archiver
splitsignalbot
tag-crawler
testcrawler
thinkers-bot
toplistbot
uipbot/1.0
urlsuma
user-agent
vsusearchspider
weborama-fetcher
wiseguys robot
wpbot
yeti
Rule | Path |
---|---|
Disallow | / |
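The named groups above block entire crawlers with `Disallow /`, while every other agent falls back to the `*` group. A rough sketch of that group selection follows. It assumes case-insensitive substring matching of the user-agent token against the crawler's User-Agent string, which is a simplification of the product-token matching in RFC 9309. The `GROUPS` mapping holds only a few of the agents from the report, and `select_rules` is a hypothetical helper.

```python
# Hypothetical subset of the groups in the report, keyed by agent tokens.
GROUPS = {
    ("gptbot", "claudebot", "ccbot"): [("disallow", "/")],
    ("ahrefsbot", "mj12bot", "semrushbot"): [("disallow", "/")],
    ("*",): [("disallow", "/secure/")],
}


def select_rules(user_agent: str):
    """Return the rules of the first group naming this agent, else the "*" group."""
    ua = user_agent.lower()
    for tokens, rules in GROUPS.items():
        # Simplification: substring match instead of exact product-token match.
        if any(tok != "*" and tok in ua for tok in tokens):
            return rules
    return GROUPS[("*",)]


print(select_rules("Mozilla/5.0 (compatible; GPTBot/1.0)"))
print(select_rules("Mozilla/5.0 (X11; Linux x86_64) Firefox/120.0"))
```

A crawler that matches a named group uses only that group's rules, so under this reading the agents listed above are excluded from the whole site rather than inheriting the finer-grained `*` rules.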
Other Records
Field | Value |
---|---|
sitemap | https://www.braunschweiger-zeitung.de/sitemaps/news.xml |