wp.de
robots.txt
Robots Exclusion Standard data for wp.de
Resource Scan
Scan Details
Site Domain | wp.de |
Base Domain | wp.de |
Scan Status | Ok |
Last Scan | 2024-11-14T14:27:55+00:00 |
Next Scan | 2024-11-21T14:27:55+00:00 |
Last Scan
Scanned | 2024-11-14T14:27:55+00:00 |
URL | https://wp.de/robots.txt |
Redirect | https://www.wp.de:443/robots.txt |
Redirect Domain | www.wp.de |
Redirect Base | wp.de |
Domain IPs | 18.185.81.127, 18.196.221.37, 3.72.121.83 |
Redirect IPs | 13.227.254.34, 13.227.254.48, 13.227.254.5, 13.227.254.52, 2600:9000:200a:1c00:1a:220c:4f40:93a1, 2600:9000:200a:2600:1a:220c:4f40:93a1, 2600:9000:200a:2a00:1a:220c:4f40:93a1, 2600:9000:200a:4000:1a:220c:4f40:93a1, 2600:9000:200a:5800:1a:220c:4f40:93a1, 2600:9000:200a:6200:1a:220c:4f40:93a1, 2600:9000:200a:a00:1a:220c:4f40:93a1, 2600:9000:200a:b400:1a:220c:4f40:93a1 |
Response IP | 13.227.254.52 |
Found | Yes |
Hash | 17a4e2989e37d6fe649e32e40ce8c9bf6421367820e3c93e3c2636b101b915d6 |
SimHash | 5c1b8052c621 |
Groups
*
Rule | Path |
---|---|
Allow | /static/*/client.js |
Allow | /static/*/main.css |
Allow | /static/*/favicon.png |
Disallow | /stats/* |
Disallow | /*?config* |
Disallow | /*.xmli* |
Disallow | /*?service=Ajax |
Disallow | /*?service=ajax |
Disallow | /config/* |
Disallow | /test/* |
Disallow | /Test/* |
Disallow | /template/* |
Disallow | /*?*token=* |
Disallow | /*?*eventId=* |
Disallow | /static/* |
Disallow | /migration_import_no_section/* |
Disallow | /secure/ |
Disallow | /socialmedia/* |
Disallow | *reader_id%3DREADER_ID* |
Disallow | /suche/* |
Disallow | /*?widgetid= |
Disallow | /newsletter-result/ |
Disallow | *tpcc%3D* |
Disallow | /resources/ |
Disallow | /bin/ |
Disallow | /downloads/ |
Disallow | /service/newsletter-adconsent |
Disallow | /pagespeed_static/ |
Disallow | /resources/img/*icon*pagespeed |
semrushbot-sa
ahrefsbot
backlinkcrawler
linkchecker
dataforseobot
deepcrawl
majestic
majestic12
mj12bot
onpagebot
optimizer
rytebot
semrushbot
semrushbot-si
seobility
seodiver
seokicks
seokicks-robot
sistrix
openindexspider
openindexspider
sistrix optimizer
sistrix
sistrix crawler
siteauditbot
Rule | Path |
---|---|
Disallow | / |
amazonbot
anthropic-ai
applebot-extended
archive.org_bot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
googleother
gptbot
ia_archiver
img2dataset
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
youbot
meta-externalagent
imagesiftbot
Rule | Path |
---|---|
Disallow | / |
arquivo-web-crawler
arquivo.pt
barkrowler
blexbot
browsertrix
brozzler
builtwith
cincraw
coccocbot
contao/crawler
dmbot
domainstatsbot
dotbot
dotbot
fluid
haosouspider
happywing
harsilbot
hatena antenna
heritrix
imagesiftbot
kazbtbot
kraken
linkdebot
linkfluence yak bot
mail.ru_bot
metajobbot
monsidobot
netestate
ogdwctcxcrawler
petalbot
researchbot
riddler
sentibot
rogerbot
semanticbot
semanticscholarbot
sirdatabot
spbot
special_archiver
splitsignalbot
tag-crawler
testcrawler
thinkers-bot
toplistbot
uipbot/1.0
urlsuma
user-agent
vsusearchspider
weborama-fetcher
wiseguys robot
wpbot
yeti
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.wp.de/sitemaps/news.xml |
Comments