TY - JOUR
T1 - Similarity-driven multi-view embeddings from high-dimensional biomedical data
AU - Avants, Brian B.
AU - Tustison, Nicholas J.
AU - Stone, James R.
N1 - Publisher Copyright:
© 2021, The Author(s), under exclusive licence to Springer Nature America, Inc. part of Springer Nature.
PY - 2021/2
Y1 - 2021/2
N2 - Diverse, high-dimensional modalities collected in large cohorts present new opportunities for the formulation and testing of integrative scientific hypotheses. Similarity-driven multi-view linear reconstruction (SiMLR) is an algorithm that exploits inter-modality relationships to transform large scientific datasets into smaller, more well-powered and interpretable low-dimensional spaces. SiMLR contributes an objective function to identify joint signal regularization based on sparse matrices representing prior within-modality relationships and an implementation that permits application to joint reduction of large data matrices. We demonstrate that SiMLR outperlforms closely related methods on supervised learning problems in simulation data, a multi-omics cancer survival prediction dataset and multiple modality neuroimaging datasets. Taken together, this collection of results shows that SiMLR may be applied to joint signal estimation from disparate modalities and may yield practically useful results in a variety of application domains.
AB - Diverse, high-dimensional modalities collected in large cohorts present new opportunities for the formulation and testing of integrative scientific hypotheses. Similarity-driven multi-view linear reconstruction (SiMLR) is an algorithm that exploits inter-modality relationships to transform large scientific datasets into smaller, more well-powered and interpretable low-dimensional spaces. SiMLR contributes an objective function to identify joint signal regularization based on sparse matrices representing prior within-modality relationships and an implementation that permits application to joint reduction of large data matrices. We demonstrate that SiMLR outperlforms closely related methods on supervised learning problems in simulation data, a multi-omics cancer survival prediction dataset and multiple modality neuroimaging datasets. Taken together, this collection of results shows that SiMLR may be applied to joint signal estimation from disparate modalities and may yield practically useful results in a variety of application domains.
UR - http://www.scopus.com/inward/record.url?scp=85105444022&partnerID=8YFLogxK
U2 - 10.1038/s43588-021-00029-8
DO - 10.1038/s43588-021-00029-8
M3 - Article
AN - SCOPUS:85105444022
SN - 2662-8457
VL - 1
SP - 143
EP - 152
JO - Nature Computational Science
JF - Nature Computational Science
IS - 2
ER -