Intrinsic-dimension analysis for guiding dimensionality reduction and data fusion in multi-omics data processing
Articolo
Data di Pubblicazione:
2025
Citazione:
Intrinsic-dimension analysis for guiding dimensionality reduction and data fusion in multi-omics data processing / J. Gliozzo, M. Soto-Gomez, V. Guarino, A. Bonometti, A. Cabri, E. Cavalleri, J. Reese, P.N. Robinson, M. Mesiti, G. Valentini, E. Casiraghi. - In: ARTIFICIAL INTELLIGENCE IN MEDICINE. - ISSN 0933-3657. - 160:(2025 Feb), pp. 103049.1-103049.13. [10.1016/j.artmed.2024.103049]
Abstract:
Multi-omics data have revolutionized biomedical research by providing a comprehensive understanding of biological systems and the molecular mechanisms of disease development. However, analyzing multi-omics data is challenging due to high dimensionality and limited sample sizes, necessitating proper data-reduction pipelines to ensure reliable analyses. Additionally, its multimodal nature requires effective data-integration pipelines. While several dimensionality reduction and data fusion algorithms have been proposed, crucial aspects are often overlooked. Specifically, the choice of projection space dimension is typically heuristic and uniformly applied across all omics, neglecting the unique high dimension small sample size challenges faced by individual omics. This paper introduces a novel multi-modal dimensionality reduction pipeline tailored to individual views. By leveraging intrinsic dimensionality estimators, we assess the curse-of-dimensionality impact on each view and propose a two-step reduction strategy for significantly affected views, combining feature selection with feature extraction. Compared to traditional uniform reduction pipelines in a crucial and supervised multi-omics analysis setting, our approach shows significant improvement. Additionally, we explore three effective unsupervised multi-omics data fusion methods rooted in the main data fusion strategies to gain insights into their performance under crucial, yet overlooked, settings.
Tipologia IRIS:
01 - Articolo su periodico
Keywords:
Data fusion; Dimensionality reduction; Feature extraction; Feature selection; Intrinsic dimensionality; Multi-omics datasets
Elenco autori:
J. Gliozzo, M. Soto-Gomez, V. Guarino, A. Bonometti, A. Cabri, E. Cavalleri, J. Reese, P.N. Robinson, M. Mesiti, G. Valentini, E. Casiraghi
Link alla scheda completa:
Link al Full Text: