DIMENSIONALITY REDUCTION METHODS FOR BIOMEDICAL DATA

Authors

DOI:

https://doi.org/10.14311/CTJ.2018.1.%25x

Keywords:

biomedical data, dimensionality, biostatistics, multivariate analysis, sparsity

Abstract

The aim of this paper is to present basic principles of common multivariate statistical approaches to dimensionality reduction and to discuss three particular approaches, namely feature extraction, (prior) variable selection, and sparse variable selection. Their important examples are also presented in the paper, which includes the principal component analysis, minimum redundancy maximum relevance variable selection, and nearest shrunken centroid classifier with an intrinsic variable selection. Each of the three methods is illustrated on a real dataset with a biomedical motivation, including a biometric identification based on keystroke dynamics or a study of metabolomic profiles. Advantages and benefits of performing dimensionality reduction of multivariate data are discussed.

Author Biography

Jan Kalina, Institute of Computer Science of the Czech Academy of Sciences

Dept. of Medical Informatics and Biostatistics

Downloads

Published

2018-03-31

Issue

Section

Review (Only by a direct request from the Editor-in-chief!)