Multiple imputation in principal component analysis - Université de Rennes Accéder directement au contenu
Article Dans Une Revue Advances in Data Analysis and Classification Année : 2011

Multiple imputation in principal component analysis

Résumé

The available methods to handle missing values in principal component analysis only provide point estimates of the parameters (axes and components) and estimates of the missing values. To take into account the variability due to missing values a multiple imputation method is proposed. First a method to generate multiple imputed data sets from a principal component analysis model is defined. Then, two ways to visualize the uncertainty due to missing values onto the principal component analysis results are described. The first one consists in projecting the imputed data sets onto a reference configuration as supplementary elements to assess the stability of the individuals (respectively of the variables). The second one consists in performing a principal component analysis on each imputed data set and fitting each obtained configuration onto the reference one with Procrustes rotation. The latter strategy allows to assess the variability of the principal component analysis parameters induced by the missing values. The methodology is then evaluated from a real data set.
Fichier non déposé

Dates et versions

hal-00704514 , version 1 (05-06-2012)

Identifiants

Citer

Julie Josse, Jérome Pagès, François Husson. Multiple imputation in principal component analysis. Advances in Data Analysis and Classification, 2011, 5 (3), pp.231-246. ⟨10.1007/s11634-011-0086-7⟩. ⟨hal-00704514⟩
408 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More