
Publication

Cellwise robust regularized discriminant analysis

Journal contribution - Journal article

Quadratic and Linear Discriminant Analysis (QDA/LDA) are the most often applied classification rules under normality. In QDA, a separate covariance matrix is estimated for each group. If there are more variables than observations in the groups, the usual estimates are singular and can no longer be used. Assuming homoscedasticity, as in LDA, reduces the number of parameters to estimate. This rather strong assumption is, however, rarely verified in practice. Regularized discriminant techniques that are computable in high dimensions and cover the path between the two extremes QDA and LDA have been proposed in the literature. However, these procedures rely on sample covariance matrices. As such, they become inappropriate in the presence of cellwise outliers, a type of outlier that is very likely to occur in high-dimensional datasets. In this paper, we propose cellwise robust counterparts of these regularized discriminant techniques by inserting cellwise robust covariance matrices. Our methodology results in a family of discriminant methods that (i) are robust against outlying cells, (ii) cover the gap between LDA and QDA and (iii) are computable in high dimensions. The good performance of the new methods is illustrated through simulated and real data examples. As a by-product, visual tools are provided for the detection of outliers.
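To make the idea of a path between QDA and LDA concrete, the following minimal sketch blends each group's covariance matrix with the pooled covariance matrix through a single parameter. This is the classical convex-combination form of regularized discriminant analysis and is given only as an illustration; it is not the estimator proposed in the paper. The function names (`sample_cov`, `pooled_cov`, `blended_cov`) and the parameter `lam` are hypothetical. The sketch uses plain sample covariances, which is exactly the ingredient the abstract describes as vulnerable to cellwise outliers; a cellwise robust covariance estimate would take the place of `sample_cov`.

```python
def sample_cov(rows):
    """Unbiased sample covariance matrix of a list of equal-length rows."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    return [[sum((r[i] - means[i]) * (r[j] - means[j]) for r in rows) / (n - 1)
             for j in range(p)] for i in range(p)]


def pooled_cov(groups):
    """Pooled covariance: group covariances weighted by their degrees of freedom."""
    p = len(groups[0][0])
    dof = sum(len(g) - 1 for g in groups)
    covs = [sample_cov(g) for g in groups]
    return [[sum((len(g) - 1) * c[i][j] for g, c in zip(groups, covs)) / dof
             for j in range(p)] for i in range(p)]


def blended_cov(group, groups, lam):
    """Return (1 - lam) * S_k + lam * S_pooled.

    lam = 0 keeps the group's own covariance (QDA end of the path);
    lam = 1 uses the pooled covariance for every group (LDA end).
    """
    s_k, s_p = sample_cov(group), pooled_cov(groups)
    p = len(s_k)
    return [[(1 - lam) * s_k[i][j] + lam * s_p[i][j] for j in range(p)]
            for i in range(p)]


# Toy data (made up for illustration): two groups, two variables each.
groups = [
    [[1.0, 2.0], [2.0, 4.0], [3.0, 5.0]],
    [[0.0, 1.0], [1.0, 0.0], [2.0, 2.0]],
]
halfway = blended_cov(groups[0], groups, lam=0.5)
```

Intermediate values of `lam` borrow strength across groups, which helps when a single group has too few observations for its own covariance estimate to be invertible. Note that with more variables than total observations even the pooled matrix is singular, so additional regularization would still be needed in that setting.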
Journal: Statistical Analysis and Data Mining
ISSN: 1932-1864
Issue: 6
Volume: 10
Pages: 436 - 447
Year of publication: 2017