PCA document reconstruction for email classification KU Leuven
This paper presents a document classifier based on text content features and its application to email classification. We test the validity of a classifier which uses Principal Component Analysis Document Reconstruction (PCADR), where the idea is that principal component analysis (PCA) can compress optimally only the kind of documents - in our experiments email classes - that are used to compute the principal components (PCs), and that for other ...