Master's thesis

Dimensionality reduction for bag-of-words models: PCA vs LSA

(2017)

Abstract

We study a collection of texts represented as “bags of words” and implement two methods, PCA and LSA, for reducing the dimension of the data. We then compare how easily authorship identification can be performed on the dimension-reduced data.
