Abstract
The definition and extraction of actionable anomalous discords, i.e. pattern outliers, is a challenging
problem in data analysis. It raises the crucial issue of identifying criteria that would render a discord
more insightful than another one. In this paper, we propose an approach to address this by
introducing the concept of prominent discord. The core idea behind this new concept is to identify
dependencies among discords of varying lengths. How can we identify a discord that would be
prominent? We propose an ordering relation, that ranks discords, and we seek a set of prominent
discords with respect to this ordering. Our contributions are threefold 1) a formal definition,
ordering relation and methods to derive prominent discords based on Matrix Profile techniques,2)
their evaluation over large contextual climate data, covering 110 years of monthly data, and 3) a
comparison of an exact method based on STOMP and an approximate approach that is based on
SCRIMP++ to compute the prominent discords and study the tradeoff optimality/CPU. The
approach is generic and its pertinence shown over historical climate data.
Users
Please
log in to take part in the discussion (add own reviews or comments).