копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Online clustering of parallel data streams

J. Beringer, и E. Hüllermeier. Data & Knowledge Engineering, 58 (2): 180 - 204 (2006)
DOI: 10.1016/j.datak.2005.05.009

Аннотация

In recent years, the management and processing of so-called data streams has become a topic of active research in several fields of computer science such as, e.g., distributed systems, database systems, and data mining. A data stream can roughly be thought of as a transient, continuously increasing sequence of time-stamped data. In this paper, we consider the problem of clustering parallel streams of real-valued data, that is to say, continuously evolving time series. In other words, we are interested in grouping data streams the evolution over time of which is similar in a specific sense. In order to maintain an up-to-date clustering structure, it is necessary to analyze the incoming data in an online manner, tolerating not more than a constant time delay. For this purpose, we develop an efficient online version of the classical K-means clustering algorithm. Our method’s efficiency is mainly due to a scalable online transformation of the original data which allows for a fast computation of approximate distances between streams.

Описание

ScienceDirect.com - Data & Knowledge Engineering - Online clustering of parallel data streams

Линки и ресурсы

ключ BibTeX: Beringer2006180
тип записи: article
год: 2006
журнал: Data & Knowledge Engineering
номер: 2
страницы: 180 - 204
том: 58
issn: 0169-023X
DOI: 10.1016/j.datak.2005.05.009
url: http://www.sciencedirect.com/science/article/pii/S0169023X05000819

тэги

Цитировать эту публикацию

искать в

Метаданные

Последнее изменение 12 лет назад
Создан 12 лет назад

Комментарии и рецензии
(0)

Комментарии, или рецензии отсутствуют. Вы можете их написать!