Article,

Robust Multimodal Audio-Visual Processing for Advanced Context Awareness in Smart Spaces

A. Pnevmatikakis, J. Soldatos, F. Talantzis, and L. Polymenakos.
Personal and Ubiquitous Computing, 13 (1): 3-14 (2009)
DOI: 10.1007/s00779-007-0169-9

Abstract

Identifying people and tracking their locations is a key prerequisite to achieving context awareness in smart spaces. Moreover, in realistic context-aware applications, these tasks have to be carried out in a non-obtrusive fashion. In this paper we present a set of robust person-identification and tracking algorithms, based on audio and visual processing. A main characteristic of these algorithms is that they operate on far-field and un-constrained audio-visual streams, which ensure that they are non-intrusive. We also illustrate that the combination of their outputs can lead to composite multimodal tracking components, which are suitable for supporting a broad range of context-aware services. In combining audio-visual processing results, we exploit a context-modeling approach based on a graph of situations. Accordingly, we discuss the implementation of realistic prototype applications that make use of the full range of audio, visual and multimodal algorithms.

BibTeX key: PnevmatikakisSoldatosEtAl09puc
entry type: article
year: 2009
journal: Personal and Ubiquitous Computing
number: 1
pages: 3-14
volume: 13
file: SpringerLink:2009/PnevmatikakisSoldatosEtAl09puc.pdf:PDF
issn: 1617-4909
groups: public
intrahash: 2b3b9d3ac9883746b8b63a9c85714735
DOI: 10.1007/s00779-007-0169-9
timestamp: 2009.09.05
username: flint63

BibSonomy

Robust Multimodal Audio-Visual Processing for Advanced Context Awareness in Smart Spaces

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on