Grouping users automatically based on their system usage
can be beneficial in an autonomic computing environment.
Clustering algorithms can generate meaningful
user groups that provide important insights to system administrators
about user profiles and group policies. In
particular, if a small amount of supervision is provided by
the administrator to the clustering process, semi-supervised
clustering algorithms can use this supervision to generate
clusters which are more useful for user management.
In this work, we demonstrate the utility of semisupervised
clustering in intelligent user management. We
collect publicly available system usage data of users in
a university computing environment, and cluster the users
using semi-supervised hierarchical agglomerative clustering
based on the profile of the processes they run. Initial
supervision is provided in the form of a few users running
a specific process. Semi-supervised clustering gives
us more meaningful clusters than unsupervised clustering
in this domain, demonstrating that our technique can find
interesting and useful groups in data with minimal user
intervention.
%0 Conference Proceedings
%1 citeulike:267341
%A Basu, Sugato
%A Bilenko, Mikhail
%A Mooney, Raymond J.
%B Proceedings of the IBM Austin Center for Advanced Studies 5th Annual Austin CAS Conference
%D 2004
%K active contraint user
%T Semisupervised Clustering for Intelligent User Management
%U http://citeseer.ifi.unizh.ch/667120.html
%X Grouping users automatically based on their system usage
can be beneficial in an autonomic computing environment.
Clustering algorithms can generate meaningful
user groups that provide important insights to system administrators
about user profiles and group policies. In
particular, if a small amount of supervision is provided by
the administrator to the clustering process, semi-supervised
clustering algorithms can use this supervision to generate
clusters which are more useful for user management.
In this work, we demonstrate the utility of semisupervised
clustering in intelligent user management. We
collect publicly available system usage data of users in
a university computing environment, and cluster the users
using semi-supervised hierarchical agglomerative clustering
based on the profile of the processes they run. Initial
supervision is provided in the form of a few users running
a specific process. Semi-supervised clustering gives
us more meaningful clusters than unsupervised clustering
in this domain, demonstrating that our technique can find
interesting and useful groups in data with minimal user
intervention.
@proceedings{citeulike:267341,
abstract = {Grouping users automatically based on their system usage
can be beneficial in an autonomic computing environment.
Clustering algorithms can generate meaningful
user groups that provide important insights to system administrators
about user profiles and group policies. In
particular, if a small amount of supervision is provided by
the administrator to the clustering process, semi-supervised
clustering algorithms can use this supervision to generate
clusters which are more useful for user management.
In this work, we demonstrate the utility of semisupervised
clustering in intelligent user management. We
collect publicly available system usage data of users in
a university computing environment, and cluster the users
using semi-supervised hierarchical agglomerative clustering
based on the profile of the processes they run. Initial
supervision is provided in the form of a few users running
a specific process. Semi-supervised clustering gives
us more meaningful clusters than unsupervised clustering
in this domain, demonstrating that our technique can find
interesting and useful groups in data with minimal user
intervention.},
added-at = {2006-06-16T10:34:37.000+0200},
author = {Basu, Sugato and Bilenko, Mikhail and Mooney, Raymond J.},
biburl = {https://www.bibsonomy.org/bibtex/2bd7857e33eb782ff3d8b921930e47d78/ldietz},
booktitle = {Proceedings of the IBM Austin Center for Advanced Studies 5th Annual Austin CAS Conference},
citeulike-article-id = {267341},
interhash = {92db9ab221ebab16e1e0aedd7fb667ff},
intrahash = {bd7857e33eb782ff3d8b921930e47d78},
keywords = {active contraint user},
month = {February},
priority = {1},
timestamp = {2006-06-16T10:34:37.000+0200},
title = {Semisupervised Clustering for Intelligent User Management},
url = {http://citeseer.ifi.unizh.ch/667120.html},
year = 2004
}