,

Maintaining imbalance highly dependent medical data using dirichlet process data generation

, , и .
2011 Sixth International Conference on Digital Information Management, Institute of Electrical and Electronics Engineers (IEEE), (сентября 2011)
DOI: 10.1109/icdim.2011.6093359

Аннотация

The existence of imbalanced data between one class and another class is an important issue to be considered in a classification problem. One of the well-known data balancing technique is the artificial oversampling, which increase the size of datasets. In this research, multinomial classification was applied to classify some recorded features obtained from a single ECG (electrocardiograph) sensor. Therefore, a Dirichlet process, a dirichlet distribution of cumulative distribution function of each data partition, was needed to model the distribution of the new generated data by also considering the statistical properties of the previous data. Data balancing process had given the result of 77.21% classification accuracy (CA), and 90.9% area under ROC curve (AUC).

тэги

Пользователи данного ресурса

  • @fanany
  • @dblp

Комментарии и рецензии