Co-selection of features and instances for unsupervised rare category analysis

Jingrui He, Jaime Carbonell

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

Rare category analysis is of key importance both in theory and in practice. Previous research work focuses on supervised rare category analysis, such as rare category detection and rare category classification. In this paper, for the first time, we address the challenge of unsupervised rare category analysis, including feature selection and rare category selection. We propose to jointly deal with the two correlated tasks so that they can benefit from each other. To this end, we design an optimization framework which is able to co-select the relevant features and the examples from the rare category (a.k.a. the minority class). It is well justified theoretically. Furthermore, we develop the Partial Augmented Lagrangian Method (PALM) to solve the optimization problem. Experimental results on both synthetic and real data sets show the effectiveness of the proposed method.

Original languageEnglish (US)
Title of host publicationProceedings of the 10th SIAM International Conference on Data Mining, SDM 2010
Pages525-536
Number of pages12
StatePublished - 2010
Externally publishedYes
Event10th SIAM International Conference on Data Mining, SDM 2010 - Columbus, OH, United States
Duration: Apr 29 2010May 1 2010

Other

Other10th SIAM International Conference on Data Mining, SDM 2010
Country/TerritoryUnited States
CityColumbus, OH
Period4/29/105/1/10

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'Co-selection of features and instances for unsupervised rare category analysis'. Together they form a unique fingerprint.

Cite this