Non-monotonic feature selection

Zenglin Xu; Rong Jin; Jieping Ye; Michael R. Lyu; Irwin King

doi:10.1145/1553374.1553520

Computing and Augmented Intelligence, School of (IAFSE-SCAI)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

14 Scopus citations

Abstract

We consider the problem of selecting a subset of m most informative features where m is the number of required features. This feature selection problem is essentially a combinatorial optimization problem, and is usually solved by an approximation. Conventional feature selection methods address the computational challenge in two steps: (a) ranking all the features by certain scores that are usually computed independently from the number of specified features m, and (b) selecting the top m ranked features. One major shortcoming of these approaches is that if a feature f is chosen when the number of specified features is m, it will always be chosen when the number of specified features is larger than m. We refer to this property as the "monotonic" property of feature selection. In this work, we argue that it is important to develop efficient algorithms for non-monotonic feature selection. To this end, we develop an algorithm for non-monotonic feature selection that approximates the related combinatorial optimization problem by a Multiple Kernel Learning (MKL) problem. We also present a strategy that derives a discrete solution from the approximate solution of MKL, and show the performance guarantee for the derived discrete solution when compared to the global optimal solution for the related combinatorial optimization problem. An empirical study with a number of benchmark data sets indicates the promising performance of the proposed framework compared with several state-of-the-art approaches for feature selection.
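The "monotonic" property described in the abstract can be illustrated with a minimal sketch of the conventional two-step approach (rank once, then keep the top m). This is not the paper's MKL-based algorithm; the function names, feature names, and score values below are hypothetical and chosen only for illustration.

```python
from typing import Dict, List


def top_m_by_score(scores: Dict[str, float], m: int) -> List[str]:
    """Conventional two-step selection: rank features by a score that
    does not depend on m, then keep the m highest-ranked features."""
    if not 0 <= m <= len(scores):
        raise ValueError("m must be between 0 and the number of features")
    # Sort by descending score; break ties by name so the ranking is stable.
    ranked = sorted(scores, key=lambda name: (-scores[name], name))
    return ranked[:m]


def is_monotonic(scores: Dict[str, float]) -> bool:
    """Check that the set selected for m is contained in the set for m + 1."""
    selections = [set(top_m_by_score(scores, m)) for m in range(len(scores) + 1)]
    return all(small <= large for small, large in zip(selections, selections[1:]))


# Hypothetical relevance scores for four features (illustrative values only).
scores = {"f1": 0.9, "f2": 0.4, "f3": 0.7, "f4": 0.1}

print(top_m_by_score(scores, 2))  # ['f1', 'f3']
print(top_m_by_score(scores, 3))  # ['f1', 'f3', 'f2']
print(is_monotonic(scores))       # True
```

Because the ranking is fixed before m is known, every feature chosen for m = 2 is also chosen for m = 3. A non-monotonic method would be free to return a different, non-nested subset for each m.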

Original language	English (US)
Title of host publication	Proceedings of the 26th Annual International Conference on Machine Learning, ICML'09
DOIs	https://doi.org/10.1145/1553374.1553520
State	Published - 2009
Event	26th Annual International Conference on Machine Learning, ICML'09 - Montreal, QC, Canada Duration: Jun 14 2009 → Jun 18 2009

Publication series

Name	ACM International Conference Proceeding Series
Volume	382

Other

Other	26th Annual International Conference on Machine Learning, ICML'09
Country/Territory	Canada
City	Montreal, QC
Period	6/14/09 → 6/18/09

ASJC Scopus subject areas

Software
Human-Computer Interaction
Computer Vision and Pattern Recognition
Computer Networks and Communications

Access to Document

10.1145/1553374.1553520

Cite this

@inproceedings{53cfaa736f924a52b69b21598cd3606f,

title = "Non-monotonic feature selection",

abstract = "We consider the problem of selecting a subset of m most informative features where m is the number of required features. This feature selection problem is essentially a combinatorial optimization problem, and is usually solved by an approximation. Conventional feature selection methods address the computational challenge in two steps: (a) ranking all the features by certain scores that are usually computed independently from the number of specified features m, and (b) selecting the top m ranked features. One major shortcoming of these approaches is that if a feature f is chosen when the number of specified features is m, it will always be chosen when the number of specified features is larger than m. We refer to this property as the {"}monotonic{"} property of feature selection. In this work, we argue that it is important to develop efficient algorithms for non-monotonic feature selection. To this end, we develop an algorithm for non-monotonic feature selection that approximates the related combinatorial optimization problem by a Multiple Kernel Learning (MKL) problem. We also present a strategy that derives a discrete solution from the approximate solution of MKL, and show the performance guarantee for the derived discrete solution when compared to the global optimal solution for the related combinatorial optimization problem. An empirical study with a number of benchmark data sets indicates the promising performance of the proposed framework compared with several state-of-the-art approaches for feature selection.",

author = "Zenglin Xu and Rong Jin and Jieping Ye and Lyu, {Michael R.} and Irwin King",

year = "2009",

doi = "10.1145/1553374.1553520",

language = "English (US)",

isbn = "9781605585161",

series = "ACM International Conference Proceeding Series",

booktitle = "Proceedings of the 26th Annual International Conference on Machine Learning, ICML'09",

note = "26th Annual International Conference on Machine Learning, ICML'09 ; Conference date: 14-06-2009 Through 18-06-2009",

}

TY - GEN

T1 - Non-monotonic feature selection

AU - Xu, Zenglin

AU - Jin, Rong

AU - Ye, Jieping

AU - Lyu, Michael R.

AU - King, Irwin

PY - 2009

Y1 - 2009

N2 - We consider the problem of selecting a subset of m most informative features where m is the number of required features. This feature selection problem is essentially a combinatorial optimization problem, and is usually solved by an approximation. Conventional feature selection methods address the computational challenge in two steps: (a) ranking all the features by certain scores that are usually computed independently from the number of specified features m, and (b) selecting the top m ranked features. One major shortcoming of these approaches is that if a feature f is chosen when the number of specified features is m, it will always be chosen when the number of specified features is larger than m. We refer to this property as the "monotonic" property of feature selection. In this work, we argue that it is important to develop efficient algorithms for non-monotonic feature selection. To this end, we develop an algorithm for non-monotonic feature selection that approximates the related combinatorial optimization problem by a Multiple Kernel Learning (MKL) problem. We also present a strategy that derives a discrete solution from the approximate solution of MKL, and show the performance guarantee for the derived discrete solution when compared to the global optimal solution for the related combinatorial optimization problem. An empirical study with a number of benchmark data sets indicates the promising performance of the proposed framework compared with several state-of-the-art approaches for feature selection.

AB - We consider the problem of selecting a subset of m most informative features where m is the number of required features. This feature selection problem is essentially a combinatorial optimization problem, and is usually solved by an approximation. Conventional feature selection methods address the computational challenge in two steps: (a) ranking all the features by certain scores that are usually computed independently from the number of specified features m, and (b) selecting the top m ranked features. One major shortcoming of these approaches is that if a feature f is chosen when the number of specified features is m, it will always be chosen when the number of specified features is larger than m. We refer to this property as the "monotonic" property of feature selection. In this work, we argue that it is important to develop efficient algorithms for non-monotonic feature selection. To this end, we develop an algorithm for non-monotonic feature selection that approximates the related combinatorial optimization problem by a Multiple Kernel Learning (MKL) problem. We also present a strategy that derives a discrete solution from the approximate solution of MKL, and show the performance guarantee for the derived discrete solution when compared to the global optimal solution for the related combinatorial optimization problem. An empirical study with a number of benchmark data sets indicates the promising performance of the proposed framework compared with several state-of-the-art approaches for feature selection.

UR - http://www.scopus.com/inward/record.url?scp=70049098964&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=70049098964&partnerID=8YFLogxK

U2 - 10.1145/1553374.1553520

DO - 10.1145/1553374.1553520

M3 - Conference contribution

AN - SCOPUS:70049098964

SN - 9781605585161

T3 - ACM International Conference Proceeding Series

BT - Proceedings of the 26th Annual International Conference on Machine Learning, ICML'09

T2 - 26th Annual International Conference on Machine Learning, ICML'09

Y2 - 14 June 2009 through 18 June 2009

ER -
