Feature selection for classification

M. Dash, Huan Liu

Research output: Contribution to journalArticle

1902 Citations (Scopus)

Abstract

Feature selection has been the focus of interest for quite some time and much work has been done. With the creation of huge databases and the consequent requirements for good machine learning techniques, new problems arise and novel approaches to feature selection are in demand. This survey is a comprehensive overview of many existing methods from the 1970's to the present. It identifies four steps of a typical feature selection method, and categorizes the different existing methods in terms of generation procedures and evaluation functions, and reveals hitherto unattempted combinations of generation procedures and evaluation functions. Representative methods are chosen from each category for detailed explanation and discussion via example. Benchmark datasets with different characteristics are used for comparative study. The strengths and weaknesses of different methods are explained. Guidelines for applying feature selection methods are given based on data types and domain characteristics. This survey identifies the future research areas in feature selection, introduces newcomers to this field, and paves the way for practitioners who search for suitable methods for solving domain-specific real-world applications.

Original languageEnglish (US)
Pages (from-to)131-156
Number of pages26
JournalIntelligent Data Analysis
Volume1
Issue number3
DOIs
StatePublished - 1997
Externally publishedYes

Fingerprint

Feature Selection
Feature extraction
Function evaluation
Evaluation Function
Learning systems
Real-world Applications
Comparative Study
Machine Learning
Benchmark
Requirements

Keywords

  • Classification
  • Feature selection
  • Framework

ASJC Scopus subject areas

  • Artificial Intelligence
  • Theoretical Computer Science
  • Computer Vision and Pattern Recognition

Cite this

Feature selection for classification. / Dash, M.; Liu, Huan.

In: Intelligent Data Analysis, Vol. 1, No. 3, 1997, p. 131-156.

Research output: Contribution to journalArticle

Dash, M. ; Liu, Huan. / Feature selection for classification. In: Intelligent Data Analysis. 1997 ; Vol. 1, No. 3. pp. 131-156.
@article{7cee3d47ab9c42afa0831ca4232aa803,
title = "Feature selection for classification",
abstract = "Feature selection has been the focus of interest for quite some time and much work has been done. With the creation of huge databases and the consequent requirements for good machine learning techniques, new problems arise and novel approaches to feature selection are in demand. This survey is a comprehensive overview of many existing methods from the 1970's to the present. It identifies four steps of a typical feature selection method, and categorizes the different existing methods in terms of generation procedures and evaluation functions, and reveals hitherto unattempted combinations of generation procedures and evaluation functions. Representative methods are chosen from each category for detailed explanation and discussion via example. Benchmark datasets with different characteristics are used for comparative study. The strengths and weaknesses of different methods are explained. Guidelines for applying feature selection methods are given based on data types and domain characteristics. This survey identifies the future research areas in feature selection, introduces newcomers to this field, and paves the way for practitioners who search for suitable methods for solving domain-specific real-world applications.",
keywords = "Classification, Feature selection, Framework",
author = "M. Dash and Huan Liu",
year = "1997",
doi = "10.3233/IDA-1997-1302",
language = "English (US)",
volume = "1",
pages = "131--156",
journal = "Intelligent Data Analysis",
issn = "1088-467X",
publisher = "IOS Press",
number = "3",

}

TY - JOUR

T1 - Feature selection for classification

AU - Dash, M.

AU - Liu, Huan

PY - 1997

Y1 - 1997

N2 - Feature selection has been the focus of interest for quite some time and much work has been done. With the creation of huge databases and the consequent requirements for good machine learning techniques, new problems arise and novel approaches to feature selection are in demand. This survey is a comprehensive overview of many existing methods from the 1970's to the present. It identifies four steps of a typical feature selection method, and categorizes the different existing methods in terms of generation procedures and evaluation functions, and reveals hitherto unattempted combinations of generation procedures and evaluation functions. Representative methods are chosen from each category for detailed explanation and discussion via example. Benchmark datasets with different characteristics are used for comparative study. The strengths and weaknesses of different methods are explained. Guidelines for applying feature selection methods are given based on data types and domain characteristics. This survey identifies the future research areas in feature selection, introduces newcomers to this field, and paves the way for practitioners who search for suitable methods for solving domain-specific real-world applications.

AB - Feature selection has been the focus of interest for quite some time and much work has been done. With the creation of huge databases and the consequent requirements for good machine learning techniques, new problems arise and novel approaches to feature selection are in demand. This survey is a comprehensive overview of many existing methods from the 1970's to the present. It identifies four steps of a typical feature selection method, and categorizes the different existing methods in terms of generation procedures and evaluation functions, and reveals hitherto unattempted combinations of generation procedures and evaluation functions. Representative methods are chosen from each category for detailed explanation and discussion via example. Benchmark datasets with different characteristics are used for comparative study. The strengths and weaknesses of different methods are explained. Guidelines for applying feature selection methods are given based on data types and domain characteristics. This survey identifies the future research areas in feature selection, introduces newcomers to this field, and paves the way for practitioners who search for suitable methods for solving domain-specific real-world applications.

KW - Classification

KW - Feature selection

KW - Framework

UR - http://www.scopus.com/inward/record.url?scp=0013326060&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0013326060&partnerID=8YFLogxK

U2 - 10.3233/IDA-1997-1302

DO - 10.3233/IDA-1997-1302

M3 - Article

AN - SCOPUS:0013326060

VL - 1

SP - 131

EP - 156

JO - Intelligent Data Analysis

JF - Intelligent Data Analysis

SN - 1088-467X

IS - 3

ER -