Toward integrating feature selection algorithms for classification and clustering

Huan Liu, Lei Yu

Research output: Contribution to journalArticle

1683 Citations (Scopus)

Abstract

This paper introduces concepts and algorithms of feature selection, surveys existing feature selection algorithms for classification and clustering, groups and compares different algorithms with a categorizing framework based on search strategies, evaluation criteria, and data mining tasks, reveals unattempted combinations, and provides guidelines in selecting feature selection algorithms. With the categorizing framework, we continue our efforts toward building an integrated system for intelligent feature selection. A unifying platform is proposed as an intermediate step. An illustrative example is presented to show how existing feature selection algorithms can be integrated into a meta algorithm that can take advantage of individual algorithms. An added advantage of doing so is to help a user employ a suitable algorithm without knowing details of each algorithm. Some real-world applications are included to demonstrate the use of feature selection in data mining. We conclude this work by identifying trends and challenges of feature selection research and development.

Original languageEnglish (US)
Pages (from-to)491-502
Number of pages12
JournalIEEE Transactions on Knowledge and Data Engineering
Volume17
Issue number4
DOIs
StatePublished - Apr 2005

Fingerprint

Feature extraction
Data mining

Keywords

  • Categorizing framework
  • Classification
  • Clustering
  • Feature selection
  • Real-world applications
  • Unifying platform

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Electrical and Electronic Engineering
  • Artificial Intelligence
  • Information Systems

Cite this

Toward integrating feature selection algorithms for classification and clustering. / Liu, Huan; Yu, Lei.

In: IEEE Transactions on Knowledge and Data Engineering, Vol. 17, No. 4, 04.2005, p. 491-502.

Research output: Contribution to journalArticle

@article{aac615331c0944ad89f2613d0ee84ecf,
title = "Toward integrating feature selection algorithms for classification and clustering",
abstract = "This paper introduces concepts and algorithms of feature selection, surveys existing feature selection algorithms for classification and clustering, groups and compares different algorithms with a categorizing framework based on search strategies, evaluation criteria, and data mining tasks, reveals unattempted combinations, and provides guidelines in selecting feature selection algorithms. With the categorizing framework, we continue our efforts toward building an integrated system for intelligent feature selection. A unifying platform is proposed as an intermediate step. An illustrative example is presented to show how existing feature selection algorithms can be integrated into a meta algorithm that can take advantage of individual algorithms. An added advantage of doing so is to help a user employ a suitable algorithm without knowing details of each algorithm. Some real-world applications are included to demonstrate the use of feature selection in data mining. We conclude this work by identifying trends and challenges of feature selection research and development.",
keywords = "Categorizing framework, Classification, Clustering, Feature selection, Real-world applications, Unifying platform",
author = "Huan Liu and Lei Yu",
year = "2005",
month = "4",
doi = "10.1109/TKDE.2005.66",
language = "English (US)",
volume = "17",
pages = "491--502",
journal = "IEEE Transactions on Knowledge and Data Engineering",
issn = "1041-4347",
publisher = "IEEE Computer Society",
number = "4",

}

TY - JOUR

T1 - Toward integrating feature selection algorithms for classification and clustering

AU - Liu, Huan

AU - Yu, Lei

PY - 2005/4

Y1 - 2005/4

N2 - This paper introduces concepts and algorithms of feature selection, surveys existing feature selection algorithms for classification and clustering, groups and compares different algorithms with a categorizing framework based on search strategies, evaluation criteria, and data mining tasks, reveals unattempted combinations, and provides guidelines in selecting feature selection algorithms. With the categorizing framework, we continue our efforts toward building an integrated system for intelligent feature selection. A unifying platform is proposed as an intermediate step. An illustrative example is presented to show how existing feature selection algorithms can be integrated into a meta algorithm that can take advantage of individual algorithms. An added advantage of doing so is to help a user employ a suitable algorithm without knowing details of each algorithm. Some real-world applications are included to demonstrate the use of feature selection in data mining. We conclude this work by identifying trends and challenges of feature selection research and development.

AB - This paper introduces concepts and algorithms of feature selection, surveys existing feature selection algorithms for classification and clustering, groups and compares different algorithms with a categorizing framework based on search strategies, evaluation criteria, and data mining tasks, reveals unattempted combinations, and provides guidelines in selecting feature selection algorithms. With the categorizing framework, we continue our efforts toward building an integrated system for intelligent feature selection. A unifying platform is proposed as an intermediate step. An illustrative example is presented to show how existing feature selection algorithms can be integrated into a meta algorithm that can take advantage of individual algorithms. An added advantage of doing so is to help a user employ a suitable algorithm without knowing details of each algorithm. Some real-world applications are included to demonstrate the use of feature selection in data mining. We conclude this work by identifying trends and challenges of feature selection research and development.

KW - Categorizing framework

KW - Classification

KW - Clustering

KW - Feature selection

KW - Real-world applications

KW - Unifying platform

UR - http://www.scopus.com/inward/record.url?scp=17044405923&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=17044405923&partnerID=8YFLogxK

U2 - 10.1109/TKDE.2005.66

DO - 10.1109/TKDE.2005.66

M3 - Article

AN - SCOPUS:17044405923

VL - 17

SP - 491

EP - 502

JO - IEEE Transactions on Knowledge and Data Engineering

JF - IEEE Transactions on Knowledge and Data Engineering

SN - 1041-4347

IS - 4

ER -