TY - GEN
T1 - Quantifying features using false nearest neighbors
T2 - 23rd IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2011
AU - Filho, Jose Augusto Andrade
AU - Carvalho, Andre C P L F
AU - Mello, Rodrigo F.
AU - Alelyani, Salem
AU - Liu, Huan
PY - 2011/12/1
Y1 - 2011/12/1
N2 - Real-world datasets commonly present high-dimensional data, which in principle carries more information. However, more features do not always improve the performance of learning techniques. Furthermore, some features may be correlated or add unexpected noise, thereby reducing data clustering performance. This has motivated the development of feature selection methods, which find the most relevant subset of features to describe the data. In this work, we focus on the problem of unsupervised feature selection. The main goal is to define a method that identifies how many features to select after they have been sorted according to some criterion. This is done by means of the False Nearest Neighbors technique, which is rooted in chaos theory. Results show that this technique gives a good approximation of the number of features to select. Compared to other techniques, in most of the analyzed cases it maintains the quality of the generated partitions while selecting fewer features.
AB - Real-world datasets commonly present high-dimensional data, which in principle carries more information. However, more features do not always improve the performance of learning techniques. Furthermore, some features may be correlated or add unexpected noise, thereby reducing data clustering performance. This has motivated the development of feature selection methods, which find the most relevant subset of features to describe the data. In this work, we focus on the problem of unsupervised feature selection. The main goal is to define a method that identifies how many features to select after they have been sorted according to some criterion. This is done by means of the False Nearest Neighbors technique, which is rooted in chaos theory. Results show that this technique gives a good approximation of the number of features to select. Compared to other techniques, in most of the analyzed cases it maintains the quality of the generated partitions while selecting fewer features.
KW - Chaos Theory
KW - Clustering
KW - Machine Learning
KW - Unsupervised Feature Selection
UR - http://www.scopus.com/inward/record.url?scp=84862925201&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84862925201&partnerID=8YFLogxK
U2 - 10.1109/ICTAI.2011.170
DO - 10.1109/ICTAI.2011.170
M3 - Conference contribution
AN - SCOPUS:84862925201
SN - 9780769545967
T3 - Proceedings - International Conference on Tools with Artificial Intelligence, ICTAI
SP - 994
EP - 997
BT - Proceedings - 2011 23rd IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2011
Y2 - 7 November 2011 through 9 November 2011
ER -