Pairwise exemplar clustering

Yingzhen Yang; Xinqi Chu; Feng Liang; Thomas S. Huang

Pairwise exemplar clustering

Yingzhen Yang, Xinqi Chu, Feng Liang, Thomas S. Huang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Exemplar-based clustering methods have been extensively shown to be effective in many clustering problems. They adaptively determine the number of clusters and hold the appealing advantage of not requiring the estimation of latent parameters, which is otherwise difficult in case of complicated parametric model and high dimensionality of the data. However, modeling arbitrary underlying distribution of the data is still difficult for existing exemplar-based clustering methods. We present Pairwise Exemplar Clustering (PEC) to alleviate this problem by modeling the underlying cluster distributions more accurately with non-parametric kernel density estimation. Interpreting the clusters as classes from a supervised learning perspective, we search for an optimal partition of the data that balances two quantities: 1 the misclassification rate of the data partition for separating the clusters; 2 the sum of within-cluster dissimilarities for controlling the cluster size. The broadly used kernel form of cut turns out to be a special case of our formulation. Moreover, we optimize the corresponding objective function by a new efficient algorithm for message computation in a pairwise MRF. Experimental results on synthetic and real data demonstrate the effectiveness of our method.

Original language	English (US)
Title of host publication	AAAI-12 / IAAI-12 - Proceedings of the 26th AAAI Conference on Artificial Intelligence and the 24th Innovative Applications of Artificial Intelligence Conference
Pages	1204-1211
Number of pages	8
State	Published - 2012
Externally published	Yes
Event	26th AAAI Conference on Artificial Intelligence and the 24th Innovative Applications of Artificial Intelligence Conference, AAAI-12 / IAAI-12 - Toronto, ON, Canada Duration: Jul 22 2012 → Jul 26 2012

Publication series

Name	Proceedings of the National Conference on Artificial Intelligence
Volume	2

Other

Other	26th AAAI Conference on Artificial Intelligence and the 24th Innovative Applications of Artificial Intelligence Conference, AAAI-12 / IAAI-12
Country/Territory	Canada
City	Toronto, ON
Period	7/22/12 → 7/26/12

ASJC Scopus subject areas

Software
Artificial Intelligence

Cite this

Pairwise exemplar clustering. / Yang, Yingzhen; Chu, Xinqi; Liang, Feng et al.
AAAI-12 / IAAI-12 - Proceedings of the 26th AAAI Conference on Artificial Intelligence and the 24th Innovative Applications of Artificial Intelligence Conference. 2012. p. 1204-1211 (Proceedings of the National Conference on Artificial Intelligence; Vol. 2).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Yang, Y, Chu, X, Liang, F & Huang, TS 2012, Pairwise exemplar clustering. in AAAI-12 / IAAI-12 - Proceedings of the 26th AAAI Conference on Artificial Intelligence and the 24th Innovative Applications of Artificial Intelligence Conference. Proceedings of the National Conference on Artificial Intelligence, vol. 2, pp. 1204-1211, 26th AAAI Conference on Artificial Intelligence and the 24th Innovative Applications of Artificial Intelligence Conference, AAAI-12 / IAAI-12, Toronto, ON, Canada, 7/22/12.

@inproceedings{970040a5c89049d0be2f8418ef8f8eb2,

title = "Pairwise exemplar clustering",

abstract = "Exemplar-based clustering methods have been extensively shown to be effective in many clustering problems. They adaptively determine the number of clusters and hold the appealing advantage of not requiring the estimation of latent parameters, which is otherwise difficult in case of complicated parametric model and high dimensionality of the data. However, modeling arbitrary underlying distribution of the data is still difficult for existing exemplar-based clustering methods. We present Pairwise Exemplar Clustering (PEC) to alleviate this problem by modeling the underlying cluster distributions more accurately with non-parametric kernel density estimation. Interpreting the clusters as classes from a supervised learning perspective, we search for an optimal partition of the data that balances two quantities: 1 the misclassification rate of the data partition for separating the clusters; 2 the sum of within-cluster dissimilarities for controlling the cluster size. The broadly used kernel form of cut turns out to be a special case of our formulation. Moreover, we optimize the corresponding objective function by a new efficient algorithm for message computation in a pairwise MRF. Experimental results on synthetic and real data demonstrate the effectiveness of our method.",

author = "Yingzhen Yang and Xinqi Chu and Feng Liang and Huang, {Thomas S.}",

note = "Copyright: Copyright 2012 Elsevier B.V., All rights reserved.; 26th AAAI Conference on Artificial Intelligence and the 24th Innovative Applications of Artificial Intelligence Conference, AAAI-12 / IAAI-12 ; Conference date: 22-07-2012 Through 26-07-2012",

year = "2012",

language = "English (US)",

isbn = "9781577355687",

series = "Proceedings of the National Conference on Artificial Intelligence",

pages = "1204--1211",

booktitle = "AAAI-12 / IAAI-12 - Proceedings of the 26th AAAI Conference on Artificial Intelligence and the 24th Innovative Applications of Artificial Intelligence Conference",

}

TY - GEN

T1 - Pairwise exemplar clustering

AU - Yang, Yingzhen

AU - Chu, Xinqi

AU - Liang, Feng

AU - Huang, Thomas S.

PY - 2012

Y1 - 2012

N2 - Exemplar-based clustering methods have been extensively shown to be effective in many clustering problems. They adaptively determine the number of clusters and hold the appealing advantage of not requiring the estimation of latent parameters, which is otherwise difficult in case of complicated parametric model and high dimensionality of the data. However, modeling arbitrary underlying distribution of the data is still difficult for existing exemplar-based clustering methods. We present Pairwise Exemplar Clustering (PEC) to alleviate this problem by modeling the underlying cluster distributions more accurately with non-parametric kernel density estimation. Interpreting the clusters as classes from a supervised learning perspective, we search for an optimal partition of the data that balances two quantities: 1 the misclassification rate of the data partition for separating the clusters; 2 the sum of within-cluster dissimilarities for controlling the cluster size. The broadly used kernel form of cut turns out to be a special case of our formulation. Moreover, we optimize the corresponding objective function by a new efficient algorithm for message computation in a pairwise MRF. Experimental results on synthetic and real data demonstrate the effectiveness of our method.

AB - Exemplar-based clustering methods have been extensively shown to be effective in many clustering problems. They adaptively determine the number of clusters and hold the appealing advantage of not requiring the estimation of latent parameters, which is otherwise difficult in case of complicated parametric model and high dimensionality of the data. However, modeling arbitrary underlying distribution of the data is still difficult for existing exemplar-based clustering methods. We present Pairwise Exemplar Clustering (PEC) to alleviate this problem by modeling the underlying cluster distributions more accurately with non-parametric kernel density estimation. Interpreting the clusters as classes from a supervised learning perspective, we search for an optimal partition of the data that balances two quantities: 1 the misclassification rate of the data partition for separating the clusters; 2 the sum of within-cluster dissimilarities for controlling the cluster size. The broadly used kernel form of cut turns out to be a special case of our formulation. Moreover, we optimize the corresponding objective function by a new efficient algorithm for message computation in a pairwise MRF. Experimental results on synthetic and real data demonstrate the effectiveness of our method.

UR - http://www.scopus.com/inward/record.url?scp=84868293520&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84868293520&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84868293520

SN - 9781577355687

T3 - Proceedings of the National Conference on Artificial Intelligence

SP - 1204

EP - 1211

BT - AAAI-12 / IAAI-12 - Proceedings of the 26th AAAI Conference on Artificial Intelligence and the 24th Innovative Applications of Artificial Intelligence Conference

T2 - 26th AAAI Conference on Artificial Intelligence and the 24th Innovative Applications of Artificial Intelligence Conference, AAAI-12 / IAAI-12

Y2 - 22 July 2012 through 26 July 2012

ER -

Pairwise exemplar clustering

Abstract

Publication series

Other

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this