Dampster-shafer evidence theory based multi-characteristics fusion for clustering evaluation

Shihong Yue; Teresa Wu; Yamin Wang; Kai Zhang; Weixia Liu

doi:10.1007/978-3-642-16248-0_70

Dampster-shafer evidence theory based multi-characteristics fusion for clustering evaluation

Shihong Yue, Teresa Wu, Yamin Wang, Kai Zhang, Weixia Liu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

3 Scopus citations

Abstract

Clustering is a widely used unsupervised learning method to group data with similar characteristics. The performance of the clustering method can be in general evaluated through some validity indices. However, most validity indices are designed for the specific algorithms along with specific structure of data space. Moreover, these indices consist of a few within- and between- clustering distance functions. The applicability of these indices heavily relies on the correctness of combining these functions. In this research, we first summarize three common characteristics of any clustering evaluation: (1) the clustering outcome can be evaluated by a group of validity indices if some efficient validity indices are available, (2) the clustering outcome can be measured by an independent intra-cluster distance function and (3) the clustering outcome can be measured by the neighborhood based functions. Considering the complementary and unstable natures among the clustering evaluation, we then apply Dampster-Shafter (D-S) Evidence Theory to fuse the three characteristics to generate a new index, termed fused Multiple Characteristic Indices (fMCI). The fMCI generally is capable to evaluate clustering outcomes of arbitrary clustering methods associated with more complex structures of data space. We conduct a number of experiments to demonstrate that the fMCI is applicable to evaluate different clustering algorithms on different datasets and the fMCI can achieve more accurate and robust clustering evaluation comparing to existing indices.

Original language	English (US)
Title of host publication	Rough Set and Knowledge Technology - 5th International Conference, RSKT 2010, Proceedings
Pages	499-519
Number of pages	21
DOIs	https://doi.org/10.1007/978-3-642-16248-0_70
State	Published - 2010
Event	5th International Conference on Rough Set and Knowledge Technology, RSKT 2010 - Beijing, China Duration: Oct 15 2010 → Oct 17 2010

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	6401 LNAI
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Other

Other	5th International Conference on Rough Set and Knowledge Technology, RSKT 2010
Country/Territory	China
City	Beijing
Period	10/15/10 → 10/17/10

Keywords

Dampster-Shafer evidence theory
Validity index
clustering algorithm
data structure

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-642-16248-0_70

Cite this

Yue, S., Wu, T., Wang, Y., Zhang, K., & Liu, W. (2010). Dampster-shafer evidence theory based multi-characteristics fusion for clustering evaluation. In Rough Set and Knowledge Technology - 5th International Conference, RSKT 2010, Proceedings (pp. 499-519). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6401 LNAI). https://doi.org/10.1007/978-3-642-16248-0_70

Dampster-shafer evidence theory based multi-characteristics fusion for clustering evaluation. / Yue, Shihong; Wu, Teresa; Wang, Yamin et al.
Rough Set and Knowledge Technology - 5th International Conference, RSKT 2010, Proceedings. 2010. p. 499-519 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6401 LNAI).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Yue, S, Wu, T, Wang, Y, Zhang, K & Liu, W 2010, Dampster-shafer evidence theory based multi-characteristics fusion for clustering evaluation. in Rough Set and Knowledge Technology - 5th International Conference, RSKT 2010, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 6401 LNAI, pp. 499-519, 5th International Conference on Rough Set and Knowledge Technology, RSKT 2010, Beijing, China, 10/15/10. https://doi.org/10.1007/978-3-642-16248-0_70

Yue S, Wu T, Wang Y, Zhang K, Liu W. Dampster-shafer evidence theory based multi-characteristics fusion for clustering evaluation. In Rough Set and Knowledge Technology - 5th International Conference, RSKT 2010, Proceedings. 2010. p. 499-519. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-642-16248-0_70

Yue, Shihong ; Wu, Teresa ; Wang, Yamin et al. / Dampster-shafer evidence theory based multi-characteristics fusion for clustering evaluation. Rough Set and Knowledge Technology - 5th International Conference, RSKT 2010, Proceedings. 2010. pp. 499-519 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{da86130b5390435f983a6d62b621517c,

title = "Dampster-shafer evidence theory based multi-characteristics fusion for clustering evaluation",

abstract = "Clustering is a widely used unsupervised learning method to group data with similar characteristics. The performance of the clustering method can be in general evaluated through some validity indices. However, most validity indices are designed for the specific algorithms along with specific structure of data space. Moreover, these indices consist of a few within- and between- clustering distance functions. The applicability of these indices heavily relies on the correctness of combining these functions. In this research, we first summarize three common characteristics of any clustering evaluation: (1) the clustering outcome can be evaluated by a group of validity indices if some efficient validity indices are available, (2) the clustering outcome can be measured by an independent intra-cluster distance function and (3) the clustering outcome can be measured by the neighborhood based functions. Considering the complementary and unstable natures among the clustering evaluation, we then apply Dampster-Shafter (D-S) Evidence Theory to fuse the three characteristics to generate a new index, termed fused Multiple Characteristic Indices (fMCI). The fMCI generally is capable to evaluate clustering outcomes of arbitrary clustering methods associated with more complex structures of data space. We conduct a number of experiments to demonstrate that the fMCI is applicable to evaluate different clustering algorithms on different datasets and the fMCI can achieve more accurate and robust clustering evaluation comparing to existing indices.",

keywords = "Dampster-Shafer evidence theory, Validity index, clustering algorithm, data structure",

author = "Shihong Yue and Teresa Wu and Yamin Wang and Kai Zhang and Weixia Liu",

year = "2010",

doi = "10.1007/978-3-642-16248-0_70",

language = "English (US)",

isbn = "3642162479",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

pages = "499--519",

booktitle = "Rough Set and Knowledge Technology - 5th International Conference, RSKT 2010, Proceedings",

note = "5th International Conference on Rough Set and Knowledge Technology, RSKT 2010 ; Conference date: 15-10-2010 Through 17-10-2010",

}

TY - GEN

T1 - Dampster-shafer evidence theory based multi-characteristics fusion for clustering evaluation

AU - Yue, Shihong

AU - Wu, Teresa

AU - Wang, Yamin

AU - Zhang, Kai

AU - Liu, Weixia

PY - 2010

Y1 - 2010

N2 - Clustering is a widely used unsupervised learning method to group data with similar characteristics. The performance of the clustering method can be in general evaluated through some validity indices. However, most validity indices are designed for the specific algorithms along with specific structure of data space. Moreover, these indices consist of a few within- and between- clustering distance functions. The applicability of these indices heavily relies on the correctness of combining these functions. In this research, we first summarize three common characteristics of any clustering evaluation: (1) the clustering outcome can be evaluated by a group of validity indices if some efficient validity indices are available, (2) the clustering outcome can be measured by an independent intra-cluster distance function and (3) the clustering outcome can be measured by the neighborhood based functions. Considering the complementary and unstable natures among the clustering evaluation, we then apply Dampster-Shafter (D-S) Evidence Theory to fuse the three characteristics to generate a new index, termed fused Multiple Characteristic Indices (fMCI). The fMCI generally is capable to evaluate clustering outcomes of arbitrary clustering methods associated with more complex structures of data space. We conduct a number of experiments to demonstrate that the fMCI is applicable to evaluate different clustering algorithms on different datasets and the fMCI can achieve more accurate and robust clustering evaluation comparing to existing indices.

AB - Clustering is a widely used unsupervised learning method to group data with similar characteristics. The performance of the clustering method can be in general evaluated through some validity indices. However, most validity indices are designed for the specific algorithms along with specific structure of data space. Moreover, these indices consist of a few within- and between- clustering distance functions. The applicability of these indices heavily relies on the correctness of combining these functions. In this research, we first summarize three common characteristics of any clustering evaluation: (1) the clustering outcome can be evaluated by a group of validity indices if some efficient validity indices are available, (2) the clustering outcome can be measured by an independent intra-cluster distance function and (3) the clustering outcome can be measured by the neighborhood based functions. Considering the complementary and unstable natures among the clustering evaluation, we then apply Dampster-Shafter (D-S) Evidence Theory to fuse the three characteristics to generate a new index, termed fused Multiple Characteristic Indices (fMCI). The fMCI generally is capable to evaluate clustering outcomes of arbitrary clustering methods associated with more complex structures of data space. We conduct a number of experiments to demonstrate that the fMCI is applicable to evaluate different clustering algorithms on different datasets and the fMCI can achieve more accurate and robust clustering evaluation comparing to existing indices.

KW - Dampster-Shafer evidence theory

KW - Validity index

KW - clustering algorithm

KW - data structure

UR - http://www.scopus.com/inward/record.url?scp=78349277289&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78349277289&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-16248-0_70

DO - 10.1007/978-3-642-16248-0_70

M3 - Conference contribution

AN - SCOPUS:78349277289

SN - 3642162479

SN - 9783642162473

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 499

EP - 519

BT - Rough Set and Knowledge Technology - 5th International Conference, RSKT 2010, Proceedings

T2 - 5th International Conference on Rough Set and Knowledge Technology, RSKT 2010

Y2 - 15 October 2010 through 17 October 2010

ER -

Dampster-shafer evidence theory based multi-characteristics fusion for clustering evaluation

Abstract

Publication series

Other

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this