TY - GEN
T1 - Application of LSA space's dimension character in document multi-hierarchy clustering
AU - Liu, Yun Feng
AU - Qi, Huan
AU - Hu, Xiang En
AU - Cai, Zhi Qiang
AU - Dai, Jian Min
AU - Zhu, Li
PY - 2005/12/12
Y1 - 2005/12/12
N2 - In LSA space, dimensions corresponding to larger singular values reflect the general concepts of language elements, while dimensions corresponding to smaller singular values reflect the particular concepts of language elements. On this basis, different dimensions of the LSA space are adopted for document clustering at various concept granularities. In addition, in the LSA-based document clustering algorithm, better clustering results are obtained by taking the row vectors of the document self-indexing matrix as the objects to be clustered, instead of the low-dimensional document vectors.
AB - In LSA space, dimensions corresponding to larger singular values reflect the general concepts of language elements, while dimensions corresponding to smaller singular values reflect the particular concepts of language elements. On this basis, different dimensions of the LSA space are adopted for document clustering at various concept granularities. In addition, in the LSA-based document clustering algorithm, better clustering results are obtained by taking the row vectors of the document self-indexing matrix as the objects to be clustered, instead of the low-dimensional document vectors.
KW - Concept Granularity
KW - Document Multi-hierarchy Clustering
KW - Document Self-indexing Matrix
KW - Latent Semantic Analysis
UR - http://www.scopus.com/inward/record.url?scp=28444478040&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=28444478040&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:28444478040
SN - 078039092X
SN - 9780780390928
T3 - 2005 International Conference on Machine Learning and Cybernetics, ICMLC 2005
SP - 2384
EP - 2389
BT - 2005 International Conference on Machine Learning and Cybernetics, ICMLC 2005
T2 - International Conference on Machine Learning and Cybernetics, ICMLC 2005
Y2 - 18 August 2005 through 21 August 2005
ER -