Abstract

Feature selection is widely used in preparing high- dimensional data for effective data mining. Increasingly popular social media data presents new challenges to feature selection. Social media data consists of (1) tra- ditional high-dimensional, attribute-value data such as posts, tweets, comments, and images, and (2) linked data that describes the relationships between social me- dia users as well as who post the posts, etc. The nature of social media also determines that its data is mas- sive, noisy, and incomplete, which exacerbates the al- ready challenging problem of feature selection. In this paper, we illustrate the differences between attribute- value data and social media data, investigate if linked data can be exploited in a new feature selection frame- work by taking advantage of social science theories, ex- Tensively evaluate the effects of user-user and user-post relationships manifested in linked data on feature selec- Tion, and discuss some research issues for future work.

Original languageEnglish (US)
Title of host publicationProceedings of the 12th SIAM International Conference on Data Mining, SDM 2012
Pages118-128
Number of pages11
StatePublished - 2012
Event12th SIAM International Conference on Data Mining, SDM 2012 - Anaheim, CA, United States
Duration: Apr 26 2012Apr 28 2012

Other

Other12th SIAM International Conference on Data Mining, SDM 2012
CountryUnited States
CityAnaheim, CA
Period4/26/124/28/12

Fingerprint

Feature extraction
Social sciences
Data mining

ASJC Scopus subject areas

  • Computer Science Applications

Cite this

Tang, J., & Liu, H. (2012). Feature selection with linked data in social media. In Proceedings of the 12th SIAM International Conference on Data Mining, SDM 2012 (pp. 118-128)

Feature selection with linked data in social media. / Tang, Jiliang; Liu, Huan.

Proceedings of the 12th SIAM International Conference on Data Mining, SDM 2012. 2012. p. 118-128.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Tang, J & Liu, H 2012, Feature selection with linked data in social media. in Proceedings of the 12th SIAM International Conference on Data Mining, SDM 2012. pp. 118-128, 12th SIAM International Conference on Data Mining, SDM 2012, Anaheim, CA, United States, 4/26/12.
Tang J, Liu H. Feature selection with linked data in social media. In Proceedings of the 12th SIAM International Conference on Data Mining, SDM 2012. 2012. p. 118-128
Tang, Jiliang ; Liu, Huan. / Feature selection with linked data in social media. Proceedings of the 12th SIAM International Conference on Data Mining, SDM 2012. 2012. pp. 118-128
@inproceedings{8c732c98b1a4487697c9552cffaa370d,
title = "Feature selection with linked data in social media",
abstract = "Feature selection is widely used in preparing high- dimensional data for effective data mining. Increasingly popular social media data presents new challenges to feature selection. Social media data consists of (1) tra- ditional high-dimensional, attribute-value data such as posts, tweets, comments, and images, and (2) linked data that describes the relationships between social me- dia users as well as who post the posts, etc. The nature of social media also determines that its data is mas- sive, noisy, and incomplete, which exacerbates the al- ready challenging problem of feature selection. In this paper, we illustrate the differences between attribute- value data and social media data, investigate if linked data can be exploited in a new feature selection frame- work by taking advantage of social science theories, ex- Tensively evaluate the effects of user-user and user-post relationships manifested in linked data on feature selec- Tion, and discuss some research issues for future work.",
author = "Jiliang Tang and Huan Liu",
year = "2012",
language = "English (US)",
isbn = "9781611972320",
pages = "118--128",
booktitle = "Proceedings of the 12th SIAM International Conference on Data Mining, SDM 2012",

}

TY - GEN

T1 - Feature selection with linked data in social media

AU - Tang, Jiliang

AU - Liu, Huan

PY - 2012

Y1 - 2012

N2 - Feature selection is widely used in preparing high- dimensional data for effective data mining. Increasingly popular social media data presents new challenges to feature selection. Social media data consists of (1) tra- ditional high-dimensional, attribute-value data such as posts, tweets, comments, and images, and (2) linked data that describes the relationships between social me- dia users as well as who post the posts, etc. The nature of social media also determines that its data is mas- sive, noisy, and incomplete, which exacerbates the al- ready challenging problem of feature selection. In this paper, we illustrate the differences between attribute- value data and social media data, investigate if linked data can be exploited in a new feature selection frame- work by taking advantage of social science theories, ex- Tensively evaluate the effects of user-user and user-post relationships manifested in linked data on feature selec- Tion, and discuss some research issues for future work.

AB - Feature selection is widely used in preparing high- dimensional data for effective data mining. Increasingly popular social media data presents new challenges to feature selection. Social media data consists of (1) tra- ditional high-dimensional, attribute-value data such as posts, tweets, comments, and images, and (2) linked data that describes the relationships between social me- dia users as well as who post the posts, etc. The nature of social media also determines that its data is mas- sive, noisy, and incomplete, which exacerbates the al- ready challenging problem of feature selection. In this paper, we illustrate the differences between attribute- value data and social media data, investigate if linked data can be exploited in a new feature selection frame- work by taking advantage of social science theories, ex- Tensively evaluate the effects of user-user and user-post relationships manifested in linked data on feature selec- Tion, and discuss some research issues for future work.

UR - http://www.scopus.com/inward/record.url?scp=84880191846&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84880191846&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84880191846

SN - 9781611972320

SP - 118

EP - 128

BT - Proceedings of the 12th SIAM International Conference on Data Mining, SDM 2012

ER -