Completing missing views for multiple sources of web media

Shankara Subramanya, Zheshen Wang, Baoxin Li, Huan Liu

Research output: Contribution to journalReview article

3 Scopus citations

Abstract

Combining multiple data sources, each with its own features, to achieve optimal inference has received a lot of attention in recent years. In inference from multiple data sources, each source can be thought of as providing one view of the underlying object. In general, different views may provide complementary information for the inference task. However, often not all the views are available all the time for the available instances in an application. In this paper, we propose a view completion approach based on canonical correlation analysis that heuristically predicts the missing views and further ranks all within-view features, through learning the intrinsic correlation among the views from training set. We evaluate our approach and compare it with existing approaches in the literature, using web page classification and photo tag recommendation as case studies. Experiments demonstrate the improved performance of the proposed approach. The results suggest that the work has great potential for inference problems with multiple information sources.

Original languageEnglish (US)
Pages (from-to)23-44
Number of pages22
JournalInternational Journal of Data Mining, Modelling and Management
Volume1
Issue number1
DOIs
StatePublished - 2009

Keywords

  • CCA
  • Canonical correlation analysis
  • Feature selection
  • View completion

ASJC Scopus subject areas

  • Management Information Systems
  • Modeling and Simulation
  • Computer Science Applications

Fingerprint Dive into the research topics of 'Completing missing views for multiple sources of web media'. Together they form a unique fingerprint.

  • Cite this