Abstract

The last few years have witnessed rapid development of dictionary learning approaches for a range of visual computing tasks, driven largely by their use in new techniques based on sparse representation. Compared with conventional techniques that employ manually defined dictionaries, such as those of the Fourier transform and the wavelet transform, dictionary learning aims to obtain a dictionary adaptively from the data so as to support an optimal sparse representation of that data. In contrast to conventional clustering algorithms such as K-means, where a data point is associated with only one cluster center, a dictionary-based representation allows a data point to be associated with a small set of dictionary atoms. Dictionary learning thus provides a more flexible representation of the data and has the potential to capture more relevant features from the original feature space. One of the early algorithms for dictionary learning is K-SVD. In recent years, many variations and extensions of K-SVD, as well as other new algorithms, have been proposed; some aim to add discriminative capability to the dictionary, and others attempt to model the relationships among multiple dictionaries. One prominent application of dictionary learning is the general field of visual computing, where long-standing challenges have seen promising new solutions based on sparse representation with learned dictionaries. Offering a timely review of recent advances in dictionary learning in visual computing, with an emphasis on papers published after 2008, this book provides a systematic presentation of the general methodologies, specific algorithms, and example applications for readers who want a quick start on the subject.
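
The contrast with K-means drawn in the abstract can be made concrete with a standard formulation. This is only a sketch in notation assumed here, not taken from the book: $Y$ holds the data points $y_i$ as columns, $D$ holds the $K$ dictionary atoms as columns, $X$ holds the coefficient vectors $x_i$ as columns, $T$ is the allowed number of nonzero coefficients per data point, and $e_k$ is the $k$-th standard basis vector.

```latex
% Sparse dictionary learning: each y_i is approximated by at most T atoms of D.
\min_{D,\,X} \; \| Y - D X \|_F^2
\quad \text{subject to} \quad \| x_i \|_0 \le T \;\; \text{for all } i

% K-means as the extreme case: each x_i selects exactly one atom
% (the cluster center) with coefficient 1.
\min_{D,\,X} \; \| Y - D X \|_F^2
\quad \text{subject to} \quad x_i \in \{ e_1, \dots, e_K \} \;\; \text{for all } i
```

Under these assumptions, the only difference between the two problems is the constraint on $x_i$: relaxing "exactly one atom with unit weight" to "at most $T$ atoms with free weights" is what lets a data point be associated with a small set of atoms rather than a single center.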

Original language: English (US)
Title of host publication: Synthesis Lectures on Image, Video, and Multimedia Processing
Publisher: Morgan and Claypool Publishers
Pages: 1-151
Number of pages: 151
Volume: 18
ISBN (Print): 9781627057776
DOI: https://doi.org/10.2200/S00640ED1V01Y201504IVM018
Publication status: Published - 2015

Publication series

Name: Synthesis Lectures on Image, Video, and Multimedia Processing
Volume: 18
ISSN (Print): 1559-8136
ISSN (Electronic): 1559-8144

Keywords

  • Background Subtraction
  • Blind Source Separation
  • Compressive Sensing
  • Dictionary Learning
  • Face Recognition
  • Image Compression
  • Image Demosaicing
  • Image Denoising
  • Image Inpainting
  • Image Segmentation
  • Image Super-resolution
  • Matrix Completion
  • Saliency Detection
  • Sparse Coding
  • Sparse Representation
  • Visual Tracking

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Signal Processing
  • Computer Graphics and Computer-Aided Design
  • Atomic and Molecular Physics, and Optics

Cite this

Zhang, Q., & Li, B. (2015). Dictionary learning in visual computing. In Synthesis Lectures on Image, Video, and Multimedia Processing (Vol. 18, pp. 1-151). Morgan and Claypool Publishers. https://doi.org/10.2200/S00640ED1V01Y201504IVM018