Abstract

Multimedia applications like face recognition and facial expression recognition inherently rely on the availability of a large amount of labeled data to train a robust recognition system. In order to induce a reliable classification model for a multimedia pattern recognition application, the data is typically labeled by human experts based on some domain knowledge. However, manual annotation of a large number of images is an expensive process in terms of time, labor and human expertise. This has led to the development of active learning algorithms, which automatically identify the salient instances from a given set of unlabeled data and are effective in reducing the human annotation effort to train a classification model. Further, to address the possible presence of multiple labeling oracles, there have been efforts towards a batch form of active learning, where a set of unlabeled images are selected simultaneously for labeling instead of a single image at a time. Existing algorithms on batch mode active learning concentrate only on the development of a batch selection criterion and assume that the batch size (number of samples to be queried from an unlabeled set) to be specified in advance. However, in multimedia applications like face/facial expression recognition, it is difficult to decide on a batch size in advance because of the dynamic nature of video streams. Further, multimedia applications like facial expression recognition involve a fuzzy label space because of the imprecision and the vagueness in the class label boundaries. This necessitates a BMAL framework, for fuzzy label problems. To address these fundamental challenges, we propose two novel BMAL techniques in this work: (i) a framework for dynamic batch mode active learning, which adaptively selects the batch size and the specific instances to be queried based on the complexity of the data stream being analyzed and (ii) a BMAL algorithm for fuzzy label classification problems. To the best of our knowledge, this is the first attempt to develop such techniques in the active learning literature.

Original languageEnglish (US)
Title of host publicationProceedings - 2012 IEEE International Symposium on Multimedia, ISM 2012
Pages489-490
Number of pages2
DOIs
StatePublished - 2012
Event14th IEEE International Symposium on Multimedia, ISM 2012 - Irvine, CA, United States
Duration: Dec 10 2012Dec 12 2012

Other

Other14th IEEE International Symposium on Multimedia, ISM 2012
CountryUnited States
CityIrvine, CA
Period12/10/1212/12/12

Fingerprint

Pattern recognition
Labels
Labeling
Face recognition
Learning algorithms
Problem-Based Learning
Availability
Personnel

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Computer Vision and Pattern Recognition
  • Software

Cite this

Chakraborty, S., Balasubramanian, V., & Panchanathan, S. (2012). Batch mode active learning for multimedia pattern recognition. In Proceedings - 2012 IEEE International Symposium on Multimedia, ISM 2012 (pp. 489-490). [6424714] https://doi.org/10.1109/ISM.2012.101

Batch mode active learning for multimedia pattern recognition. / Chakraborty, Shayok; Balasubramanian, Vineeth; Panchanathan, Sethuraman.

Proceedings - 2012 IEEE International Symposium on Multimedia, ISM 2012. 2012. p. 489-490 6424714.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chakraborty, S, Balasubramanian, V & Panchanathan, S 2012, Batch mode active learning for multimedia pattern recognition. in Proceedings - 2012 IEEE International Symposium on Multimedia, ISM 2012., 6424714, pp. 489-490, 14th IEEE International Symposium on Multimedia, ISM 2012, Irvine, CA, United States, 12/10/12. https://doi.org/10.1109/ISM.2012.101
Chakraborty S, Balasubramanian V, Panchanathan S. Batch mode active learning for multimedia pattern recognition. In Proceedings - 2012 IEEE International Symposium on Multimedia, ISM 2012. 2012. p. 489-490. 6424714 https://doi.org/10.1109/ISM.2012.101
Chakraborty, Shayok ; Balasubramanian, Vineeth ; Panchanathan, Sethuraman. / Batch mode active learning for multimedia pattern recognition. Proceedings - 2012 IEEE International Symposium on Multimedia, ISM 2012. 2012. pp. 489-490
@inproceedings{92c8b56341c04bdb9315b8da062e21af,
title = "Batch mode active learning for multimedia pattern recognition",
abstract = "Multimedia applications like face recognition and facial expression recognition inherently rely on the availability of a large amount of labeled data to train a robust recognition system. In order to induce a reliable classification model for a multimedia pattern recognition application, the data is typically labeled by human experts based on some domain knowledge. However, manual annotation of a large number of images is an expensive process in terms of time, labor and human expertise. This has led to the development of active learning algorithms, which automatically identify the salient instances from a given set of unlabeled data and are effective in reducing the human annotation effort to train a classification model. Further, to address the possible presence of multiple labeling oracles, there have been efforts towards a batch form of active learning, where a set of unlabeled images are selected simultaneously for labeling instead of a single image at a time. Existing algorithms on batch mode active learning concentrate only on the development of a batch selection criterion and assume that the batch size (number of samples to be queried from an unlabeled set) to be specified in advance. However, in multimedia applications like face/facial expression recognition, it is difficult to decide on a batch size in advance because of the dynamic nature of video streams. Further, multimedia applications like facial expression recognition involve a fuzzy label space because of the imprecision and the vagueness in the class label boundaries. This necessitates a BMAL framework, for fuzzy label problems. To address these fundamental challenges, we propose two novel BMAL techniques in this work: (i) a framework for dynamic batch mode active learning, which adaptively selects the batch size and the specific instances to be queried based on the complexity of the data stream being analyzed and (ii) a BMAL algorithm for fuzzy label classification problems. To the best of our knowledge, this is the first attempt to develop such techniques in the active learning literature.",
author = "Shayok Chakraborty and Vineeth Balasubramanian and Sethuraman Panchanathan",
year = "2012",
doi = "10.1109/ISM.2012.101",
language = "English (US)",
isbn = "9780769548753",
pages = "489--490",
booktitle = "Proceedings - 2012 IEEE International Symposium on Multimedia, ISM 2012",

}

TY - GEN

T1 - Batch mode active learning for multimedia pattern recognition

AU - Chakraborty, Shayok

AU - Balasubramanian, Vineeth

AU - Panchanathan, Sethuraman

PY - 2012

Y1 - 2012

N2 - Multimedia applications like face recognition and facial expression recognition inherently rely on the availability of a large amount of labeled data to train a robust recognition system. In order to induce a reliable classification model for a multimedia pattern recognition application, the data is typically labeled by human experts based on some domain knowledge. However, manual annotation of a large number of images is an expensive process in terms of time, labor and human expertise. This has led to the development of active learning algorithms, which automatically identify the salient instances from a given set of unlabeled data and are effective in reducing the human annotation effort to train a classification model. Further, to address the possible presence of multiple labeling oracles, there have been efforts towards a batch form of active learning, where a set of unlabeled images are selected simultaneously for labeling instead of a single image at a time. Existing algorithms on batch mode active learning concentrate only on the development of a batch selection criterion and assume that the batch size (number of samples to be queried from an unlabeled set) to be specified in advance. However, in multimedia applications like face/facial expression recognition, it is difficult to decide on a batch size in advance because of the dynamic nature of video streams. Further, multimedia applications like facial expression recognition involve a fuzzy label space because of the imprecision and the vagueness in the class label boundaries. This necessitates a BMAL framework, for fuzzy label problems. To address these fundamental challenges, we propose two novel BMAL techniques in this work: (i) a framework for dynamic batch mode active learning, which adaptively selects the batch size and the specific instances to be queried based on the complexity of the data stream being analyzed and (ii) a BMAL algorithm for fuzzy label classification problems. To the best of our knowledge, this is the first attempt to develop such techniques in the active learning literature.

AB - Multimedia applications like face recognition and facial expression recognition inherently rely on the availability of a large amount of labeled data to train a robust recognition system. In order to induce a reliable classification model for a multimedia pattern recognition application, the data is typically labeled by human experts based on some domain knowledge. However, manual annotation of a large number of images is an expensive process in terms of time, labor and human expertise. This has led to the development of active learning algorithms, which automatically identify the salient instances from a given set of unlabeled data and are effective in reducing the human annotation effort to train a classification model. Further, to address the possible presence of multiple labeling oracles, there have been efforts towards a batch form of active learning, where a set of unlabeled images are selected simultaneously for labeling instead of a single image at a time. Existing algorithms on batch mode active learning concentrate only on the development of a batch selection criterion and assume that the batch size (number of samples to be queried from an unlabeled set) to be specified in advance. However, in multimedia applications like face/facial expression recognition, it is difficult to decide on a batch size in advance because of the dynamic nature of video streams. Further, multimedia applications like facial expression recognition involve a fuzzy label space because of the imprecision and the vagueness in the class label boundaries. This necessitates a BMAL framework, for fuzzy label problems. To address these fundamental challenges, we propose two novel BMAL techniques in this work: (i) a framework for dynamic batch mode active learning, which adaptively selects the batch size and the specific instances to be queried based on the complexity of the data stream being analyzed and (ii) a BMAL algorithm for fuzzy label classification problems. To the best of our knowledge, this is the first attempt to develop such techniques in the active learning literature.

UR - http://www.scopus.com/inward/record.url?scp=84874272706&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84874272706&partnerID=8YFLogxK

U2 - 10.1109/ISM.2012.101

DO - 10.1109/ISM.2012.101

M3 - Conference contribution

AN - SCOPUS:84874272706

SN - 9780769548753

SP - 489

EP - 490

BT - Proceedings - 2012 IEEE International Symposium on Multimedia, ISM 2012

ER -