Abstract

A story is defined as actors taking actions that culminate in resolutions. In this paper, we extract subject - verb - object relationships from paragraphs and generalize them into semantic conceptual representations. Overlapping generalized concepts and relationships correspond to archetypes/targets and actions that characterize story forms. We present an analytic framework which implements co-clustering based on generalized conceptual relationships to automatically detect such story forms. Co-clustering can help in identifying similarities that exist in low-dimensional sub-spaces of sparse data such as textual paragraphs. Through co-clustering, we detect not only the clusters themselves but also their characteristic features which can be useful in describing and summarizing their contents. We perform co-clustering of stories using two different types of features: standard unigrams/bigrams and generalized concepts. We show that the residual error of factorization with concept-based features is significantly lower than the error with standard keyword-based features. Qualitative evaluations also suggest that concept-based features yield more coherent, distinctive and interesting story forms compared to those produced by using standard keyword-based features.

Original languageEnglish (US)
Title of host publicationProceedings - 2016 IEEE International Conferences on Big Data and Cloud Computing, BDCloud 2016, Social Computing and Networking, SocialCom 2016 and Sustainable Computing and Communications, SustainCom 2016
EditorsZhipeng Cai, Guangchun Luo, Liang Cheng, Rafal Angryk, Yingshu Li, Anu Bourgeois, Wenzhan Song, Xiaojun Cao, Bhaskar Krishnamachari
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages258-265
Number of pages8
ISBN (Electronic)9781509039364
DOIs
StatePublished - Oct 26 2016
Event6th IEEE International Conference on Big Data and Cloud Computing, BDCloud 2016, 9th IEEE International Conference on Social Computing and Networking, SocialCom 2016 and 2016 IEEE International Conference on Sustainable Computing and Communications, SustainCom 2016 - Atlanta, United States
Duration: Oct 8 2016Oct 10 2016

Publication series

NameProceedings - 2016 IEEE International Conferences on Big Data and Cloud Computing, BDCloud 2016, Social Computing and Networking, SocialCom 2016 and Sustainable Computing and Communications, SustainCom 2016

Other

Other6th IEEE International Conference on Big Data and Cloud Computing, BDCloud 2016, 9th IEEE International Conference on Social Computing and Networking, SocialCom 2016 and 2016 IEEE International Conference on Sustainable Computing and Communications, SustainCom 2016
Country/TerritoryUnited States
CityAtlanta
Period10/8/1610/10/16

Keywords

  • Co-clustering
  • Narrative analysis
  • Non-negative matrix factorization
  • Story forms

ASJC Scopus subject areas

  • Information Systems and Management
  • Computer Networks and Communications
  • Information Systems
  • Sociology and Political Science
  • Communication

Fingerprint

Dive into the research topics of 'Story forms detection in text through concept-based co-clustering'. Together they form a unique fingerprint.

Cite this