Word ambiguity and search: Implications for enterprise performance management

David Schuff, Karen Corral, Robert St Louis, Gregory Schymik

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The proliferation of unstructured data is a growing threat to effective enterprise performance management. Enterprise search is a tool to help organizations more effectively manage this documentbased information. The success of full-text enterprise search is limited by ambiguity in word meanings, which can result in many documents returned which are not relevant to the searcher. While early work by Zipf provided a first attempt at quantifying the impact of this issue on search, little work has been done to demonstrate the applicability of Zipf's work to contemporary document collections. In this paper we examine whether the frequency-meaning relationship discovered by Zipf holds for contemporary document collections, and whether it consistently holds across different subject domains. We then discuss the implications of our results for the development and use of user-centered KPIs designed to measure the enterprise wide effectiveness of search activities.

Original languageEnglish (US)
Title of host publicationAMCIS 2016: Surfing the IT Innovation Wave - 22nd Americas Conference on Information Systems
PublisherAssociation for Information Systems
StatePublished - 2016
Event22nd Americas Conference on Information Systems: Surfing the IT Innovation Wave, AMCIS 2016 - San Diego, United States
Duration: Aug 11 2016Aug 14 2016

Other

Other22nd Americas Conference on Information Systems: Surfing the IT Innovation Wave, AMCIS 2016
CountryUnited States
CitySan Diego
Period8/11/168/14/16

Fingerprint

Industry

Keywords

  • Document management
  • Enterprise search
  • KPI
  • Semantic ambiguity
  • Zipf's frequency-meaning relationship

ASJC Scopus subject areas

  • Computer Science Applications
  • Computer Networks and Communications
  • Information Systems

Cite this

Schuff, D., Corral, K., St Louis, R., & Schymik, G. (2016). Word ambiguity and search: Implications for enterprise performance management. In AMCIS 2016: Surfing the IT Innovation Wave - 22nd Americas Conference on Information Systems Association for Information Systems.

Word ambiguity and search : Implications for enterprise performance management. / Schuff, David; Corral, Karen; St Louis, Robert; Schymik, Gregory.

AMCIS 2016: Surfing the IT Innovation Wave - 22nd Americas Conference on Information Systems. Association for Information Systems, 2016.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Schuff, D, Corral, K, St Louis, R & Schymik, G 2016, Word ambiguity and search: Implications for enterprise performance management. in AMCIS 2016: Surfing the IT Innovation Wave - 22nd Americas Conference on Information Systems. Association for Information Systems, 22nd Americas Conference on Information Systems: Surfing the IT Innovation Wave, AMCIS 2016, San Diego, United States, 8/11/16.
Schuff D, Corral K, St Louis R, Schymik G. Word ambiguity and search: Implications for enterprise performance management. In AMCIS 2016: Surfing the IT Innovation Wave - 22nd Americas Conference on Information Systems. Association for Information Systems. 2016
Schuff, David ; Corral, Karen ; St Louis, Robert ; Schymik, Gregory. / Word ambiguity and search : Implications for enterprise performance management. AMCIS 2016: Surfing the IT Innovation Wave - 22nd Americas Conference on Information Systems. Association for Information Systems, 2016.
@inproceedings{42c509931c9041729dfcd097c190e17a,
title = "Word ambiguity and search: Implications for enterprise performance management",
abstract = "The proliferation of unstructured data is a growing threat to effective enterprise performance management. Enterprise search is a tool to help organizations more effectively manage this documentbased information. The success of full-text enterprise search is limited by ambiguity in word meanings, which can result in many documents returned which are not relevant to the searcher. While early work by Zipf provided a first attempt at quantifying the impact of this issue on search, little work has been done to demonstrate the applicability of Zipf's work to contemporary document collections. In this paper we examine whether the frequency-meaning relationship discovered by Zipf holds for contemporary document collections, and whether it consistently holds across different subject domains. We then discuss the implications of our results for the development and use of user-centered KPIs designed to measure the enterprise wide effectiveness of search activities.",
keywords = "Document management, Enterprise search, KPI, Semantic ambiguity, Zipf's frequency-meaning relationship",
author = "David Schuff and Karen Corral and {St Louis}, Robert and Gregory Schymik",
year = "2016",
language = "English (US)",
booktitle = "AMCIS 2016: Surfing the IT Innovation Wave - 22nd Americas Conference on Information Systems",
publisher = "Association for Information Systems",

}

TY - GEN

T1 - Word ambiguity and search

T2 - Implications for enterprise performance management

AU - Schuff, David

AU - Corral, Karen

AU - St Louis, Robert

AU - Schymik, Gregory

PY - 2016

Y1 - 2016

N2 - The proliferation of unstructured data is a growing threat to effective enterprise performance management. Enterprise search is a tool to help organizations more effectively manage this documentbased information. The success of full-text enterprise search is limited by ambiguity in word meanings, which can result in many documents returned which are not relevant to the searcher. While early work by Zipf provided a first attempt at quantifying the impact of this issue on search, little work has been done to demonstrate the applicability of Zipf's work to contemporary document collections. In this paper we examine whether the frequency-meaning relationship discovered by Zipf holds for contemporary document collections, and whether it consistently holds across different subject domains. We then discuss the implications of our results for the development and use of user-centered KPIs designed to measure the enterprise wide effectiveness of search activities.

AB - The proliferation of unstructured data is a growing threat to effective enterprise performance management. Enterprise search is a tool to help organizations more effectively manage this documentbased information. The success of full-text enterprise search is limited by ambiguity in word meanings, which can result in many documents returned which are not relevant to the searcher. While early work by Zipf provided a first attempt at quantifying the impact of this issue on search, little work has been done to demonstrate the applicability of Zipf's work to contemporary document collections. In this paper we examine whether the frequency-meaning relationship discovered by Zipf holds for contemporary document collections, and whether it consistently holds across different subject domains. We then discuss the implications of our results for the development and use of user-centered KPIs designed to measure the enterprise wide effectiveness of search activities.

KW - Document management

KW - Enterprise search

KW - KPI

KW - Semantic ambiguity

KW - Zipf's frequency-meaning relationship

UR - http://www.scopus.com/inward/record.url?scp=84987680696&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84987680696&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84987680696

BT - AMCIS 2016: Surfing the IT Innovation Wave - 22nd Americas Conference on Information Systems

PB - Association for Information Systems

ER -