Maximizing ANLP evaluation: Harmonizing flawed input

Adam M. Renner, Philip M. McCarthy, Chutima Boonthum-Denecke, Danielle McNamara

Research output: Chapter in Book/Report/Conference proceedingChapter

1 Scopus citations

Abstract

A continuing problem for ANLP (compared with NLP) is that language tends to be more natural in ANLP than that examined in more controlled natural language processing (NLP) studies. Specifically, ineffective or misleading feedback can result from faulty assessment of misspelled words. This chapter describes the Harmonizer system for addressing the problem of user input irregularities (e.g., typos). The Harmonizer is specifically designed for Intelligence Tutoring Systems (ITSs) that use NLP to provide assessment and feedback based on the typed input of the user. Our approach is to "harmonize" similar words to the same form in the benchmark, rather than correcting them to dictionary entries. This chapter describes the Harmonizer, and evaluates its performance using various computational approaches on unedited input from high school students in the context of an ITS (i.e., iSTART). Our results indicate that various metric approaches to NLP (such as word-overlap cohesion scores) are moderately affected when student errors are filtered by the Harmonizer. Given the prevalence of typing errors in the sample, the study substantiates the need to "clean" typed input in comparable NLP-based learning systems. The Harmonizer provides such ability and is easy to implement with light processing requirements.

Original languageEnglish (US)
Title of host publicationApplied Natural Language Processing
Subtitle of host publicationIdentification, Investigation and Resolution
PublisherIGI Global
Pages436-454
Number of pages19
ISBN (Print)9781609607418
DOIs
StatePublished - 2011

ASJC Scopus subject areas

  • General Computer Science

Fingerprint

Dive into the research topics of 'Maximizing ANLP evaluation: Harmonizing flawed input'. Together they form a unique fingerprint.

Cite this