Maximizing ANLP evaluation: Harmonizing flawed input

Adam M. Renner; Philip M. McCarthy; Chutima Boonthum-Denecke; Danielle McNamara

doi:10.4018/978-1-60960-741-8.ch026

Maximizing ANLP evaluation: Harmonizing flawed input

Adam M. Renner, Philip M. McCarthy, Chutima Boonthum-Denecke, Danielle McNamara

Educational Leadership and Innovation, Division of

Research output: Chapter in Book/Report/Conference proceeding › Chapter

1 Scopus citations

Abstract

A continuing problem for ANLP (compared with NLP) is that language tends to be more natural in ANLP than that examined in more controlled natural language processing (NLP) studies. Specifically, ineffective or misleading feedback can result from faulty assessment of misspelled words. This chapter describes the Harmonizer system for addressing the problem of user input irregularities (e.g., typos). The Harmonizer is specifically designed for Intelligence Tutoring Systems (ITSs) that use NLP to provide assessment and feedback based on the typed input of the user. Our approach is to "harmonize" similar words to the same form in the benchmark, rather than correcting them to dictionary entries. This chapter describes the Harmonizer, and evaluates its performance using various computational approaches on unedited input from high school students in the context of an ITS (i.e., iSTART). Our results indicate that various metric approaches to NLP (such as word-overlap cohesion scores) are moderately affected when student errors are filtered by the Harmonizer. Given the prevalence of typing errors in the sample, the study substantiates the need to "clean" typed input in comparable NLP-based learning systems. The Harmonizer provides such ability and is easy to implement with light processing requirements.

Original language	English (US)
Title of host publication	Applied Natural Language Processing
Subtitle of host publication	Identification, Investigation and Resolution
Publisher	IGI Global
Pages	436-454
Number of pages	19
ISBN (Print)	9781609607418
DOIs	https://doi.org/10.4018/978-1-60960-741-8.ch026
State	Published - 2011

ASJC Scopus subject areas

General Computer Science

Access to Document

10.4018/978-1-60960-741-8.ch026

Cite this

@inbook{072ffd098fad4c3b998a3e22e1d35868,

title = "Maximizing ANLP evaluation: Harmonizing flawed input",

abstract = "A continuing problem for ANLP (compared with NLP) is that language tends to be more natural in ANLP than that examined in more controlled natural language processing (NLP) studies. Specifically, ineffective or misleading feedback can result from faulty assessment of misspelled words. This chapter describes the Harmonizer system for addressing the problem of user input irregularities (e.g., typos). The Harmonizer is specifically designed for Intelligence Tutoring Systems (ITSs) that use NLP to provide assessment and feedback based on the typed input of the user. Our approach is to {"}harmonize{"} similar words to the same form in the benchmark, rather than correcting them to dictionary entries. This chapter describes the Harmonizer, and evaluates its performance using various computational approaches on unedited input from high school students in the context of an ITS (i.e., iSTART). Our results indicate that various metric approaches to NLP (such as word-overlap cohesion scores) are moderately affected when student errors are filtered by the Harmonizer. Given the prevalence of typing errors in the sample, the study substantiates the need to {"}clean{"} typed input in comparable NLP-based learning systems. The Harmonizer provides such ability and is easy to implement with light processing requirements.",

author = "Renner, {Adam M.} and McCarthy, {Philip M.} and Chutima Boonthum-Denecke and Danielle McNamara",

year = "2011",

doi = "10.4018/978-1-60960-741-8.ch026",

language = "English (US)",

isbn = "9781609607418",

pages = "436--454",

booktitle = "Applied Natural Language Processing",

publisher = "IGI Global",

}

TY - CHAP

T1 - Maximizing ANLP evaluation

T2 - Harmonizing flawed input

AU - Renner, Adam M.

AU - McCarthy, Philip M.

AU - Boonthum-Denecke, Chutima

AU - McNamara, Danielle

PY - 2011

Y1 - 2011

N2 - A continuing problem for ANLP (compared with NLP) is that language tends to be more natural in ANLP than that examined in more controlled natural language processing (NLP) studies. Specifically, ineffective or misleading feedback can result from faulty assessment of misspelled words. This chapter describes the Harmonizer system for addressing the problem of user input irregularities (e.g., typos). The Harmonizer is specifically designed for Intelligence Tutoring Systems (ITSs) that use NLP to provide assessment and feedback based on the typed input of the user. Our approach is to "harmonize" similar words to the same form in the benchmark, rather than correcting them to dictionary entries. This chapter describes the Harmonizer, and evaluates its performance using various computational approaches on unedited input from high school students in the context of an ITS (i.e., iSTART). Our results indicate that various metric approaches to NLP (such as word-overlap cohesion scores) are moderately affected when student errors are filtered by the Harmonizer. Given the prevalence of typing errors in the sample, the study substantiates the need to "clean" typed input in comparable NLP-based learning systems. The Harmonizer provides such ability and is easy to implement with light processing requirements.

AB - A continuing problem for ANLP (compared with NLP) is that language tends to be more natural in ANLP than that examined in more controlled natural language processing (NLP) studies. Specifically, ineffective or misleading feedback can result from faulty assessment of misspelled words. This chapter describes the Harmonizer system for addressing the problem of user input irregularities (e.g., typos). The Harmonizer is specifically designed for Intelligence Tutoring Systems (ITSs) that use NLP to provide assessment and feedback based on the typed input of the user. Our approach is to "harmonize" similar words to the same form in the benchmark, rather than correcting them to dictionary entries. This chapter describes the Harmonizer, and evaluates its performance using various computational approaches on unedited input from high school students in the context of an ITS (i.e., iSTART). Our results indicate that various metric approaches to NLP (such as word-overlap cohesion scores) are moderately affected when student errors are filtered by the Harmonizer. Given the prevalence of typing errors in the sample, the study substantiates the need to "clean" typed input in comparable NLP-based learning systems. The Harmonizer provides such ability and is easy to implement with light processing requirements.

UR - http://www.scopus.com/inward/record.url?scp=84899337542&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84899337542&partnerID=8YFLogxK

U2 - 10.4018/978-1-60960-741-8.ch026

DO - 10.4018/978-1-60960-741-8.ch026

M3 - Chapter

AN - SCOPUS:84899337542

SN - 9781609607418

SP - 436

EP - 454

BT - Applied Natural Language Processing

PB - IGI Global

ER -

Maximizing ANLP evaluation: Harmonizing flawed input

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this