Please readerbench this text: A multi-dimensional textual complexity assessment framework

Mihai Dascalu, Scott A. Crossley, Danielle S. McNamara, Philippe Dessus, Stefan Trausan-Matu

Research output: Chapter in Book/Report/Conference proceedingChapter

17 Scopus citations

Abstract

A critical task for tutors is to provide learners with suitable reading materials in terms of difficulty. The challenge of this endeavor is increased by students' individual variability and the multiple levels in which complexity can vary, thus arguing for the necessity of automated systems to support teachers. This chapter describes ReaderBench, an open-source multi-dimensional and multi-lingual system that uses advanced Natural Language Processing techniques to assess textual complexity at multiple levels including surface-based, syntax, semantics and discourse structure. In contrast to other existing approaches, ReaderBench is centered on cohesion and makes extensive usage of two complementary models, i.e., Cohesion Network Analysis and the polyphonic model inspired from dialogism. The first model provides an in-depth view of discourse in terms of cohesive links, whereas the second one highlights interactions between points of view spanning throughout the discourse. In order to argue for its wide applicability and extensibility, two studies are introduced. The first study investigates the degree to which ReaderBench textual complexity indices differentiate between high and low cohesion texts. The ReaderBench indices led to a higher classification accuracy than those included in prior studies using Coh-Metrix and TAACO. In the second study, ReaderBench indices are used to predict the difficulty of a set of various texts. Although the high number of predictive indices (50 plus) accounted for less variance than previous studies, they make valuable contributions to our understanding of text due to their wide coverage.

Original languageEnglish (US)
Title of host publicationTutoring and Intelligent Tutoring Systems
PublisherNova Science Publishers, Inc.
Pages251-271
Number of pages21
ISBN (Electronic)9781536140866
ISBN (Print)9781536140859
StatePublished - Aug 28 2018

Keywords

  • Automated essay scoring
  • Comprehension modeling
  • Data analytics
  • Learning analytics
  • Natural Language Processing

ASJC Scopus subject areas

  • General Social Sciences

Fingerprint

Dive into the research topics of 'Please readerbench this text: A multi-dimensional textual complexity assessment framework'. Together they form a unique fingerprint.

Cite this