ReaderBench: A multi-lingual framework for analyzing text complexity

Mihai Dascalu, Gabriel Gutu, Stefan Ruseti, Ionut Cristian Paraschiv, Philippe Dessus, Danielle McNamara, Scott A. Crossley, Stefan Trausan-Matu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Assessing textual complexity is a difficult, but important endeavor, especially for adapting learning materials to students’ and readers’ levels of understanding. With the continuous growth of information technologies spanning through various research fields, automated assessment tools have become reliable solutions to automatically assessing textual complexity. ReaderBench is a text processing framework relying on advanced Natural Language Processing techniques that encompass a wide range of text analysis modules available in a variety of languages, including English, French, Romanian, and Dutch. To our knowledge, ReaderBench is the only open-source multilingual textual analysis solution that provides unified access to more than 200 textual complexity indices including: surface, syntactic, morphological, semantic, and discourse specific factors, alongside cohesion metrics derived from specific lexicalized ontologies and semantic models.

Original languageEnglish (US)
Title of host publicationData Driven Approaches in Digital Education - 12th European Conference on Technology Enhanced Learning, EC-TEL 2017, Proceedings
PublisherSpringer Verlag
Pages495-499
Number of pages5
Volume10474 LNCS
ISBN (Print)9783319666099
DOIs
StatePublished - 2017
Event12th European Conference on Technology Enhanced Learning, EC-TEL 2017 - Tallinn, Estonia
Duration: Sep 12 2017Sep 15 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10474 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other12th European Conference on Technology Enhanced Learning, EC-TEL 2017
CountryEstonia
CityTallinn
Period9/12/179/15/17

Fingerprint

Semantics
Text processing
Syntactics
Information technology
Text Processing
Text Analysis
Ontology
Cohesion
Students
Information Technology
Open Source
Natural Language
Processing
Metric
Module
Range of data
Framework
Text
Model
Language

Keywords

  • Comprehension prediction
  • Multi-lingual text analysis
  • Natural Language Processing
  • Textual cohesion
  • Textual complexity
  • Writing style

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Dascalu, M., Gutu, G., Ruseti, S., Paraschiv, I. C., Dessus, P., McNamara, D., ... Trausan-Matu, S. (2017). ReaderBench: A multi-lingual framework for analyzing text complexity. In Data Driven Approaches in Digital Education - 12th European Conference on Technology Enhanced Learning, EC-TEL 2017, Proceedings (Vol. 10474 LNCS, pp. 495-499). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10474 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-319-66610-5_48

ReaderBench : A multi-lingual framework for analyzing text complexity. / Dascalu, Mihai; Gutu, Gabriel; Ruseti, Stefan; Paraschiv, Ionut Cristian; Dessus, Philippe; McNamara, Danielle; Crossley, Scott A.; Trausan-Matu, Stefan.

Data Driven Approaches in Digital Education - 12th European Conference on Technology Enhanced Learning, EC-TEL 2017, Proceedings. Vol. 10474 LNCS Springer Verlag, 2017. p. 495-499 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10474 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Dascalu, M, Gutu, G, Ruseti, S, Paraschiv, IC, Dessus, P, McNamara, D, Crossley, SA & Trausan-Matu, S 2017, ReaderBench: A multi-lingual framework for analyzing text complexity. in Data Driven Approaches in Digital Education - 12th European Conference on Technology Enhanced Learning, EC-TEL 2017, Proceedings. vol. 10474 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 10474 LNCS, Springer Verlag, pp. 495-499, 12th European Conference on Technology Enhanced Learning, EC-TEL 2017, Tallinn, Estonia, 9/12/17. https://doi.org/10.1007/978-3-319-66610-5_48
Dascalu M, Gutu G, Ruseti S, Paraschiv IC, Dessus P, McNamara D et al. ReaderBench: A multi-lingual framework for analyzing text complexity. In Data Driven Approaches in Digital Education - 12th European Conference on Technology Enhanced Learning, EC-TEL 2017, Proceedings. Vol. 10474 LNCS. Springer Verlag. 2017. p. 495-499. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-66610-5_48
Dascalu, Mihai ; Gutu, Gabriel ; Ruseti, Stefan ; Paraschiv, Ionut Cristian ; Dessus, Philippe ; McNamara, Danielle ; Crossley, Scott A. ; Trausan-Matu, Stefan. / ReaderBench : A multi-lingual framework for analyzing text complexity. Data Driven Approaches in Digital Education - 12th European Conference on Technology Enhanced Learning, EC-TEL 2017, Proceedings. Vol. 10474 LNCS Springer Verlag, 2017. pp. 495-499 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{08584939b78a4227bbe26c9e6da8b317,
title = "ReaderBench: A multi-lingual framework for analyzing text complexity",
abstract = "Assessing textual complexity is a difficult, but important endeavor, especially for adapting learning materials to students’ and readers’ levels of understanding. With the continuous growth of information technologies spanning through various research fields, automated assessment tools have become reliable solutions to automatically assessing textual complexity. ReaderBench is a text processing framework relying on advanced Natural Language Processing techniques that encompass a wide range of text analysis modules available in a variety of languages, including English, French, Romanian, and Dutch. To our knowledge, ReaderBench is the only open-source multilingual textual analysis solution that provides unified access to more than 200 textual complexity indices including: surface, syntactic, morphological, semantic, and discourse specific factors, alongside cohesion metrics derived from specific lexicalized ontologies and semantic models.",
keywords = "Comprehension prediction, Multi-lingual text analysis, Natural Language Processing, Textual cohesion, Textual complexity, Writing style",
author = "Mihai Dascalu and Gabriel Gutu and Stefan Ruseti and Paraschiv, {Ionut Cristian} and Philippe Dessus and Danielle McNamara and Crossley, {Scott A.} and Stefan Trausan-Matu",
year = "2017",
doi = "10.1007/978-3-319-66610-5_48",
language = "English (US)",
isbn = "9783319666099",
volume = "10474 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "495--499",
booktitle = "Data Driven Approaches in Digital Education - 12th European Conference on Technology Enhanced Learning, EC-TEL 2017, Proceedings",
address = "Germany",

}

TY - GEN

T1 - ReaderBench

T2 - A multi-lingual framework for analyzing text complexity

AU - Dascalu, Mihai

AU - Gutu, Gabriel

AU - Ruseti, Stefan

AU - Paraschiv, Ionut Cristian

AU - Dessus, Philippe

AU - McNamara, Danielle

AU - Crossley, Scott A.

AU - Trausan-Matu, Stefan

PY - 2017

Y1 - 2017

N2 - Assessing textual complexity is a difficult, but important endeavor, especially for adapting learning materials to students’ and readers’ levels of understanding. With the continuous growth of information technologies spanning through various research fields, automated assessment tools have become reliable solutions to automatically assessing textual complexity. ReaderBench is a text processing framework relying on advanced Natural Language Processing techniques that encompass a wide range of text analysis modules available in a variety of languages, including English, French, Romanian, and Dutch. To our knowledge, ReaderBench is the only open-source multilingual textual analysis solution that provides unified access to more than 200 textual complexity indices including: surface, syntactic, morphological, semantic, and discourse specific factors, alongside cohesion metrics derived from specific lexicalized ontologies and semantic models.

AB - Assessing textual complexity is a difficult, but important endeavor, especially for adapting learning materials to students’ and readers’ levels of understanding. With the continuous growth of information technologies spanning through various research fields, automated assessment tools have become reliable solutions to automatically assessing textual complexity. ReaderBench is a text processing framework relying on advanced Natural Language Processing techniques that encompass a wide range of text analysis modules available in a variety of languages, including English, French, Romanian, and Dutch. To our knowledge, ReaderBench is the only open-source multilingual textual analysis solution that provides unified access to more than 200 textual complexity indices including: surface, syntactic, morphological, semantic, and discourse specific factors, alongside cohesion metrics derived from specific lexicalized ontologies and semantic models.

KW - Comprehension prediction

KW - Multi-lingual text analysis

KW - Natural Language Processing

KW - Textual cohesion

KW - Textual complexity

KW - Writing style

UR - http://www.scopus.com/inward/record.url?scp=85029603796&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85029603796&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-66610-5_48

DO - 10.1007/978-3-319-66610-5_48

M3 - Conference contribution

AN - SCOPUS:85029603796

SN - 9783319666099

VL - 10474 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 495

EP - 499

BT - Data Driven Approaches in Digital Education - 12th European Conference on Technology Enhanced Learning, EC-TEL 2017, Proceedings

PB - Springer Verlag

ER -