Extended Multi-document Cohesion Network Analysis Centered on Comprehension Prediction

Bogdan Nicula; Cecile A. Perret; Mihai Dascalu; Danielle S. McNamara

doi:10.1007/978-3-030-52240-7_42

Extended Multi-document Cohesion Network Analysis Centered on Comprehension Prediction

Bogdan Nicula, Cecile A. Perret, Mihai Dascalu, Danielle S. McNamara

Psychology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Scopus citations

Abstract

Theories of discourse argue that comprehension depends on the coherence of the learner’s mental representation. Our aim is to create a reliable automated representation to estimate readers’ level of comprehension based on different productions, namely self-explanations and answers to open-ended questions. Previous work relied on Cohesion Network Analysis to model a cohesion graph composed of semantic links between multiple reference texts and student productions. From this graph, a set of features was derived and used to build machine learning models to predict student comprehension scores. In this paper, we build on top of the previous study by: a) extending the CNA graph by adding new semantic links targeting specific sentences that should have been captured within the learner’s productions, and b) cleaning the self-explanations by eliminating frozen expression, as well as entries which seemed nearly identical to the source text. The results are in line with the conclusions of the previous study regarding the importance of both self-explanations and question answers in predicting the students’ reading comprehension level. They also outline the limitations of our feature generation approach, in which no substantial improvements were detected, despite adding more fine-grained features.

Original language	English (US)
Title of host publication	Artificial Intelligence in Education - 21st International Conference, AIED 2020, Proceedings
Editors	Ig Ibert Bittencourt, Mutlu Cukurova, Rose Luckin, Kasia Muldner, Eva Millán
Publisher	Springer
Pages	228-233
Number of pages	6
ISBN (Print)	9783030522391
DOIs	https://doi.org/10.1007/978-3-030-52240-7_42
State	Published - 2020
Event	21st International Conference on Artificial Intelligence in Education, AIED 2020 - Ifrane, Morocco Duration: Jul 6 2020 → Jul 10 2020

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	12164 LNAI
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	21st International Conference on Artificial Intelligence in Education, AIED 2020
Country/Territory	Morocco
City	Ifrane
Period	7/6/20 → 7/10/20

Keywords

Cohesion Network Analysis
Multi-document comprehension modeling
Natural Language Processing

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-030-52240-7_42

Cite this

Nicula, B., Perret, C. A., Dascalu, M., & McNamara, D. S. (2020). Extended Multi-document Cohesion Network Analysis Centered on Comprehension Prediction. In I. I. Bittencourt, M. Cukurova, R. Luckin, K. Muldner, & E. Millán (Eds.), Artificial Intelligence in Education - 21st International Conference, AIED 2020, Proceedings (pp. 228-233). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12164 LNAI). Springer. https://doi.org/10.1007/978-3-030-52240-7_42

Extended Multi-document Cohesion Network Analysis Centered on Comprehension Prediction. / Nicula, Bogdan; Perret, Cecile A.; Dascalu, Mihai et al.
Artificial Intelligence in Education - 21st International Conference, AIED 2020, Proceedings. ed. / Ig Ibert Bittencourt; Mutlu Cukurova; Rose Luckin; Kasia Muldner; Eva Millán. Springer, 2020. p. 228-233 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12164 LNAI).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Nicula, B, Perret, CA, Dascalu, M & McNamara, DS 2020, Extended Multi-document Cohesion Network Analysis Centered on Comprehension Prediction. in II Bittencourt, M Cukurova, R Luckin, K Muldner & E Millán (eds), Artificial Intelligence in Education - 21st International Conference, AIED 2020, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12164 LNAI, Springer, pp. 228-233, 21st International Conference on Artificial Intelligence in Education, AIED 2020, Ifrane, Morocco, 7/6/20. https://doi.org/10.1007/978-3-030-52240-7_42

Nicula B, Perret CA, Dascalu M, McNamara DS. Extended Multi-document Cohesion Network Analysis Centered on Comprehension Prediction. In Bittencourt II, Cukurova M, Luckin R, Muldner K, Millán E, editors, Artificial Intelligence in Education - 21st International Conference, AIED 2020, Proceedings. Springer. 2020. p. 228-233. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-52240-7_42

Nicula, Bogdan ; Perret, Cecile A. ; Dascalu, Mihai et al. / Extended Multi-document Cohesion Network Analysis Centered on Comprehension Prediction. Artificial Intelligence in Education - 21st International Conference, AIED 2020, Proceedings. editor / Ig Ibert Bittencourt ; Mutlu Cukurova ; Rose Luckin ; Kasia Muldner ; Eva Millán. Springer, 2020. pp. 228-233 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{f24c066d53814c4b822f8bbfe6bff08e,

title = "Extended Multi-document Cohesion Network Analysis Centered on Comprehension Prediction",

abstract = "Theories of discourse argue that comprehension depends on the coherence of the learner{\textquoteright}s mental representation. Our aim is to create a reliable automated representation to estimate readers{\textquoteright} level of comprehension based on different productions, namely self-explanations and answers to open-ended questions. Previous work relied on Cohesion Network Analysis to model a cohesion graph composed of semantic links between multiple reference texts and student productions. From this graph, a set of features was derived and used to build machine learning models to predict student comprehension scores. In this paper, we build on top of the previous study by: a) extending the CNA graph by adding new semantic links targeting specific sentences that should have been captured within the learner{\textquoteright}s productions, and b) cleaning the self-explanations by eliminating frozen expression, as well as entries which seemed nearly identical to the source text. The results are in line with the conclusions of the previous study regarding the importance of both self-explanations and question answers in predicting the students{\textquoteright} reading comprehension level. They also outline the limitations of our feature generation approach, in which no substantial improvements were detected, despite adding more fine-grained features.",

keywords = "Cohesion Network Analysis, Multi-document comprehension modeling, Natural Language Processing",

author = "Bogdan Nicula and Perret, {Cecile A.} and Mihai Dascalu and McNamara, {Danielle S.}",

note = "Funding Information: This research was partially supported by a grant of the Romanian National Authority for Scientific Research and Innovation, CNCS – UEFISCDI, project number PN-III-P1-1.2-PCCDI-2017-0689/“Lib2Life-Revitalizing Libraries and Cultural Heritage through Advanced Technologies” within PNCDI III, the Institute of Education Sciences (R305A180144, R305A180261 and R305A190063), and the Office of Naval Research (N00014-17-1-2300). The opinions expressed are those of the authors and do not represent views of the IES or ONR. Publisher Copyright: {\textcopyright} 2020, Springer Nature Switzerland AG.; 21st International Conference on Artificial Intelligence in Education, AIED 2020 ; Conference date: 06-07-2020 Through 10-07-2020",

year = "2020",

doi = "10.1007/978-3-030-52240-7_42",

language = "English (US)",

isbn = "9783030522391",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer",

pages = "228--233",

editor = "Bittencourt, {Ig Ibert} and Mutlu Cukurova and Rose Luckin and Kasia Muldner and Eva Mill{\'a}n",

booktitle = "Artificial Intelligence in Education - 21st International Conference, AIED 2020, Proceedings",

}

TY - GEN

T1 - Extended Multi-document Cohesion Network Analysis Centered on Comprehension Prediction

AU - Nicula, Bogdan

AU - Perret, Cecile A.

AU - Dascalu, Mihai

AU - McNamara, Danielle S.

N1 - Funding Information: This research was partially supported by a grant of the Romanian National Authority for Scientific Research and Innovation, CNCS – UEFISCDI, project number PN-III-P1-1.2-PCCDI-2017-0689/“Lib2Life-Revitalizing Libraries and Cultural Heritage through Advanced Technologies” within PNCDI III, the Institute of Education Sciences (R305A180144, R305A180261 and R305A190063), and the Office of Naval Research (N00014-17-1-2300). The opinions expressed are those of the authors and do not represent views of the IES or ONR. Publisher Copyright: © 2020, Springer Nature Switzerland AG.

PY - 2020

Y1 - 2020

N2 - Theories of discourse argue that comprehension depends on the coherence of the learner’s mental representation. Our aim is to create a reliable automated representation to estimate readers’ level of comprehension based on different productions, namely self-explanations and answers to open-ended questions. Previous work relied on Cohesion Network Analysis to model a cohesion graph composed of semantic links between multiple reference texts and student productions. From this graph, a set of features was derived and used to build machine learning models to predict student comprehension scores. In this paper, we build on top of the previous study by: a) extending the CNA graph by adding new semantic links targeting specific sentences that should have been captured within the learner’s productions, and b) cleaning the self-explanations by eliminating frozen expression, as well as entries which seemed nearly identical to the source text. The results are in line with the conclusions of the previous study regarding the importance of both self-explanations and question answers in predicting the students’ reading comprehension level. They also outline the limitations of our feature generation approach, in which no substantial improvements were detected, despite adding more fine-grained features.

AB - Theories of discourse argue that comprehension depends on the coherence of the learner’s mental representation. Our aim is to create a reliable automated representation to estimate readers’ level of comprehension based on different productions, namely self-explanations and answers to open-ended questions. Previous work relied on Cohesion Network Analysis to model a cohesion graph composed of semantic links between multiple reference texts and student productions. From this graph, a set of features was derived and used to build machine learning models to predict student comprehension scores. In this paper, we build on top of the previous study by: a) extending the CNA graph by adding new semantic links targeting specific sentences that should have been captured within the learner’s productions, and b) cleaning the self-explanations by eliminating frozen expression, as well as entries which seemed nearly identical to the source text. The results are in line with the conclusions of the previous study regarding the importance of both self-explanations and question answers in predicting the students’ reading comprehension level. They also outline the limitations of our feature generation approach, in which no substantial improvements were detected, despite adding more fine-grained features.

KW - Cohesion Network Analysis

KW - Multi-document comprehension modeling

KW - Natural Language Processing

UR - http://www.scopus.com/inward/record.url?scp=85088569410&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85088569410&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-52240-7_42

DO - 10.1007/978-3-030-52240-7_42

M3 - Conference contribution

AN - SCOPUS:85088569410

SN - 9783030522391

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 228

EP - 233

BT - Artificial Intelligence in Education - 21st International Conference, AIED 2020, Proceedings

A2 - Bittencourt, Ig Ibert

A2 - Cukurova, Mutlu

A2 - Luckin, Rose

A2 - Muldner, Kasia

A2 - Millán, Eva

PB - Springer

T2 - 21st International Conference on Artificial Intelligence in Education, AIED 2020

Y2 - 6 July 2020 through 10 July 2020

ER -

Extended Multi-document Cohesion Network Analysis Centered on Comprehension Prediction

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this