Automated Scoring of Students’ Small-Group Discussions to Assess Reading Ability

Audra E. Kosh, Jeffrey A. Greene, P. Karen Murphy, Hal Burdick, Carla Firetto, Jeff Elmore

Research output: Contribution to journal › Article

Abstract

We explored the feasibility of using automated scoring to assess upper-elementary students’ reading ability through analysis of transcripts of students’ small-group discussions about texts. Participants included 35 fourth-grade students across two classrooms that engaged in a literacy intervention called Quality Talk. During the course of one school year, data were collected at 10 time points for a total of 327 student-text encounters, with a different text discussed at each time point. To explore the possibility of automated scoring, we considered which quantitative discourse variables (e.g., variables to measure language sophistication and latent semantic analysis variables) were the strongest predictors of scores on a multiple-choice and constructed-response reading comprehension test. Convergent validity evidence was collected by comparing automatically calculated quantitative discourse features to scores on a reading fluency test. After examining a variety of discourse features using multilevel modeling, results showed that measures of word rareness and word diversity were the most promising variables to use in automated scoring of students’ discussions.
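The abstract identifies word rareness and word diversity as the most promising discourse features for automated scoring. As a purely illustrative sketch, and not the authors' implementation, the Python snippet below shows one plausible way to compute such transcript-level features: word diversity operationalized as a type-token ratio, and word rareness as the mean negative log frequency of words against a reference frequency list. The tokenizer, the REFERENCE_FREQUENCIES table, and the DEFAULT_FREQUENCY floor are assumed placeholders, not resources named in the study.

import math
import re

# Hypothetical relative word frequencies from some reference corpus
# (placeholder values; the study's actual frequency resource is not specified here).
REFERENCE_FREQUENCIES = {
    "the": 0.05,
    "because": 0.002,
    "character": 0.0004,
    "evidence": 0.0002,
}
DEFAULT_FREQUENCY = 1e-6  # assumed floor for words absent from the reference list


def tokenize(transcript: str) -> list[str]:
    """Lowercase the transcript and keep alphabetic word tokens."""
    return re.findall(r"[a-z']+", transcript.lower())


def word_diversity(tokens: list[str]) -> float:
    """Type-token ratio: number of distinct words divided by total words."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def word_rareness(tokens: list[str]) -> float:
    """Mean negative log relative frequency; higher values indicate rarer words."""
    if not tokens:
        return 0.0
    total = sum(-math.log(REFERENCE_FREQUENCIES.get(t, DEFAULT_FREQUENCY)) for t in tokens)
    return total / len(tokens)


if __name__ == "__main__":
    turn = "I think the character changed because the evidence in the text shows it."
    tokens = tokenize(turn)
    print(f"diversity = {word_diversity(tokens):.3f}, rareness = {word_rareness(tokens):.3f}")

In the study, transcript-level features of this kind served as predictors of reading comprehension scores in multilevel models; the exact feature definitions, reference corpus, and model specifications are described in the article itself.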

Original language: English (US)
Pages (from-to): 20-34
Number of pages: 15
Journal: Educational Measurement: Issues and Practice
Volume: 37
Issue number: 2
DOI: 10.1111/emip.12174
State: Published - Jun 1 2018
Externally published: Yes

Keywords

  • assessment
  • automated scoring
  • reading ability

ASJC Scopus subject areas

  • Education

Cite this

Kosh, A. E., Greene, J. A., Murphy, P. K., Burdick, H., Firetto, C., & Elmore, J. (2018). Automated Scoring of Students’ Small-Group Discussions to Assess Reading Ability. Educational Measurement: Issues and Practice, 37(2), 20-34. https://doi.org/10.1111/emip.12174
