Computational assessment of lexical differences in L1 and L2 writing

Scott A. Crossley, Danielle S. McNamara

Research output: Contribution to journalArticlepeer-review

133 Scopus citations

Abstract

The purpose of this paper is to provide a detailed analysis of how lexical differences related to cohesion and connectionist models can distinguish first language (L1) writers of English from second language (L2) writers of English. Key to this analysis is the use of the computational tool Coh-Metrix, which measures cohesion and text difficulty at various levels of language, discourse, and conceptual analysis, and a statistical method known as discriminant function analysis. Results show that L1 and L2 written texts vary in several dimensions related to the writer's use of lexical choices. These dimensions correlate to lexical depth of knowledge, variation, and sophistication. These findings, together with the relevance of the new computational tools for the text analysis used in the study, are discussed.

Original languageEnglish (US)
Pages (from-to)119-135
Number of pages17
JournalJournal of Second Language Writing
Volume18
Issue number2
DOIs
StatePublished - Jun 2009
Externally publishedYes

Keywords

  • Cohesion
  • Computational linguistics
  • Corpus linguistics
  • Lexical networks
  • Lexical proficiency
  • Second language writing

ASJC Scopus subject areas

  • Language and Linguistics
  • Education
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Computational assessment of lexical differences in L1 and L2 writing'. Together they form a unique fingerprint.

Cite this