Page manager: Web editorial office
Page updated: 2012-09-11 15:12


From Distributions to Labels: A Lexical Proficiency Analysis using Learner Corpora

Paper in proceedings
Authors David Alfter
Yuri Bizzoni
Anders Agebjörn
Elena Volodina
Ildikó Pilán
Published in Linköping Electronic Conference Proceedings
ISBN 978-91-7685-633-8
ISSN 1650-3686
Publisher Linköping University Electronic Press
Year of publication 2016
Published at Department of Swedish
Department of Philosophy, Linguistics and Theory of Science
Language en
Links www.ep.liu.se/ecp/article.asp?issue...
Keywords Lexical complexity, Common European Framework of Reference, Mapping, Semantic space
Subject categories Computational linguistics, Swedish language

Abstract

In this work we look at how information from second-language learner essay corpora can be used to evaluate unseen learner essays. Using a corpus of learner essays that have been graded by well-trained human assessors on the CEFR scale, we extract a list of word distributions over CEFR levels. For the analysis of unseen essays, we want to use this word list to map each word to a so-called target CEFR level. However, mapping from a distribution to a single label is not trivial. We also investigate how to evaluate the mapping from distribution to label. We show that the distributional profile of words from the essays, informed with the essays’ levels, consistently overlaps with our frequency-based method, in the sense that words assigned the same proficiency level by our mapping tend to cluster together in a semantic space. In the absence of a gold standard, this information can be used to see how often a word is associated with the same level in two different models. The semantic space also provides a similarity measure that can show which words are more central to a given level and which are more peripheral.
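To make the distribution-to-label step concrete, the following is a minimal sketch and not the authors' method, which the abstract does not specify. It assumes essays arrive as `(level, tokens)` pairs, and every name in it (`word_distributions`, `label_by_peak`, `label_by_first_significant`, the `threshold` value) is hypothetical. It builds a per-word distribution over CEFR levels and then applies two simple candidate mappings, which can disagree on the same word.

```python
from collections import Counter, defaultdict

CEFR_LEVELS = ["A1", "A2", "B1", "B2", "C1", "C2"]


def word_distributions(essays):
    """Build word -> distribution over CEFR levels (hypothetical helper).

    `essays` is an iterable of (level, tokens) pairs, where `level` is the
    grade assigned to the essay and `tokens` is its list of words.
    """
    counts = defaultdict(Counter)  # word -> Counter(level -> raw count)
    totals = Counter()             # level -> total number of tokens
    for level, tokens in essays:
        for token in tokens:
            counts[token][level] += 1
            totals[level] += 1

    distributions = {}
    for word, per_level in counts.items():
        # Relative frequency per level, so that levels with more essay
        # text do not dominate the distribution.
        relative = [
            per_level[level] / totals[level] if totals[level] else 0.0
            for level in CEFR_LEVELS
        ]
        norm = sum(relative)
        distributions[word] = [value / norm for value in relative]
    return distributions


def label_by_peak(dist):
    """Assumed strategy 1: the level where the word is relatively most frequent."""
    best = max(range(len(dist)), key=dist.__getitem__)
    return CEFR_LEVELS[best]


def label_by_first_significant(dist, threshold=0.2):
    """Assumed strategy 2: the lowest level whose share reaches `threshold`.

    Falls back to the peak level if no level reaches the threshold.
    """
    for level, share in zip(CEFR_LEVELS, dist):
        if share >= threshold:
            return level
    return label_by_peak(dist)


if __name__ == "__main__":
    toy_essays = [
        ("A1", ["jag", "heter", "anna"]),
        ("A2", ["jag", "bor", "här"]),
        ("B1", ["jag", "anser", "att"]),
    ]
    dists = word_distributions(toy_essays)
    for word in ("jag", "anser"):
        print(word, label_by_peak(dists[word]),
              label_by_first_significant(dists[word]))
```

The two strategies illustrate why the mapping is not trivial: a word with some early use but a later peak receives a lower label under the first-significant rule than under the peak rule, and nothing in the distribution alone says which is right.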
