To the top

Page Manager: Webmaster
Last update: 9/11/2012 3:13 PM

Tell a friend about this page
Print version

Word Type Grouping in Swe… - University of Gothenburg, Sweden Till startsida
To content Read more about how we use cookies on

Word Type Grouping in Swedish Secondary School Textbooks - An Inventory of Words from a Second Language Perspective

Conference paper
Authors Inger Lindberg
Sofie Johansson Kokkinakis
Published in Multilingualism, Proceedings of the 23rd Scandinavian Conference of Linguistics
Volume Studia Linguistica Upsaliensia 8
Issue Acta Universitatis Upsaliensis
Pages 337-339
ISBN 978-91-554-7594-9
ISSN 1652-1366
Publication year 2009
Published at Department of Swedish
Pages 337-339
Language en
Keywords Language technology, Word type grouping, Lexical analysis, Secondary School Textbooks, Second Language
Subject categories Language Technology (Computational Linguistics), Bilingualism, Swedish language


When investigating questions like “What characterizes the vocabulary of school book texts in general and in different subjects at different levels?” and “Which words present particular problems for students studying in their second language?” school book corpora offer important empirical data. We have therefore compiled and analyzed a corpus of secondary school textbooks of one million words (OrdiL) with texts from eight different school subjects contributed by three different publishers (Lindberg & Johansson Kokkinakis 2007). In order to perform comparative studies and analyses on the vocabulary of these texts we have performed computer-based disambiguation and semantic analyses at a lemma- and lexeme-level. Moreover, we have carried out word frequency analyses, including comparative analyses of relative frequency and dispersion between different subjects. To be able to distinguish characteristic features of school book vocabulary we use an equally sized reference corpus of easy-to-read texts as a point of comparison. The vocabulary of this corpus is representative of the kind of written language that children at this age can be exposed to out of school. To identify and categorize various types of words in textbooks, we propose a model based on earlier research by Coxhead & Nation (2001) and Hyland & Tse (2007) modified to account for the word types of potential difficulty for second language secondary school students. Based on analyses of frequency and range we propose two main groups of words divided into at least two subgroups. To explore the question “Which words present particular problems for students studying in their second language?” we also need empirical data on students’ actual vocabulary difficulties. For this purpose we are constructing a number of computerized tests focusing various types of lexical knowledge in relation to the different word types. These tests will be administered to students with different background characteristics to make comparisons of quantitative as well as qualitative aspects of vocabulary knowledge between different groups of students possible.

Page Manager: Webmaster|Last update: 9/11/2012

The University of Gothenburg uses cookies to provide you with the best possible user experience. By continuing on this website, you approve of our use of cookies.  What are cookies?