To the top

Page Manager: Webmaster
Last update: 9/11/2012 3:13 PM

Tell a friend about this page
Print version

A Multi-domain Corpus of … - University of Gothenburg, Sweden Till startsida
To content Read more about how we use cookies on

A Multi-domain Corpus of Swedish Word Sense Annotation

Conference paper
Authors Richard Johansson
Yvonne Adesam
Gerlof Bouma
Karin Hedberg
Published in 10th edition of the Language Resources and Evaluation Conference, 23-28 May 2016, Portorož (Slovenia)
ISBN 978-2-9517408-9-1
Publisher European Language Resources Association
Publication year 2016
Published at Department of Swedish
Department of Computer Science and Engineering (GU)
Language en
Keywords ordbetydelsedisambiguering, word sense disambiguation, lexical semantics, corpora, annotation
Subject categories Linguistics, Computational linguistics, Language Technology (Computational Linguistics)


We describe the word sense annotation layer in Eukalyptus, a freely available five-domain corpus of contemporary Swedish with several annotation layers. The annotation uses the SALDO lexicon to define the sense inventory, and allows word sense annotation of compound segments and multiword units. We give an overview of the new annotation tool developed for this project, and finally present an analysis of the inter-annotator agreement between two annotators.

Page Manager: Webmaster|Last update: 9/11/2012

The University of Gothenburg uses cookies to provide you with the best possible user experience. By continuing on this website, you approve of our use of cookies.  What are cookies?