
Page Manager: Webmaster
Last update: 9/11/2012 3:13 PM



SWORD: Towards Cutting-Edge Swedish Word Processing

Conference contribution
Authors Fabienne Cap
Yvonne Adesam
Lars Ahrenberg
Lars Borin
Gerlof Bouma
Markus Forsberg
Viggo Kann
Robert Östling
Aaron Smith
Mats Wirén
Joakim Nivre
Published in Proceedings of the Sixth Swedish Language Technology Conference (SLTC), Umeå University, 17-18 November 2016
Publication year 2016
Published at Department of Swedish
Language en
Subject categories Swedish language, General Language Studies and Linguistics, Language Technology (Computational Linguistics)


Despite many years of research on Swedish language technology, there is still no well-documented standard for Swedish word processing covering the whole spectrum from low-level tokenization to morphological analysis and disambiguation. SWORD is a new initiative within the SWE-CLARIN consortium aiming to develop documented standards for Swedish word processing. In this paper, we report on a pilot study of Swedish tokenization, in which we compare the output of six different tokenizers on four different text types. For one text type (Wikipedia articles), we also compare the tokenizer output against the tokenization produced by six manual annotators.

