Sidansvarig: Webbredaktion
Sidan uppdaterades: 2012-09-11 15:12
Författare |
Lars Borin Luis Nieto Piña Richard Johansson |
---|---|
Publicerad i | Linköping Electronic Conference Proceedings. Semantic resources and semantic annotation for Natural Language Processing and the Digital Humanities. Workshop at NODALIDA , May 11, 13-18 2015, Vilnius |
Volym | 112 |
Sidor | 1-11 |
ISBN | 978-91-7519-049-5 |
ISSN | 1650-3686 |
Publiceringsår | 2015 |
Publicerad vid |
Institutionen för svenska språket |
Sidor | 1-11 |
Språk | en |
Länkar |
www.ep.liu.se/ecp/112/002/ecp151120... |
Ämnesord | language technology, thesaurus, word sense disambiguation, inter-resource mapping, corpus-based word semantics, lexicon based word semantics, SALDO, Roget |
Ämneskategorier | Språkteknologi (språkvetenskaplig databehandling), Datorlingvistik, Lingvistik |
Lexical-semantic knowledges sources are a stock item in the language technologist’s toolbox, having proved their practical worth in many and diverse natural language processing (NLP) applications. In linguistics, lexical semantics comes in many flavors, but in the NLP world, wordnets reign more or less supreme. There has been some promising work utilizing Roget-style thesauruses instead, but wider experimentation is hampered by the limited availability of such resources. The work presented here is a first step in the direction of creating a freely available Roget-style lexical resource for modern Swedish. Here, we explore methods for automatic disambiguation of interresource mappings with the longer-term goal of utilizing similar techniques for automatic enrichment of lexical-semantic resources.