To the top

Page Manager: Webmaster
Last update: 9/11/2012 3:13 PM

Tell a friend about this page
Print version

Many a little makes a mic… - University of Gothenburg, Sweden Till startsida
To content Read more about how we use cookies on

Many a little makes a mickle - infrastructure component reuse for a massively multilingual linguistic study

Conference paper
Authors Lars Borin
Shafqat Virk
Anju Saxena
Published in Selected papers from the CLARIN Annual Conference 2017, Budapest, 18–20 September 2017
ISBN 978-91-7685-273-6
ISSN 1650-3686
Publisher Linköping University Electronic Press
Place of publication Linköping
Publication year 2018
Published at Department of Swedish
Language en
Keywords corpus infrastructure, lexicon infrastructure, Swe-Clarin, large-scale comparative linguistics, linguistic database, language typology, areal linguistics, genetic linguistics, South Asian languages, language technology
Subject categories Other languages, Linguistics, General Language Studies and Linguistics, Computational linguistics, Language Technology (Computational Linguistics)


We present ongoing work aiming at turning the linguistic material available in Grierson’s classical Linguistic Survey of India (LSI) into a digital language resource, a database suitable for a broad array of linguistic investigations of the languages of South Asia and studies relating to language typology and contact linguistics. The project has two concrete main aims: (1) to conduct a linguistic investigation of the claim that South Asia constitutes a linguistic area; (2) to develop state-of-the-art language technology for automatically extracting the relevant information from the text of the LSI. In this presentation we focus on how, in the first part of the project, a number of existing research infrastructure components provided by Swe-Clarin, the Swedish CLARIN consortium, have been ‘recycled’ in order to allow the linguists involved in the project to quickly orient themselves in the vast LSI material, and to be able to provide input to the language technologists designing the tools for information extraction from the descriptive grammars.

Page Manager: Webmaster|Last update: 9/11/2012

The University of Gothenburg uses cookies to provide you with the best possible user experience. By continuing on this website, you approve of our use of cookies.  What are cookies?