A part-of-speech (POS) lexicon of Classical Tibetan for NLP

Main author: Hill, Nathan W.
Other authors: Garrett, Edward
Format: Datasets           
Online access: Click here to view record


id eprints-24153
recordtype eprints
institution SOAS, University of London
collection SOAS Research Online
language English
language_search English
topic PI Oriental languages and literatures
PL Languages and literatures of Eastern Asia, Africa, Oceania
description This part-of-speech (POS) lexicon of Classical Tibetan was prepared in the course of the research project 'Tibetan in Digital Communication' (2012-2015) hosted at SOAS, University of London and funded by the UK's Arts and Humanities Research Council (grant code: AH/J00152X/1). The data for verbs comes from a digitized version of A Lexicon of Tibetan Verb Stems as Reported by the Grammatical Tradition (Munich: Bayerische Akademie der Wissenschaften, 2010) by Nathan W. Hill. Otherwise data comes from the manually part-of-speech tagged training data produced by the corpus and a few lexical items specifically added by hand to improve rule based tagging
format Datasets
author Hill, Nathan W.
author_facet Hill, Nathan W.
Garrett, Edward
authorStr Hill, Nathan W.
author_letter Hill, Nathan W.
author2 Garrett, Edward
author2Str Garrett, Edward
title A part-of-speech (POS) lexicon of Classical Tibetan for NLP
publisher Zenodo
publishDate 2017
url https://eprints.soas.ac.uk/24153/