Text Recognition for Nepalese Manuscripts in Pracalit Script

Main author: O’Neill, Alexander James
Other authors: Hill, Nathan W.
Format: Journal Article           
Online access: Click here to view record


id eprints-39996
recordtype eprints
institution SOAS, University of London
collection SOAS Research Online
language English
language_search English
description This dataset is a model for handwritten text recognition (HTR) of Sanskrit and Newar Nepalese manuscripts in Pracalit script. This paper introduces the state of the field in Newar literature, Newar manuscripts, and HTR engines. It explains our methodology for developing the requisite ground truth consisting of manuscript images and corresponding transcriptions, training our model with a PyLAia engine, and this model’s limitations. This dataset shared on Zenodo can be used by anyone working with manuscripts in Pracalit script, which will benefit the fields of Indology and Newar studies, as well as historical and linguistic analysis.
format Journal Article
author O’Neill, Alexander James
author_facet O’Neill, Alexander James
Hill, Nathan W.
authorStr O’Neill, Alexander James
author_letter O’Neill, Alexander James
author2 Hill, Nathan W.
author2Str Hill, Nathan W.
title Text Recognition for Nepalese Manuscripts in Pracalit Script
publisher Ubiquity Press
publishDate 2022
url https://eprints.soas.ac.uk/39996/