Unsupervised learning of agglutinated morphology using nested Pitman-Yor process based morpheme induction algorithm

View/Open
Document typeConference report
Defense date2015
Rights accessOpen Access
Abstract
In this paper we describe a method to morphologically segment highly agglutinating and inflectional languages from Dravidian family. We use nested Pitman-Yor process to segment long agglutinated words into their basic components, and use a corpus based morpheme induction algorithm to perform morpheme segmentation. We test our method in two languages, Malayalam and Kannada and compare the results with Morfessor
CitationKumar, A., Padro, L., Oliver, A. Unsupervised learning of agglutinated morphology using nested Pitman-Yor process based morpheme induction algorithm. A: International Conference on Asian Language Processing. "IALP 2015: 19th International Conference on Asian Language Processing: Suzhow, China: October 24-25, 2015: proceedings book". Suzhou: 2015.
Files | Description | Size | Format | View |
---|---|---|---|---|
kumar15c.pdf | 136,7Kb | View/Open |
Except where otherwise noted, content on this work
is licensed under a Creative Commons license
:
Attribution-NonCommercial-NoDerivs 3.0 Spain