Joint Bayesian Morphology learning of Dravidian Languages
Visualitza/Obre
Estadístiques de LA Referencia / Recolecta
Inclou dades d'ús des de 2022
Cita com:
hdl:2117/83374
Tipus de documentText en actes de congrés
Data publicació2016
Condicions d'accésAccés obert
Llevat que s'hi indiqui el contrari, els
continguts d'aquesta obra estan subjectes a la llicència de Creative Commons
:
Reconeixement-NoComercial-SenseObraDerivada 3.0 Espanya
Abstract
In this paper a methodology for learning the complex agglutinative morphology of some Indian languages using Adaptor Grammars and morphology rules is presented. Adaptor grammars are a compositional Bayesian framework for grammatical inference, where we define a morphological grammar for agglutinative languages and morphological boundaries are inferred from a plain text corpus. Once morphological segmentations are produce,
regular expressions for sandhi rules and orthography are applied to achieve the final segmentation. We test our algorithm in the case of two complex languages from the Dravidian family. The same morphological model and results are evaluated comparing to other state-of-the art unsupervised morphology learning systems
CitacióKumar, A., Padro, L., Oliver, A. Joint Bayesian Morphology learning of Dravidian Languages. A: Joint Workshop on Language Technology for Closely Related Languages, Varieties and Dialects. "RICTA 2015: Proceedings of the Joint Workshop on Language Technology for Closely Related Languages, Varieties and Dialects: Hissan, Bulgaria: September 10, 2015: proceedings book". Hissar: 2016.
ISBN978-954-452-031-1
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
kumar15a.pdf | 161,2Kb | Visualitza/Obre |