Mostra el registre d'ítem simple

dc.contributor.authorKumar, Arun
dc.contributor.authorPadró, Lluís
dc.contributor.authorOliver González, Antoni
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Ciències de la Computació
dc.date.accessioned2016-02-24T10:55:34Z
dc.date.available2016-02-24T10:55:34Z
dc.date.issued2016
dc.identifier.citationKumar, A., Padro, L., Oliver, A. Joint Bayesian Morphology learning of Dravidian Languages. A: Joint Workshop on Language Technology for Closely Related Languages, Varieties and Dialects. "RICTA 2015: Proceedings of the Joint Workshop on Language Technology for Closely Related Languages, Varieties and Dialects: Hissan, Bulgaria: September 10, 2015: proceedings book". Hissar: 2016.
dc.identifier.isbn978-954-452-031-1
dc.identifier.urihttp://hdl.handle.net/2117/83374
dc.description.abstractIn this paper a methodology for learning the complex agglutinative morphology of some Indian languages using Adaptor Grammars and morphology rules is presented. Adaptor grammars are a compositional Bayesian framework for grammatical inference, where we define a morphological grammar for agglutinative languages and morphological boundaries are inferred from a plain text corpus. Once morphological segmentations are produce, regular expressions for sandhi rules and orthography are applied to achieve the final segmentation. We test our algorithm in the case of two complex languages from the Dravidian family. The same morphological model and results are evaluated comparing to other state-of-the art unsupervised morphology learning systems
dc.language.isoeng
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subjectÀrees temàtiques de la UPC::Informàtica::Llenguatges de programació
dc.subjectÀrees temàtiques de la UPC::Informàtica
dc.subject.lcshNatural language processing (Computer science)
dc.titleJoint Bayesian Morphology learning of Dravidian Languages
dc.typeConference report
dc.subject.lemacTractament del llenguatge natural (Informàtica)
dc.contributor.groupUniversitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural
dc.rights.accessOpen Access
local.identifier.drac17502975
dc.description.versionPostprint (published version)
local.citation.authorKumar, A.; Padro, L.; Oliver, A.
local.citation.contributorJoint Workshop on Language Technology for Closely Related Languages, Varieties and Dialects
local.citation.pubplaceHissar
local.citation.publicationNameRICTA 2015: Proceedings of the Joint Workshop on Language Technology for Closely Related Languages, Varieties and Dialects: Hissan, Bulgaria: September 10, 2015: proceedings book


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple