|
Use this identifier to cite or link to this item:
http://hdl.handle.net/2117/11269
|
| Citation: | Khalilov, M. [et al.]. English-Latvian SMT: the challenge of translating into a free word order language. In: International Workshop on Spoken Languages Technologies for Under-resourced Languages. "The second International Workshop on Spoken Languages Technologies for Under-Resourced Languages". Penang: 2010, p. 87-94. |
| Title: | English-Latvian SMT: the challenge of translating into a free word order language |
| Author: | Khalilov, Maxim; Rodríguez Fonollosa, José Adrián; Skadina, Inguna; Bralitis, Edgar; Pretkalnina, Lauma |
| Date: | 2010 |
| Document type: | Conference lecture |
| Abstract: | This paper presents a comparative study of two approaches to
statistical machine translation (SMT) and their application to
the task of English-to-Latvian translation, which is still an open
research line in the field of automatic translation.
We consider a state-of-the-art phrase-based SMT system and an
alternative N-gram-based SMT system. The major differences
between these two approaches lie in the distinct representations
of bilingual units, which are the components of the
bilingual model driving the translation process, and in the statistical
modeling of the translation context.
Latvian is a rather free word order language, which poses
additional difficulties for the translation process. We contrast
different reordering models and investigate how well they
deal with the word ordering issue.
Moving beyond the automatic scores of translation quality
that are classically presented in MT research papers, we contribute
a manual error analysis of the MT systems' output
that helps to shed light on the advantages and disadvantages
of the SMT systems under consideration and to identify the most
prominent sources of errors typical of both SMT systems. |
| URI: | http://hdl.handle.net/2117/11269 |
| Publisher's version: | http://staff.science.uva.nl/~khalilov/Docs/sltu10-CR-3.pdf |
| Appears in collections: | Others. Submissions from DRAC; Department of Signal Theory and Communications. Conference papers/presentations; VEU - Speech Processing Group. Conference papers/presentations
|
|
Reproduction, transformation, distribution and public communication of this work are prohibited. Reproduction for private use is permitted in any case, provided that the copy made is not used collectively or for profit (art. 31.2 of Royal Legislative Decree 1/1996, of 12 April, approving the Consolidated Text of the Intellectual Property Law, http://bibliotecnica.upc.es/sepi/legislacio.asp).
For any use other than the one permitted, contact: sepi@upc.edu
|