Mostra el registre d'ítem simple
Correcting input noise in SMT as a char-based translation problem
dc.contributor.author | Formiga Fanals, Lluís |
dc.contributor.author | Rodríguez Fonollosa, José Adrián |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
dc.date.accessioned | 2013-03-13T15:57:38Z |
dc.date.available | 2013-03-13T15:57:38Z |
dc.date.created | 2012-10-31 |
dc.date.issued | 2012-10-31 |
dc.identifier.citation | Formiga, L.; Fonollosa, José A. R. "Correcting input noise in SMT as a char-based translation problem". 2012. |
dc.identifier.uri | http://hdl.handle.net/2117/18275 |
dc.description.abstract | Misspelled words have a direct impact on the final quality obtained by Statistical Machine Translation (SMT) systems as the input becomes noisy and unpredictable. This paper presents some improvement strategies for translating real-life noisy input. The proposed strategies are based on a preprocessing step consisting in a character-based translator. |
dc.format.extent | 13 p. |
dc.language.iso | eng |
dc.relation.ispartofseries | TALP-2012-OCT-31 |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Spain |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
dc.subject | Àrees temàtiques de la UPC::Ensenyament i aprenentatge::Aprenentatge de llengües |
dc.subject | Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic |
dc.subject.lcsh | Natural language processing (Computer science) |
dc.subject.lcsh | Signal theory (Telecommunication) |
dc.title | Correcting input noise in SMT as a char-based translation problem |
dc.type | External research report |
dc.subject.lemac | Tractament del llenguatge natural (Informàtica) |
dc.subject.lemac | Senyal, Teoria del (Telecomunicació) |
dc.contributor.group | Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | http://nlp.lsi.upc.edu/publications/papers/misspelling_techrep_oct2012.pdf |
dc.rights.access | Open Access |
local.identifier.drac | 11024794 |
dc.description.version | Preprint |
local.citation.author | Formiga, L.; Fonollosa, José A. R. |
local.citation.publicationName | Correcting input noise in SMT as a char-based translation problem |
Fitxers d'aquest items
Aquest ítem apareix a les col·leccions següents
-
Reports de recerca [42]
-
Reports de recerca [198]