Mostra el registre d'ítem simple
PHAST: Spoken document retrieval based on sequence alignment
dc.contributor.author | Comas Umbert, Pere Ramon |
dc.contributor.author | Turmo Borras, Jorge |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics |
dc.date.accessioned | 2016-05-02T08:59:13Z |
dc.date.available | 2016-05-02T08:59:13Z |
dc.date.issued | 2008-01 |
dc.identifier.citation | Comas, P.R., Turmo, J. "PHAST: Spoken document retrieval based on sequence alignment". 2008. |
dc.identifier.uri | http://hdl.handle.net/2117/86459 |
dc.description.abstract | This paper presents a new approach to spoken document information retrieval for spontaneous speech corpora. Classical approach to this problem is the use of an automatic speech recognizer (ASR) combined with standard information retrieval techniques, based on terms or n-grams. However, state-of-the-art large vocabulary continuous ASRs produce transcripts of spontaneous speech with a word error rate of 25% or higher, which is a drawback for retrieval techniques based on terms or n-grams. In order to overcome such a limitation, our method is based on a sequence alignment algorithm drawn from the field of bioinformatics to search |
dc.format.extent | 10 p. |
dc.language.iso | eng |
dc.relation.ispartofseries | LSI-08-2-R |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial |
dc.subject.other | Information retrieval |
dc.subject.other | Spoken document retrieval |
dc.subject.other | Approximate matching |
dc.title | PHAST: Spoken document retrieval based on sequence alignment |
dc.type | External research report |
dc.contributor.group | Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural |
dc.rights.access | Open Access |
local.identifier.drac | 1872237 |
dc.description.version | Postprint (published version) |
local.citation.author | Comas, P.R.; Turmo, J. |
Fitxers d'aquest items
Aquest ítem apareix a les col·leccions següents
-
Reports de recerca [1.107]
-
Reports de recerca [88]