PHAST: Spoken document retrieval based on sequence alignment

Comas Umbert, Pere Ramon; Turmo Borras, Jorge

dc.contributor.author	Comas Umbert, Pere Ramon
dc.contributor.author	Turmo Borras, Jorge
dc.contributor.other	Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics
dc.date.accessioned	2016-05-02T08:59:13Z
dc.date.available	2016-05-02T08:59:13Z
dc.date.issued	2008-01
dc.identifier.citation	Comas, P.R., Turmo, J. "PHAST: Spoken document retrieval based on sequence alignment". 2008.
dc.identifier.uri	http://hdl.handle.net/2117/86459
dc.description.abstract	This paper presents a new approach to spoken document information retrieval for spontaneous speech corpora. Classical approach to this problem is the use of an automatic speech recognizer (ASR) combined with standard information retrieval techniques, based on terms or n-grams. However, state-of-the-art large vocabulary continuous ASRs produce transcripts of spontaneous speech with a word error rate of 25% or higher, which is a drawback for retrieval techniques based on terms or n-grams. In order to overcome such a limitation, our method is based on a sequence alignment algorithm drawn from the field of bioinformatics to search
dc.format.extent	10 p.
dc.language.iso	eng
dc.relation.ispartofseries	LSI-08-2-R
dc.subject	Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial
dc.subject.other	Information retrieval
dc.subject.other	Spoken document retrieval
dc.subject.other	Approximate matching
dc.title	PHAST: Spoken document retrieval based on sequence alignment
dc.type	External research report
dc.contributor.group	Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural
dc.rights.access	Open Access
local.identifier.drac	1872237
dc.description.version	Postprint (published version)
local.citation.author	Comas, P.R.; Turmo, J.

Fitxers d'aquest items

Nom:: R08-2.pdf
Mida:: 282,6Kb
Format:: PDF

Visualitza/Obre

Aquest ítem apareix a les col·leccions següents

Reports de recerca [1.107]
Reports de recerca [88]

Mostra el registre d'ítem simple

UPCommons. Portal del coneixement obert de la UPC

PHAST: Spoken document retrieval based on sequence alignment

Fitxers d'aquest items

Aquest ítem apareix a les col·leccions següents

Explora