Monolingual and bilingual spanish-catalan speech recognizers developed from SpeechDat databases

Mariño Acebal, José Bernardo; Padrell, J; Moreno Bilbao, M. Asunción; Nadeu Camprubí, Climent

dc.contributor.author	Mariño Acebal, José Bernardo
dc.contributor.author	Padrell, J
dc.contributor.author	Moreno Bilbao, M. Asunción
dc.contributor.author	Nadeu Camprubí, Climent
dc.contributor.other	Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned	2017-04-27T17:30:55Z
dc.date.available	2017-04-27T17:30:55Z
dc.date.issued	2000
dc.identifier.citation	Mariño, J.B., Padrell, J., Moreno, A., Nadeu, C. Monolingual and bilingual spanish-catalan speech recognizers developed from SpeechDat databases. A: Workshop on Speech Recognition Based on Very Large Telephone Speech Databases. "XLDB- Very Large Telephone Speech Databases: Proceedings". Atenas: C. Draxler, 2000, p. 57-61.
dc.identifier.uri	http://hdl.handle.net/2117/103811
dc.description.abstract	Under the SpeechDat specifications, the Spanish member of SpeechDat consortium has recorded a Catalan database that includes one thousand speakers. This communication describes some experimental work that has been carried out using both the Spanish and the Catalan speech material. A speech recognition system has been trained for the Spanish language using a selection of the phonetically balanced utterances from the 4500 SpeechDat training sessions. Utterances with mispronounced or incomplete words and with intermittent noise were discarded. A set of 26 allophones was selected to account for the Spanish sounds and clustered demiphones have been used as context dependent sub-lexical units. Following the same methodology, a recognition system was trained from the Catalan SpeechDat database. Catalan sounds were described with 32 allophones. Additionally, a bilingual recognition system was built for both the Spanish and Catalan languages. By means of clustering techniques, the suitable set of allophones to cover simultaneously both languages was determined. Thus, 33 allophones were selected. The training material was built by the whole Catalan training material and the Spanish material coming from the Eastern region of Spain (the region where Catalan is spoken). The performance of the Spanish, Catalan and bilingual systems were assessed under the same framework. The Spanish system exhibits a significantly better performance than the rest of systems due to its better training. The bilingual system provides an equivalent performance to that afforded by both language specific systems trained with the Eastern Spanish material or the Catalan SpeechDat corpus.
dc.format.extent	5 p.
dc.language.iso	eng
dc.publisher	C. Draxler
dc.rights	Attribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject	Àrees temàtiques de la UPC::Enginyeria de la telecomunicació
dc.subject.lcsh	Telecommunication
dc.title	Monolingual and bilingual spanish-catalan speech recognizers developed from SpeechDat databases
dc.type	Conference report
dc.subject.lemac	Telecomunicació
dc.contributor.group	Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.description.peerreviewed	Peer Reviewed
dc.rights.access	Open Access
local.identifier.drac	2415029
dc.description.version	Postprint (published version)
local.citation.author	Mariño, J.B.; Padrell, J.; Moreno, A.; Nadeu, C.
local.citation.contributor	Workshop on Speech Recognition Based on Very Large Telephone Speech Databases
local.citation.pubplace	Atenas
local.citation.publicationName	XLDB- Very Large Telephone Speech Databases: Proceedings
local.citation.startingPage	57
local.citation.endingPage	61

Fitxers d'aquest items

Nom:: 10.1.1.77.3578.pdf
Mida:: 29,75Kb
Format:: PDF

Visualitza/Obre

Aquest ítem apareix a les col·leccions següents

Ponències/Comunicacions de congressos [437]
Ponències/Comunicacions de congressos [3.327]

Mostra el registre d'ítem simple

UPCommons. Portal del coneixement obert de la UPC

Monolingual and bilingual spanish-catalan speech recognizers developed from SpeechDat databases

Fitxers d'aquest items

Aquest ítem apareix a les col·leccions següents

Explora