DSpace DSpace UPC
 Català   Castellano   English  

E-prints UPC >
Enginyeria electrònica i telecomunicacions >
VEU - Grup de Tractament de la Parla >


VEU - Grup de Tractament de la Parla

Ponències/Comunicacions de congressos : [74]

Cerca a aquesta col·lecció:


 




Llista per 
Subscriviu-vos per rebre un correu electrònic cada vegada que s'introdueixi un nou ítem en aquesta col·lecció.
Vista preliminarDataTítolAutor(s)
Modelling the Effects of Spontaneous Speech in Speech Recognition_Schulz et al.pdf.jpg2013Modelling the effects of spontaneous speech in speech recognitionShulz, Henrik; Rodríguez Fonollosa, José Adrián
2013Joint recognition and direction-of-arrival estimation of simultaneous meeting-room acoustic eventsChakraborty, Rupayan; Nadeu Camprubí, Climent
2013Solar EUV flux rate estimation during mid and strong flares from the ionospheric electron content response signature in GNSS observationsHernández Pajares, Manuel; García Rigo, Alberto; Juan Zornoza, José Miguel; Sanz Subirana, Jaume; Monte Moreno, Enrique; Aragón Ángel, María Ángeles
2013A statistical approach to reverberation in non-diffusive rectangular rooms based on the image source modelNogueiras Rodríguez, Albino; Colom Olivares, Jordi
2013Real-time multi-microphone recognition of simultaneous sounds in a room environmentChakraborty, Rupayan; Nadeu Camprubí, Climent
2013The TALP-UPC approach to system selection: ASIYA features and pairwise classification using random forestsFormiga Fanals, Lluís; González Bermúdez, Meritxell; Barrón Cedeño, Luis Alberto; Rodríguez Fonollosa, José Adrián; Màrquez Villodre, Lluís
2013Channel selection using N-best hypothesis for multi-microphone ASRWolf, Martin; Nadeu Camprubí, Climent
The BUCEADOR multi-language search engine for digital libraries.pdf.jpg2012BUCEADOR, a multi-language search engine for digital librariesAdell Mercado, Jordi; Bonafonte Cávez, Antonio; Cardenal, Antonio; Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián; Moreno Bilbao, M. Asunción; Navas, Eva; Rodríguez Banga, Eduardo
Building Synthetic Voices in the METANET Framework.pdf.jpg2012Building synthetic voices in the METANET frameworkGarcia Casademont, Emília; Bonafonte Cávez, Antonio; Moreno Bilbao, M. Asunción
2012On the use of agglomerative and spectral clustering in speaker diarization of meetingsHernando Pericás, Francisco Javier
2012Building synthetic voices in the META-NET frameworkGarcia Casademont, Emília; Bonafonte Cávez, Antonio; Moreno Bilbao, M. Asunción
2012Accelerating boosting-based face detection on GPUsOro, David; Fernández, Carles; Segura, Carlos; Martorell Bofill, Xavier; Hernando Pericás, Francisco Javier
Paper INTERSPEECH 2012.pdf.jpg2012GCC-PHAT based head orientation estimationSegura, Carlos; Hernando Pericás, Francisco Javier
2012Measuring acoustic reduction in feature spaceRodríguez Fonollosa, José Adrián; Schulz, Henrik
W12-3133.pdf.jpg2012The TALP-UPC phrase-based translation systems for WMT12: morphology simplification and domain adaptationFormiga Fanals, Lluís; Henríquez Quintana, Carlos Alberto; Hernández Huerta, Adolfo; Mariño Acebal, José Bernardo; Monte Moreno, Enrique; Rodríguez Fonollosa, José Adrián
IberSPEECH2012.pdf.jpg2012Detection and handling of overlapping speech for speaker diarizationZelenak, Martin; Hernando Pericás, Francisco Javier
Paper ICB 2012.pdf.jpg2012A novel method for low-constrained iris boundary localizationFernández, Carles; Pérez, Dídac; Segura, Carlos; Hernando Pericás, Francisco Javier
Paper JRBP 12.pdf.jpg2012New approaches for iris boundary localizationPérez, Dídac; Fernández, Carles; Segura, Carlos; Hernando Pericás, Francisco Javier
Improving English to Spanish out-of-domain translations by morphology generalization and generation.pdf.jpg2012Improving English to Spanish out-of-domain translations by morphology generalization and generationFormiga Fanals, Lluís; Hernández Huerta, Adolfo; Mariño Acebal, José Bernardo; Monte Moreno, Enrique
POSTERS032.pdf.jpg2012Dealing with input noise in statistical machine translationFormiga Fanals, Lluís; Rodríguez Fonollosa, José Adrián
2011Audio segmentation of broadcast news : a hierarchical system with feature selection for the Albayzin-2010 evaluationButko, Taras; Nadeu Camprubí, Climent
2011An HMM-Based Approach to the INTERSPEECH 2011 Speaker State ChallengeNogueiras Rodríguez, Albino
BUCEADOR hybrid TTS for Blizzard Challenge 2011.pdf.jpg2011BUCEADOR hybrid TTS for blizzard challenge 2011Sainz, Iñaki; Erro, Daniel; Navas, Eva; Adell Mercado, Jordi; Bonafonte Cávez, Antonio
2011Work in progress - Cooperative and competitive projects for engaging students in advanced ICT subjectsPardàs Feliu, Montse; Bonafonte Cávez, Antonio
LREC2010_Maxim.pdf.jpg2011Towards improving English-Latvian translation: a system comparison and a new rescoring featureKhalilov, Maxim; Rodríguez Fonollosa, José Adrián; Skadina, Inguna; Braliti, Edgars; Pretkalnina, Lauma
LREC2010_Marta_Linear.pdf.jpg2011Using linear interpolation and weighted reordering hypotheses in the moses systemRuiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián
LREC2010_Marta_Automatic.pdf.jpg2011Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systemsRuiz Costa-Jussà, Marta; Farrús Cabeceran, Mireia; Mariño Acebal, José Bernardo; Rodríguez Fonollosa, José Adrián
94.pdf.jpg2011The L2F - UPC Speaker Recognition System for NIST SRE 2010Abad, Alberto; Luque, Jordi; Trancoso, Isabel; Hernando Pericás, Francisco Javier
2011On building and evaluating a broadcast-news audio segmentation systemButko, Taras; Nadeu Camprubí, Climent
2011Real-time GPU-based face detection in HD video sequencesOro, David; Fernández, Carles; Rodriguez Saeta, Javier; Martorell Bofill, Xavier; Hernando Pericás, Francisco Javier
2011The detection of overlapping speech with prosodic features for speaker diarizationZelenak, Martin; Hernando Pericás, Francisco Javier
101.pdf.jpg2011Two-source acoustic event detection and localization: online implementation in a smart-roomButko, Taras; Gonzalez Pla, Fran; Segura Perales, Carlos; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier
2011A multilingual corpus for rich audio-visual scene description in a meeting-room environmentButko, Taras; Nadeu Camprubí, Climent; Moreno Bilbao, M. Asunción
2011Extension of the remos concept to frequency-filtering-based features for reverberation-robust speech recognitionMaas, Roland; Wolf, Martin; Sehr, Armin; Nadeu Camprubí, Climent; Kellermann, Walter
Farrús2010.pdf.jpgmai-2010Linguistic-based evaluation criteria to identify statistical machine translation errorsFarrús Cabeceran, Mireia; Ruiz Costa-Jussà, Marta; Mariño Acebal, José Bernardo; Rodríguez Fonollosa, José Adrián
95.pdf.jpg2010On the improvement of speaker diarization by detecting overlapped speechHernando Pericás, Francisco Javier; Hernando Pericás, Francisco Javier
a166_p.pdf.jpg2010UPC VTEC Forecast Model Based On IGS GIMSGarcía Rigo, Alberto; Monte Moreno, Enrique; Hernández Pajares, Manuel; Juan Zornoza, José Miguel; Sanz Subirana, Jaume; Orús Pérez, Raül
TTSevaluation.pdf.jpg2010TTS evaluation campaign with a common spanish databaseSainz, Iñaki; Navas, Eva; Hernáez, Inma; Bonafonte Cávez, Antonio; Campillo, Francisco
2010Overlap detection for speaker diarization by fusing spectral and spatial featuresZelenak, Martin; Segura Perales, Carlos; Hernando Pericás, Francisco Javier
Bragos_EDUCON_2010_GILABVIR Virtual.pdf.jpg2010GILABVIR: Virtual laboratories and remote laboratories in engineering. A teaching innovation group of interestCabrera Beán, Margarita Asuncion; Bragós Bardia, Ramon; Pérez, Marimar; Mariño Acebal, José Bernardo; Rius Casals, Juan Manuel; Gomis Bellmunt, Oriol; Casany Guerrero, María José; Gironella Cobos, Xavier
96.pdf.jpg2010Albayzin 2010 Evaluation campaign: speaker diarizationZelenak, Martin; Schulz, Henrik; Hernando Pericás, Francisco Javier
sltu10-CR-3.pdf.jpg2010English-Latvian SMT: the challenge of translating into a free word order languageKhalilov, Maxim; Rodríguez Fonollosa, José Adrián; Skadina, Inguna; Bralitis, Edgar; Pretkalnina, Lauma
wolf_nadeu_camready.pdf.jpg2010On the potential of channel selection for recognition of reverberated speech with multiple microphonesWolf, Martin; Nadeu Camprubí, Climent
Interspeech2010_Taras_cameraready.pdf.jpg2010A fast one-pass-training feature selection technique for GMM-based acoustic event detection with audio-visual dataButko, Taras; Nadeu Camprubí, Climent
FALA2010-3.pdf.jpg2010Detection of overlapped acoustic events using fusion of audio and video modalitiesButko, Taras; Nadeu Camprubí, Climent
FALA2010  p 429-432. pdf.pdf.jpg2010A hierarchical architecture with feature selection for audio segmentation in a broadcast news domainButko, Taras; Nadeu Camprubí, Climent
FALA2010 p 305-308.pdf.jpg2010Albayzin-2010 audio segmentation evaluation: evaluation setup and resultsButko, Taras; Nadeu Camprubí, Climent; Schulz, Henrik
2010Synthesis of filled pauses based on a disfluent speech modelAdell Roig, Jordi; Bonafonte Cávez, Antonio; Escudero Mancebo, David
Defining analogy for non-native inclusions in Spanish TTS.pdf.jpg2010Defining analogy for non-native inclusions in Spanish utterancesPolyakova, Tatyana; Bonafonte Cávez, Antonio
Nativization of English words in Spanish using analogy.pdf.jpg2010Nativization of English words in Spanish using analogyPolyakova, Tatyana; Bonafonte Cávez, Antonio
set-2009A Catalan broadcast conversational speech databaseSchulz, Henrik; Rodríguez Fonollosa, José Adrián
lncsschulz2009.pdf.jpgset-2009A baseline system for the transcription of catalan broadcast conversationSchulz, Henrik; Rodríguez Fonollosa, José Adrián; Rybach, David
poch2009.pdf.jpgset-2009The TALP on-line Spanish-Catalan machine-translation systemPoch, M; Farrús Cabeceran, Mireia; Ruiz Costa-Jussà, Marta; Mariño Acebal, José Bernardo; Hernández, Adolfo; Henríquez Quintana, Carlos Alberto; Rodríguez Fonollosa, José Adrián
Coupling.pdf.jpgjun-2009Coupling hierarchical word reordering and decoding in phrase-based statistical machine translationDras, Mark; Khalilov, Maxim; Rodríguez Fonollosa, José Adrián
khalilov09b.pdf.jpgmai-2009A new subtree-transfer approach to syntax-based reordering for statistical machine translationKhalilov, Maxim; Rodríguez Fonollosa, José Adrián; Dras, Mark
E09-1049.pdf.jpg30-mar-2009N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combinationKhalilov, Maxim; Rodríguez Fonollosa, José Adrián
WMT-0914.pdf.jpg30-mar-2009The TALP-UPC phrase-based translation system for EACL-WMT 2009Rodríguez Fonollosa, José Adrián; Khalilov, Maxim; Ruiz Costa-Jussà, Marta; Henríquez Quintana, Carlos Alberto; Hernández, Adolfo; Banchs Martínez, Rafael Enrique
2009Audiovisual event detection towards scene understandingCanton Ferrer, Cristian; Butko, Taras; Segura, C.; Giró Nieto, Xavier; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier; Casas Pla, Josep Ramon
bonafonte_SigIL09.pdf.jpg2009Recent work on the FESTCAT database for speech synthesisBonafonte Cávez, Antonio; Esquerra Llucià, Ignasi; Aguilar, Lourdes; Oller Martínez, Sergio Horacio; Moreno Bilbao, M. Asunción
IWSLT-2008-Khalilov.pdf.jpg31-oct-2008The TALP & I2R SMT Systems for IWSLT 2008Li, H.; Aw, A.; Zhang, M.; Khalilov, Maxim; Ruiz Costa-Jussà, Marta; Henríquez Quintana, Carlos Alberto; Rodríguez Fonollosa, José Adrián; Hernández, A.; Mariño Acebal, José Bernardo; Banchs Martínez, Rafael Enrique; Chen, B.
87.pdf.jpg2008Speaker orientation estimation based on hybridation of GCC-PHAT and HLBRSegura Perales, Carlos; Abad, Alberto; Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent
86.pdf.jpg2008Bi-Gaussian score equalization in an audio-visual SVM-based person verification systemEjarque, Pascual; Hernando Pericás, Francisco Javier
2006Multidialectal acoustic modeling: a comparative studyCaballero, Mónica; Moreno Bilbao, M. Asunción; Nogueiras Rodríguez, Albino
2006Joint training of codebooks and acoustic models in automatic speech recognition using semi-continuous HMMsNogueiras Rodríguez, Albino; Caballero Galeote, Mónica; Mariño Acebal, José Bernardo
2006First experiments on an HMM based double layer framework for automatic continuous speech recognitionNogueiras Rodríguez, Albino; Casar López, Marta; Rodríguez Fonollosa, José Adrián; Caballero Galeote, Mónica
2002Multi-dialectal Spanish speech recognitionNogueiras Rodríguez, Albino; Caballero Galeote, Mónica; Moreno Bilbao, M. Asunción
2001Speech emotion recognition using hidden Markov modelsNogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo; Bonafonte Cávez, Antonio; Moreno Bilbao, M. Asunción
1998Task independent minimum confusability training for continuous speech recognitionNogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo
1998An adaptive gradient-search based algorithm for discriminative training of hmm'sNogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo; Monte Moreno, Enrique
1998Task adaptation of sub-lexical unit models using the minimum confusibility criterion on task independent databasesNogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo
NaniBD a set of tools for transcribing and validating speech databases.pdf.jpg1998NaniBD: a set of tools for transcribing and validating speech databasesNogueiras Rodríguez, Albino; Moreno Bilbao, M. Asunción
1996Frequency and time filtering of filter-bank energies for HMM speech recognitionNadeu Camprubí, Climent; Mariño Acebal, José Bernardo; Hernando Pericás, Francisco Javier; Nogueiras Rodríguez, Albino
1996Explicit segmentation of speech using gaussian modelsBonafonte Cávez, Antonio; Nogueiras Rodríguez, Albino; Rodriguez-Garrido, A
1996SETHOS: the UPC speech understanding systemBonafonte Cávez, Antonio; Mariño Acebal, José Bernardo; Nogueiras Rodríguez, Albino

 

Valid XHTML 1.0! Programari DSpace Copyright © 2002-2004 MIT and Hewlett-Packard Comentaris
Universitat Politècnica de Catalunya. Servei de Biblioteques, Publicacions i Arxius