Deep learning at Dublin city university

Woodward Riquelme, Alejandro Benjamín

dc.contributor	Mcguinness, Kevin
dc.contributor	Giró Nieto, Xavier
dc.contributor.author	Woodward Riquelme, Alejandro Benjamín
dc.contributor.other	Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned	2018-11-15T14:25:31Z
dc.date.issued	2018-05
dc.identifier.uri	http://hdl.handle.net/2117/124388
dc.description.abstract	Speech recognition involves generating sequences of words that match what is being said in recordings of speech. In recent years, machine learning techniques are increasingly being used in speech recognition mainly due to the widespread availability of training data and the decrease in cost related to large scale computation resources. These two factors made feasible the use of a powerful machine learning technique - deep learning - to create end-to-end speech recognition systems. This, compared to classical methods used in this field, does not require an extensive knowledge of phonetics. When listening to any kind of speech, humans use prior knowledge about the topic (politics, medicine, sports, etc.) of the speech for better understanding. In contrast, speech recognition systems do not usually use this prior knowledge. The use of contextual information to improve an automatic speech recognition system is explored in this thesis. The output of this thesis will be used by the company Vilynx to transcribe speech from videos that, among others, contain general, sport, and entertainment news. Contextual information is extracted from the video title and video description.
dc.language.iso	eng
dc.publisher	Universitat Politècnica de Catalunya
dc.subject	Àrees temàtiques de la UPC::Enginyeria de la telecomunicació
dc.subject.lcsh	Machine learning
dc.subject.lcsh	Artificial intelligence
dc.subject.other	speech recognition
dc.subject.other	deep learning
dc.subject.other	contextual information
dc.title	Deep learning at Dublin city university
dc.title.alternative	Contextual information for speech recognition
dc.type	Master thesis
dc.subject.lemac	Aprenentatge automàtic
dc.subject.lemac	Intel·ligència artificial
dc.identifier.slug	ETSETB-230.135197
dc.rights.access	Restricted access - confidentiality agreement
dc.date.lift	10000-01-01
dc.date.updated	2018-07-26T05:53:02Z
dc.audience.educationlevel	Màster
dc.audience.mediator	Escola Tècnica Superior d'Enginyeria de Telecomunicació de Barcelona
dc.audience.degree	MÀSTER UNIVERSITARI EN ENGINYERIA DE TELECOMUNICACIÓ (Pla 2013)
dc.contributor.covenantee	Dublin City University

Fitxers d'aquest items

Nom:: masters-thesis-alejandro.pdf
Mida:: 1,958Mb
Format:: PDF

Visualitza/Obre

Aquest ítem apareix a les col·leccions següents

Master's degree in Telecommunications Engineering (MET) [393]

Mostra el registre d'ítem simple

UPCommons. Portal del coneixement obert de la UPC

Deep learning at Dublin city university

Fitxers d'aquest items

Aquest ítem apareix a les col·leccions següents

Explora