Mostra el registre d'ítem simple
Deep learning at Dublin city university
dc.contributor | Mcguinness, Kevin |
dc.contributor | Giró Nieto, Xavier |
dc.contributor.author | Woodward Riquelme, Alejandro Benjamín |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
dc.date.accessioned | 2018-11-15T14:25:31Z |
dc.date.issued | 2018-05 |
dc.identifier.uri | http://hdl.handle.net/2117/124388 |
dc.description.abstract | Speech recognition involves generating sequences of words that match what is being said in recordings of speech. In recent years, machine learning techniques are increasingly being used in speech recognition mainly due to the widespread availability of training data and the decrease in cost related to large scale computation resources. These two factors made feasible the use of a powerful machine learning technique - deep learning - to create end-to-end speech recognition systems. This, compared to classical methods used in this field, does not require an extensive knowledge of phonetics. When listening to any kind of speech, humans use prior knowledge about the topic (politics, medicine, sports, etc.) of the speech for better understanding. In contrast, speech recognition systems do not usually use this prior knowledge. The use of contextual information to improve an automatic speech recognition system is explored in this thesis. The output of this thesis will be used by the company Vilynx to transcribe speech from videos that, among others, contain general, sport, and entertainment news. Contextual information is extracted from the video title and video description. |
dc.language.iso | eng |
dc.publisher | Universitat Politècnica de Catalunya |
dc.subject | Àrees temàtiques de la UPC::Enginyeria de la telecomunicació |
dc.subject.lcsh | Machine learning |
dc.subject.lcsh | Artificial intelligence |
dc.subject.other | speech recognition |
dc.subject.other | deep learning |
dc.subject.other | contextual information |
dc.title | Deep learning at Dublin city university |
dc.title.alternative | Contextual information for speech recognition |
dc.type | Master thesis |
dc.subject.lemac | Aprenentatge automàtic |
dc.subject.lemac | Intel·ligència artificial |
dc.identifier.slug | ETSETB-230.135197 |
dc.rights.access | Restricted access - confidentiality agreement |
dc.date.lift | 10000-01-01 |
dc.date.updated | 2018-07-26T05:53:02Z |
dc.audience.educationlevel | Màster |
dc.audience.mediator | Escola Tècnica Superior d'Enginyeria de Telecomunicació de Barcelona |
dc.audience.degree | MÀSTER UNIVERSITARI EN ENGINYERIA DE TELECOMUNICACIÓ (Pla 2013) |
dc.contributor.covenantee | Dublin City University |