dc.contributor: Morros Rubió, Josep Ramon
dc.contributor: Sayrol Clols, Elisa
dc.contributor.author: Fernández-Pedraza Jorde, Carolina
dc.contributor.other: Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned: 2019-06-06T07:38:55Z
dc.date.available: 2019-06-06T07:38:55Z
dc.date.issued: 2019-05
dc.identifier.uri: http://hdl.handle.net/2117/134028
dc.description.abstract: In recent years, the demand for video tools that automatically annotate and classify large audiovisual datasets has increased considerably. One specific task in this field applies to TV broadcast videos: determining who appears in a video sequence and when. This work builds on the ALBAYZIN evaluation series presented at IberSPEECH-RTVE 2018 in Barcelona; the purpose of this thesis is to improve the results obtained and to compare the different face detection and tracking methods. We evaluate the performance of classic face detection techniques and of other techniques based on machine learning on a closed dataset of 34 known people. The remaining people in the audiovisual document are labelled as "unknown". We work with short videos and images of each known person to build a model of that person and, finally, evaluate the performance of the ALBAYZIN algorithm on a 2-hour video titled "La noche en 24H", whose format resembles a news program. We analyze the results, the types of errors and scenarios encountered, and the solutions we propose for each of them, where one exists. This work focuses solely on monomodal face recognition and tracking.
dc.language.iso: eng
dc.publisher: Universitat Politècnica de Catalunya
dc.rights: Distribution of this work is authorized under the Creative Commons license or similar 'Attribution-NonCommercial-NoDerivs'
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject: UPC subject areas::Telecommunications engineering
dc.subject.lcsh: Machine learning
dc.subject.lcsh: Human face recognition (Computer science)
dc.subject.other: deep learning
dc.subject.other: machine learning
dc.subject.other: face detection
dc.subject.other: face classification
dc.subject.other: face tracking
dc.subject.other: Albayzin
dc.title: Person annotation in video sequences
dc.type: Master thesis
dc.subject.lemac: Aprenentatge automàtic
dc.subject.lemac: Reconeixement facial (Informàtica)
dc.identifier.slug: ETSETB-230.138886
dc.rights.access: Open Access
dc.date.updated: 2019-06-03T05:52:28Z
dc.audience.educationlevel: Master's
dc.audience.mediator: Escola Tècnica Superior d'Enginyeria de Telecomunicació de Barcelona
dc.audience.degree: MÀSTER UNIVERSITARI EN ENGINYERIA DE TELECOMUNICACIÓ (Pla 2013)

