Mostra el registre d'ítem simple
Towards large scale multimedia indexing: a case study on person discovery in broadcast news
dc.contributor.author | Le, Nam |
dc.contributor.author | Bredin, Herve |
dc.contributor.author | Sergent, Gabriel |
dc.contributor.author | India Massana, Miquel Àngel |
dc.contributor.author | López-Otero, Paula |
dc.contributor.author | Barras, Claude |
dc.contributor.author | Guinaudeau, Camille |
dc.contributor.author | Gravier, Guillaume |
dc.contributor.author | Barbosa da Fonseca, Gabriel |
dc.contributor.author | Lyon Freire, Izabela |
dc.contributor.author | Patrocinio Jr., Zenilton |
dc.contributor.author | Jamil F. Guimarães, Silvio |
dc.contributor.author | Martí Juan, Gerard |
dc.contributor.author | Morros Rubió, Josep Ramon |
dc.contributor.author | Hernando Pericás, Francisco Javier |
dc.contributor.author | Docio-Fernández, Laura |
dc.contributor.author | García-Mateo, Carmen |
dc.contributor.author | Meignier, Sylvain |
dc.contributor.author | Odobez, Jean-Marc |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
dc.date.accessioned | 2017-12-19T11:12:46Z |
dc.date.issued | 2017 |
dc.identifier.citation | Le, N., Bredin, H., Sergent, G., India, M., López-Otero, P., Barras, C., Guinaudeau, C., Gravier, G., Barbosa da Fonseca, G., Lyon Freire, I., Patrocinio Jr., Z., Jamil F. Guimarães, S., Marti, G., Morros, J.R., Hernando, J., Docio-Fernández, L., García-Mateo, C., Meignier, S., Odobez, J. Towards large scale multimedia indexing: a case study on person discovery in broadcast news. A: International Workshop on Content-Based Multimedia Indexing. "Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing, CBMI 2017, Florence, Italy, June 19-21, 2017". Firenze: Association for Computing Machinery (ACM), 2017, p. 1-6. |
dc.identifier.isbn | 978-1-4503-5333-5 |
dc.identifier.uri | http://hdl.handle.net/2117/112283 |
dc.description.abstract | The rapid growth of multimedia databases and the human interest in their peers make indices representing the location and identity of people in audio-visual documents essential for searching archives. Person discovery in the absence of prior identity knowledge requires accurate association of audio-visual cues and detected names. To this end, we present 3 different strategies to approach this problem: clustering-based naming, verification-based naming, and graph-based naming. Each of these strategies utilizes different recent advances in unsupervised face / speech representation, verification, and optimization. To have a better understanding of the approaches, this paper also provides a quantitative and qualitative comparative study of these approaches using the associated corpus of the Person Discovery challenge at MediaEval 2016. From the results of our experiments, we can observe the pros and cons of each approach, thus paving the way for future promising research directions. |
dc.format.extent | 6 p. |
dc.language.iso | eng |
dc.publisher | Association for Computing Machinery (ACM) |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Sistemes d'informació::Bases de dades |
dc.subject | Àrees temàtiques de la UPC::So, imatge i multimèdia |
dc.subject.lcsh | Databases |
dc.subject.lcsh | Audio-visual materials |
dc.subject.lcsh | Cluster |
dc.title | Towards large scale multimedia indexing: a case study on person discovery in broadcast news |
dc.type | Conference report |
dc.subject.lemac | Bases de dades |
dc.subject.lemac | Audiovisuals |
dc.subject.lemac | Sistemes productius locals |
dc.contributor.group | Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo |
dc.contributor.group | Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
dc.identifier.doi | 10.1145/3095713.3095732 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | https://dl.acm.org/citation.cfm?doid=3095713.3095732 |
dc.rights.access | Restricted access - publisher's policy |
local.identifier.drac | 21669987 |
dc.description.version | Postprint (published version) |
dc.relation.projectid | info:eu-repo/grantAgreement/EC/FP7/611057/EU/EUMSSI- Event Understanding through Multimodal Social Stream Interpretation/EUMSSI |
dc.date.lift | 10000-01-01 |
local.citation.author | Le, N.; Bredin, H.; Sergent, G.; India, M.; López-Otero, P.; Barras, C.; Guinaudeau, C.; Gravier, G.; Barbosa da Fonseca, G.; Lyon Freire, I.; Patrocinio Jr., Z.; Jamil F. Guimarães, S.; Marti, G.; Morros, J.R.; Hernando, J.; Docio-Fernández, L.; García-Mateo, C.; Meignier, S.; Odobez, J. |
local.citation.contributor | International Workshop on Content-Based Multimedia Indexing |
local.citation.pubplace | Firenze |
local.citation.publicationName | Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing, CBMI 2017, Florence, Italy, June 19-21, 2017 |
local.citation.startingPage | 1 |
local.citation.endingPage | 6 |