Mostra el registre d'ítem simple

dc.contributor.authorPoignant, Johann
dc.contributor.authorBudnik, Mateusz
dc.contributor.authorBredin, Herve
dc.contributor.authorBarras, Claude
dc.contributor.authorAdda, Gilles
dc.contributor.authorHernando Pericás, Francisco Javier
dc.contributor.authorMariani, Joseph
dc.contributor.authorMorros Rubió, Josep Ramon
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned2017-03-03T13:04:10Z
dc.date.available2017-03-03T13:04:10Z
dc.date.issued2016
dc.identifier.citationPoignant, J., Budnik, M., Bredin, H., Barras, C., Adda, G., Hernando, J., Mariani, J., Morros, J. The CAMOMILE collaborative annotation platform for multi-modal, multi-lingual and multi-media documents. A: Language Resources and Evaluation Conference. "LREC 2016: Tenth International Conference on Language Resources and Evaluation". Portorož: European Language Resources Association, 2016, p. 1421-1425.
dc.identifier.isbn978-2-9517408-9-1
dc.identifier.urihttp://hdl.handle.net/2117/101916
dc.description.abstractIn this paper, we describe the organization and the implementation of the CAMOMILE collaborative annotation framework for multimodal, multimedia, multilingual (3M) data. Given the versatile nature of the analysis which can be performed on 3M data, the structure of the server was kept intentionally simple in order to preserve its genericity, relying on standard Web technologies. Layers of annotations, defined as data associated to a media fragment from the corpus, are stored in a database and can be managed through standard interfaces with authentication. Interfaces tailored specifically to the needed task can then be developed in an agile way, relying on simple but reliable services for the management of the centralized annotations. We then present our implementation of an active learning scenario for person annotation in video, relying on the CAMOMILE server; during a dry run experiment, the manual annotation of 716 speech segments was thus propagated to 3504 labeled tracks. The code of the CAMOMILE framework is distributed in open source.
dc.format.extent5 p.
dc.language.isoeng
dc.publisherEuropean Language Resources Association
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic
dc.subject.lcshAutomatic speech recognition
dc.subject.otherAnnotation tool
dc.subject.otherCollaborative annotation
dc.subject.otherMultimedia
dc.subject.otherActive learning
dc.subject.otherPerson annotation.
dc.titleThe CAMOMILE collaborative annotation platform for multi-modal, multi-lingual and multi-media documents
dc.typeConference lecture
dc.subject.lemacReconeixement automàtic de la parla
dc.contributor.groupUniversitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.contributor.groupUniversitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://www.lrec-conf.org/proceedings/lrec2016/index.html
dc.rights.accessOpen Access
local.identifier.drac19723042
dc.description.versionPostprint (author's final draft)
dc.rights.licenseCretaive Commons License (by-nc-nd)
local.citation.authorPoignant, J.; Budnik, M.; Bredin, H.; Barras, C.; Adda, G.; Hernando, J.; Mariani, J.; Morros, J.R.
local.citation.contributorLanguage Resources and Evaluation Conference
local.citation.pubplacePortorož
local.citation.publicationNameLREC 2016: Tenth International Conference on Language Resources and Evaluation
local.citation.startingPage1421
local.citation.endingPage1425


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple