The CAMOMILE collaborative annotation platform for multi-modal, multi-lingual and multi-media documents

Poignant, Johann; Budnik, Mateusz; Bredin, Herve; Barras, Claude; Adda, Gilles; Hernando Pericás, Francisco Javier; Mariani, Joseph; Morros Rubió, Josep Ramon

dc.contributor.author	Poignant, Johann
dc.contributor.author	Budnik, Mateusz
dc.contributor.author	Bredin, Herve
dc.contributor.author	Barras, Claude
dc.contributor.author	Adda, Gilles
dc.contributor.author	Hernando Pericás, Francisco Javier
dc.contributor.author	Mariani, Joseph
dc.contributor.author	Morros Rubió, Josep Ramon
dc.contributor.other	Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned	2017-03-03T13:04:10Z
dc.date.available	2017-03-03T13:04:10Z
dc.date.issued	2016
dc.identifier.citation	Poignant, J., Budnik, M., Bredin, H., Barras, C., Adda, G., Hernando, J., Mariani, J., Morros, J. The CAMOMILE collaborative annotation platform for multi-modal, multi-lingual and multi-media documents. In: Language Resources and Evaluation Conference. "LREC 2016: Tenth International Conference on Language Resources and Evaluation". Portorož: European Language Resources Association, 2016, p. 1421-1425.
dc.identifier.isbn	978-2-9517408-9-1
dc.identifier.uri	http://hdl.handle.net/2117/101916
dc.description.abstract	In this paper, we describe the organization and the implementation of the CAMOMILE collaborative annotation framework for multimodal, multimedia, multilingual (3M) data. Given the versatile nature of the analysis which can be performed on 3M data, the structure of the server was kept intentionally simple in order to preserve its genericity, relying on standard Web technologies. Layers of annotations, defined as data associated to a media fragment from the corpus, are stored in a database and can be managed through standard interfaces with authentication. Interfaces tailored specifically to the needed task can then be developed in an agile way, relying on simple but reliable services for the management of the centralized annotations. We then present our implementation of an active learning scenario for person annotation in video, relying on the CAMOMILE server; during a dry run experiment, the manual annotation of 716 speech segments was thus propagated to 3504 labeled tracks. The code of the CAMOMILE framework is distributed in open source.
dc.format.extent	5 p.
dc.language.iso	eng
dc.publisher	European Language Resources Association
dc.subject	UPC subject areas::Telecommunications engineering::Signal processing::Speech and acoustic signal processing
dc.subject.lcsh	Automatic speech recognition
dc.subject.other	Annotation tool
dc.subject.other	Collaborative annotation
dc.subject.other	Multimedia
dc.subject.other	Active learning
dc.subject.other	Person annotation
dc.title	The CAMOMILE collaborative annotation platform for multi-modal, multi-lingual and multi-media documents
dc.type	Conference lecture
dc.subject.lemac	Automatic speech recognition
dc.contributor.group	Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.contributor.group	Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo
dc.description.peerreviewed	Peer Reviewed
dc.relation.publisherversion	http://www.lrec-conf.org/proceedings/lrec2016/index.html
dc.rights.access	Open Access
local.identifier.drac	19723042
dc.description.version	Postprint (author's final draft)
dc.rights.license	Creative Commons License (by-nc-nd)
local.citation.author	Poignant, J.; Budnik, M.; Bredin, H.; Barras, C.; Adda, G.; Hernando, J.; Mariani, J.; Morros, J.R.
local.citation.contributor	Language Resources and Evaluation Conference
local.citation.pubplace	Portorož
local.citation.publicationName	LREC 2016: Tenth International Conference on Language Resources and Evaluation
local.citation.startingPage	1421
local.citation.endingPage	1425

Files in this item

Name: LREC 2016.pdf
Size: 884.7 KB
Format: PDF
Description: Paper

