dc.contributor.author | Herrera-Palacio, Alba |
dc.contributor.author | Ventura, Carles |
dc.contributor.author | Giró Nieto, Xavier |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
dc.date.accessioned | 2019-11-12T14:10:42Z |
dc.date.issued | 2019 |
dc.identifier.citation | Herrera-Palacio, A.; Ventura, C.; Giro, X. Video object linguistic grounding. In: International Workshop on Multimodal Understanding and Learning for Embodied Applications. "MULEA '19 1st International Workshop on Multimodal Understanding and Learning for Embodied Applications Nice, France: October 25-25, 2019". New York: Association for Computing Machinery (ACM), 2019, p. 49-51. |
dc.identifier.isbn | 978-1-4503-6918-3 |
dc.identifier.other | https://imatge.upc.edu/web/publications/video-object-linguistic-grounding |
dc.identifier.uri | http://hdl.handle.net/2117/172234 |
dc.description.abstract | The goal of this work is to segment, in a video sequence, the objects that are mentioned in a linguistic description of the scene. We have adapted an existing deep neural network that achieves state-of-the-art performance in semi-supervised video object segmentation, adding a linguistic branch that generates an attention map over the video frames and makes the segmentation of the objects temporally consistent along the sequence. |
dc.format.extent | 3 p. |
dc.language.iso | eng |
dc.publisher | Association for Computing Machinery (ACM) |
dc.subject | UPC subject areas::Computer science::Artificial intelligence |
dc.subject.lcsh | Neural networks (Computer science) |
dc.subject.lcsh | Linguistics |
dc.subject.lcsh | Image processing -- Digital techniques |
dc.subject.other | Video object grounding |
dc.subject.other | Neural networks |
dc.subject.other | Linguistics |
dc.title | Video object linguistic grounding |
dc.type | Conference lecture |
dc.subject.lemac | Neural networks (Computer science) -- Applications |
dc.subject.lemac | Linguistics |
dc.subject.lemac | Images -- Processing -- Digital techniques |
dc.contributor.group | Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo |
dc.identifier.doi | 10.1145/3347450.3357662 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | https://dl.acm.org/citation.cfm?id=3357662 |
dc.rights.access | Restricted access - publisher's policy |
local.identifier.drac | 25894716 |
dc.description.version | Postprint (published version) |
dc.date.lift | 10000-01-01 |
local.citation.author | Herrera-Palacio, A.; Ventura, C.; Giro, X. |
local.citation.contributor | International Workshop on Multimodal Understanding and Learning for Embodied Applications |
local.citation.pubplace | New York |
local.citation.publicationName | MULEA '19 1st International Workshop on Multimodal Understanding and Learning for Embodied Applications Nice, France: October 25-25, 2019 |
local.citation.startingPage | 49 |
local.citation.endingPage | 51 |