Recent submissions

Uso de redes neuronales convolucionales para la detección remota de frutos con cámaras RGB-D

Gené Mola, Jordi; Vilaplana Besler, Verónica; Rosell Polo, Joan Ramon; Morros Rubió, Josep Ramon; Ruiz Hidalgo, Javier; Gregorio López, Eduard (Universidad de Zaragoza (UZA), 2019)
Text in conference proceedings
Open access

Remote fruit detection will be an indispensable tool for the optimized and sustainable agronomic management of the fruit plantations of the future, with applications in yield forecasting, robotization of the ...

Comparative study of upsampling methods for super-resolution in remote sensing

Salgueiro Romero, Luis Fernando; Marcello Ruiz, Javier; Vilaplana Besler, Verónica (International Society for Photo-Optical Instrumentation Engineers (SPIE), 2019)
Text in conference proceedings
Open access

Many remote sensing applications require high spatial resolution images, but the elevated cost of these images makes some studies unfeasible. Single-image super-resolution algorithms can improve the spatial resolution of ...

Sign language video retrieval with free-form textual queries

Cardoso Duarte, Amanda; Albanie, Samuel; Giró Nieto, Xavier; Varol, Gül (Institute of Electrical and Electronics Engineers (IEEE), 2022)
Conference communication
Open access

Systems that can efficiently search collections of sign language videos have been highlighted as a useful application of sign language technology. However, the problem of searching videos beyond individual keywords has ...

Channel-wise early stopping without a validation set via NNK polytope interpolation

Bonet Solé, David; Ortega, Antonio; Ruiz Hidalgo, Javier; Shekkizhar, Sarath (2021)
Text in conference proceedings
Open access

State-of-the-art neural network architectures continue to scale in size and deliver impressive generalization results, although this comes at the expense of limited interpretability. In particular, a key challenge is to ...

H3D-Net: Few-shot high-fidelity 3D head reconstruction

Ramon Maldonado, Eduard; Triginer Garcés, Gil; Escurt i Gelabert, Janna; Pumarola Peris, Albert; García Giráldez, Jaime; Giró Nieto, Xavier; Moreno-Noguer, Francesc (Computer Vision Foundation, 2021)
Conference communication
Open access

Recent learning approaches that implicitly represent surface geometry using coordinate-based neural representations have shown impressive results in the problem of multi-view 3D reconstruction. The effectiveness of these ...

Seasonal contrast: Unsupervised pre-training from uncurated remote sensing data

Mañas Sánchez, Óscar; Lacoste, Alexandre; Giró Nieto, Xavier; Vázquez Bermúdez, David; Rodriguez López, Pau (Computer Vision Foundation, 2021)
Conference communication
Open access

Remote sensing and automatic earth monitoring are key to solve global-scale challenges such as disaster prevention, land use monitoring, or tackling climate change. Although there exist vast amounts of remote sensing data, ...

How2Sign: A large-scale multimodal dataset for continuous American sign language

Cardoso Duarte, Amanda; Palaskar, Shruti; Ventura Ripol, Lucas; Ghadiyaram, Deepti; DeHaan, Kenneth; Metze, Florian; Torres Viñals, Jordi; Giró Nieto, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2021)
Conference communication
Open access

One of the factors that have hindered progress in the areas of sign language recognition, translation, and production is the absence of large annotated datasets. Towards this end, we introduce How2Sign, a multimodal and ...

UPCommons. The UPC open knowledge portal

Conference papers/communications: Recent submissions

Refinement network for unsupervised on the scene foreground segmentation

Explore, discover and learn: unsupervised discovery of state-covering skills

Weakly supervised semantic segmentation for remote sensing hyperspectral imaging

One perceptron to rule them all: language, vision, audio and speech

Automatic reminiscence therapy for dementia
