    • Cross-modal neural sign language translation 

      Cardoso Duarte, Amanda (Association for Computing Machinery (ACM), 2019)
      Conference report
      Open Access
      Sign Language is the primary means of communication for the majority of the Deaf and hard-of-hearing communities. Current computational approaches in this general research area have focused specifically on sign language ...
    • Data and methods for a visual understanding of sign languages 

      Cardoso Duarte, Amanda (Universitat Politècnica de Catalunya, 2022-06-27)
      Doctoral thesis
      Open Access
      Signed languages are complete and natural languages used as the first or preferred mode of communication by millions of people worldwide. However, they, unfortunately, continue to be marginalized languages. Designing, ...
    • How2Sign: A large-scale multimodal dataset for continuous American sign language 

      Cardoso Duarte, Amanda; Palaskar, Shruti; Ventura Ripol, Lucas; Ghadiyaram, Deepti; DeHaan, Kenneth; Metze, Florian; Torres Viñals, Jordi; Giró Nieto, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2021)
      Conference lecture
      Open Access
      One of the factors that have hindered progress in the areas of sign language recognition, translation, and production is the absence of large annotated datasets. Towards this end, we introduce How2Sign, a multimodal and ...
    • Sign language translation from instructional videos 

      Tarrés Benet, Laia; Gallego Olsina, Gerard Ion; Cardoso Duarte, Amanda; Torres Viñals, Jordi; Giró Nieto, Xavier (Computer Vision Foundation, 2023)
      Conference report
      Open Access
      The advances in automatic sign language translation (SLT) to spoken languages have been mostly benchmarked with datasets of limited size and restricted domains. Our work advances the state of the art by providing the first ...
    • Sign language video retrieval with free-form textual queries 

      Cardoso Duarte, Amanda; Albanie, Samuel; Giró Nieto, Xavier; Varol, Gül (Institute of Electrical and Electronics Engineers (IEEE), 2022)
      Conference lecture
      Open Access
      Systems that can efficiently search collections of sign language videos have been highlighted as a useful application of sign language technology. However, the problem of searching videos beyond individual keywords has ...
    • Wav2Pix: speech-conditioned face generation using generative adversarial networks 

      Cardoso Duarte, Amanda; Roldan, Francisco; Tubau, Miquel; Escur, Janna; Pascual de la Puente, Santiago; Salvador Aguilera, Amaia; Mohedano, Eva; McGuinness, Kevin; Torres Viñals, Jordi; Giró Nieto, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2019)
      Conference lecture
      Restricted access - publisher's policy
      Speech is a rich biometric signal that contains information about the identity, gender and emotional state of the speaker. In this work, we explore its potential to generate face images of a speaker by conditioning a ...