Show simple item record

dc.contributorEscalera, Sergio
dc.contributorMeysam Madadi
dc.contributor.authorBirkholz, Michael
dc.date.accessioned2022-06-22T07:05:58Z
dc.date.issued2022-04-27
dc.identifier.urihttp://hdl.handle.net/2117/368897
dc.description.abstractEstimating accurate 3D pose and shape from a 2D image is an inherently difficult problem. Part of the difficulty arises from the ambiguity of potential solutions based solely on geometric features. The fields of computer vision and artificial intelligence are particularly suited to finding a solution to this problem although they have primarily focused on pose recovery, leaving shape as an afterthought. This thesis explores adaptations of and extensions to a recent human mesh recovery framework that showed a significant improvement on shape metrics compared to a very popular real-time pose and shape estimator. The framework employs a multi-stage process, refining pose first, and then shape, through a non-differentiable mesh deformation process. A differentiable alternative to these deformation steps was proposed. In support of this effort, a dataset was compiled which indexes some of the most popular 2D and 3D human datasets and provides a common access format. The mesh recovery framework was retrained using this dataset, which incorporated an order of magnitude more samples than the dataset used to train the published framework. The new weights achieved the same levels of performance as the published weights, despite having less reliable ground-truth annotations. In addition, a multi-layer perceptron that has demonstrated state-of-the-art performance at pose parameter regression was trained, using millions of ground-truth 3D human meshes, to correct perturbations in shape and pose. Training techniques and methods of interfacing this network to the mesh recovery framework have been investigated and documented.
dc.language.isoeng
dc.publisherUniversitat Politècnica de Catalunya
dc.subjectÀrees temàtiques de la UPC::Informàtica::Aplicacions de la informàtica::Disseny assistit per ordinador
dc.subject.lcshThree-dimensional display systems
dc.subject.lcshComputer vision
dc.subject.other3D human pose estimation
dc.subject.other3D human pose and shape estimation
dc.subject.otherDeep learning
dc.subject.otherWeakly-supervised learning
dc.subject.othercomputer vision
dc.subject.otherSMPL
dc.titleWeakly-Supervised RGB-Based 3D human body pose and shape estimation
dc.typeMaster thesis
dc.subject.lemacVisualització tridimensional (Informàtica)
dc.subject.lemacVisió per ordinador
dc.identifier.slug163458
dc.rights.accessRestricted access - author's decision
dc.date.lift10000-01-01
dc.date.updated2022-05-25T04:00:34Z
dc.audience.educationlevelMàster
dc.audience.mediatorFacultat d'Informàtica de Barcelona
dc.audience.degreeMÀSTER UNIVERSITARI EN INTEL·LIGÈNCIA ARTIFICIAL (Pla 2017)
dc.contributor.covenanteeUniversitat de Barcelona. Facultat de Matemàtiques i Informàtica


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record