dc.contributor | Escalera, Sergio |
dc.contributor | Meysam Madadi |
dc.contributor.author | Birkholz, Michael |
dc.date.accessioned | 2022-06-22T07:05:58Z |
dc.date.issued | 2022-04-27 |
dc.identifier.uri | http://hdl.handle.net/2117/368897 |
dc.description.abstract | Estimating accurate 3D pose and shape from a 2D image is an inherently difficult problem. Part of the difficulty arises from the ambiguity of potential solutions based solely on geometric features. The fields of computer vision and artificial intelligence are particularly suited to finding a solution to this problem although they have primarily focused on pose recovery, leaving shape as an afterthought. This thesis explores adaptations of and extensions to a recent human mesh recovery framework that showed a significant improvement on shape metrics compared to a very popular real-time pose and shape estimator. The framework employs a multi-stage process, refining pose first, and then shape, through a non-differentiable mesh deformation process. A differentiable alternative to these deformation steps was proposed. In support of this effort, a dataset was compiled which indexes some of the most popular 2D and 3D human datasets and provides a common access format. The mesh recovery framework was retrained using this dataset, which incorporated an order of magnitude more samples than the dataset used to train the published framework. The new weights achieved the same levels of performance as the published weights, despite having less reliable ground-truth annotations. In addition, a multi-layer perceptron that has demonstrated state-of-the-art performance at pose parameter regression was trained, using millions of ground-truth 3D human meshes, to correct perturbations in shape and pose. Training techniques and methods of interfacing this network to the mesh recovery framework have been investigated and documented. |
dc.language.iso | eng |
dc.publisher | Universitat Politècnica de Catalunya |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Aplicacions de la informàtica::Disseny assistit per ordinador |
dc.subject.lcsh | Three-dimensional display systems |
dc.subject.lcsh | Computer vision |
dc.subject.other | 3D human pose estimation |
dc.subject.other | 3D human pose and shape estimation |
dc.subject.other | Deep learning |
dc.subject.other | Weakly-supervised learning |
dc.subject.other | computer vision |
dc.subject.other | SMPL |
dc.title | Weakly-Supervised RGB-Based 3D human body pose and shape estimation |
dc.type | Master thesis |
dc.subject.lemac | Visualització tridimensional (Informàtica) |
dc.subject.lemac | Visió per ordinador |
dc.identifier.slug | 163458 |
dc.rights.access | Restricted access - author's decision |
dc.date.lift | 10000-01-01 |
dc.date.updated | 2022-05-25T04:00:34Z |
dc.audience.educationlevel | Màster |
dc.audience.mediator | Facultat d'Informàtica de Barcelona |
dc.audience.degree | MÀSTER UNIVERSITARI EN INTEL·LIGÈNCIA ARTIFICIAL (Pla 2017) |
dc.contributor.covenantee | Universitat de Barcelona. Facultat de Matemàtiques i Informàtica |