Controllable synthetic image datasets generation for advancements in human pose and shape estimation

Show simple item record

dc.contributor.author Viniavskyi, Ostap
dc.date.accessioned 2024-08-23T11:35:25Z
dc.date.available 2024-08-23T11:35:25Z
dc.date.issued 2024
dc.identifier.citation Viniavskyi Ostap. Controllable synthetic image datasets generation for advancements in human pose and shape estimation. Ukrainian Catholic University, Faculty of Applied Sciences, Department of Computer Sciences. Lviv 2024, 45 p. uk
dc.identifier.uri https://er.ucu.edu.ua/handle/1/4679
dc.language.iso en uk
dc.subject datasets generation uk
dc.subject Controllable synthetic image datasets generation uk
dc.subject advancements uk
dc.subject human pose uk
dc.subject shape estimation uk
dc.title Controllable synthetic image datasets generation for advancements in human pose and shape estimation uk
dc.type Preprint uk
dc.status Публікується вперше uk
dc.description.abstracten Human-centric applications are ubiquitous in the modern world. Myriads of edu- cational, entertainment, e-commerce, and other applications require understanding the measurements of the human body in the image. The deep-learning methods for solving human-centric problems usually rely on supervised learning approaches and need tons of labeled data to excel. Label acquisition for 3D Computer Vision tasks and specifically for the human mesh estimation task became even more difficult and error-prone compared to 2D tasks due to the inherent complexity of working with an additional dimension. That is where the synthetic data steps in, allowing researchers to obtain much more cost- efficient and pixel-perfect annotations. In this work, we utilize the existing Latent Diffusion Model for conditional image generation and create a method for synthesizing a large dataset of humans with 3D mesh labels obtained without the involvement of a human annotator. Further, we show the effectiveness of using such a synthetic dataset and its superiority compared to other synthetic data obtained from the game engines. The implementation of the proposed approach can be accessed on the GitHub uk


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Browse

My Account