Показати скорочений опис матеріалу
dc.contributor.author | Viniavskyi, Ostap | |
dc.date.accessioned | 2024-08-23T11:35:25Z | |
dc.date.available | 2024-08-23T11:35:25Z | |
dc.date.issued | 2024 | |
dc.identifier.citation | Viniavskyi Ostap. Controllable synthetic image datasets generation for advancements in human pose and shape estimation. Ukrainian Catholic University, Faculty of Applied Sciences, Department of Computer Sciences. Lviv 2024, 45 p. | uk |
dc.identifier.uri | https://er.ucu.edu.ua/handle/1/4679 | |
dc.language.iso | en | uk |
dc.subject | datasets generation | uk |
dc.subject | Controllable synthetic image datasets generation | uk |
dc.subject | advancements | uk |
dc.subject | human pose | uk |
dc.subject | shape estimation | uk |
dc.title | Controllable synthetic image datasets generation for advancements in human pose and shape estimation | uk |
dc.type | Preprint | uk |
dc.status | Публікується вперше | uk |
dc.description.abstracten | Human-centric applications are ubiquitous in the modern world. Myriads of edu- cational, entertainment, e-commerce, and other applications require understanding the measurements of the human body in the image. The deep-learning methods for solving human-centric problems usually rely on supervised learning approaches and need tons of labeled data to excel. Label acquisition for 3D Computer Vision tasks and specifically for the human mesh estimation task became even more difficult and error-prone compared to 2D tasks due to the inherent complexity of working with an additional dimension. That is where the synthetic data steps in, allowing researchers to obtain much more cost- efficient and pixel-perfect annotations. In this work, we utilize the existing Latent Diffusion Model for conditional image generation and create a method for synthesizing a large dataset of humans with 3D mesh labels obtained without the involvement of a human annotator. Further, we show the effectiveness of using such a synthetic dataset and its superiority compared to other synthetic data obtained from the game engines. The implementation of the proposed approach can be accessed on the GitHub | uk |