dc.description.abstracten |
This thesis introduces a framework in which training image autoencoders by apply-
ing losses in latent space improves the quality of decodings and can significantly
decrease training time. Furthermore, within this framework, we propose a mask
guidance mechanism that combines mask features and image features in the early
phases of image encoding, which supports the encoder in reconstructing the image
embeddings. These methods are demonstrated in the context of the ill-posed image
inpainting problem, which seeks to reconstruct regions of the image that have been
occluded by a mask. We show that latent loss application results in more naturally
inpainted textures when used in a state-of-the-art inpainting architecture. We also
show that a mask-controlled embedding gives superior results across every com-
mon inpainting metric when compared to a state-of-the-art approach, which pro-
vides mask conditioning only in the image space. The final component of our study
involves the visualization of latent space to highlight damaged areas of features that
need refinement. |
uk |