Head Rotation in Denoising Diffusion Models
Denoising Diffusion Models (DDM) are emerging as the cutting-edge technology in the realm of deep generative modeling, challenging the dominance of Generative Adversarial Networks.
GitHub Link
The GitHub link is https://github.com/asperti/head-rotation

Introduction
This repository, "Head-Rotation," accompanies the article "Head Rotation in Denoising Diffusion Models." The collaboratively authored article addresses the challenge of exploring and manipulating the latent space of Denoising Diffusion Models (DDM) for face rotation. The researchers employ an embedding technique for Denoising Diffusion Implicit Models (DDIM) to achieve large manipulations of the face rotation angle, up to ±30°. The method computes trajectories in the latent space through linear regression to represent rotations. As a byproduct, the CelebA dataset is labeled by illumination direction, improving the selection of images used in the process. The study highlights the intricate relationship between illumination, pose, and rotation.

Content
This is a companion repository to the article "Head Rotation in Denoising Diffusion Models", joint work with Gabriele Colasuonno and Antonio Guerra. In this research, our focus is specifically on face rotation, which is recognized as one of the most complex editing operations. By utilizing a recent embedding technique for Denoising Diffusion Implicit Models (DDIM), we have achieved remarkable manipulations covering a wide rotation angle of up to $\pm 30^\circ$, while preserving the distinct characteristics of each individual.

Our methodology involves computing trajectories that approximate clusters of latent representations of dataset samples with various yaw rotations through linear regression. These trajectories are obtained by analyzing subsets of data that share significant attributes with the source image. One of these critical attributes is the light provenance: as a byproduct of our research, we have labeled the CelebA dataset, categorizing images into three major groups based on the illumination direction: left, center, and right.

For a fixed direction (left or right), the approach is schematically described in the following picture. For computational reasons, we prefer to fit the trajectory through cluster centroids rather than directly over all samples in the clusters. In the picture below, we summarise the outcome of our labeling and the complex interplay between illumination and orientation by showing the mean faces corresponding to different light sources and poses.

Alternatives & Similar Tools
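The centroid-based trajectory fitting described above can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the latent dimensionality, the yaw binning, and the `rotate_latent` helper are assumptions made for the example, and the toy latents stand in for real DDIM embeddings.

```python
import numpy as np

# Toy stand-in for DDIM latent codes: each latent moves along a hidden
# direction proportionally to the yaw angle, plus noise. In the real
# method, the latents would come from DDIM embedding of CelebA images.
rng = np.random.default_rng(0)
latent_dim = 64
true_direction = rng.normal(size=latent_dim)
yaws = rng.uniform(-30.0, 30.0, size=500)  # yaw angles in degrees
latents = yaws[:, None] * true_direction \
    + rng.normal(scale=0.1, size=(500, latent_dim))

# 1. Bin the samples by yaw and compute one centroid per bin
#    (fitting over centroids, not over all samples, as in the text).
bins = np.linspace(-30.0, 30.0, 7)          # six yaw bins
bin_ids = np.digitize(yaws, bins[1:-1])     # 0..5
centroids = np.stack(
    [latents[bin_ids == b].mean(axis=0) for b in range(6)])
bin_centers = 0.5 * (bins[:-1] + bins[1:])

# 2. Linear regression: fit z(yaw) = yaw * direction + offset
#    through the six centroids via least squares.
A = np.stack([bin_centers, np.ones_like(bin_centers)], axis=1)
coeffs, *_ = np.linalg.lstsq(A, centroids, rcond=None)
direction, offset = coeffs[0], coeffs[1]

# 3. Editing step (hypothetical helper): move a latent along the
#    fitted trajectory from its current yaw to a target yaw.
def rotate_latent(z, current_yaw, target_yaw):
    """Shift a latent code along the fitted yaw direction."""
    return z + (target_yaw - current_yaw) * direction

z_rotated = rotate_latent(latents[0], yaws[0], 25.0)
```

In the actual method, a separate trajectory would be fitted per attribute subset (e.g. per illumination direction), and the shifted latent would be decoded back to an image with the DDIM sampler.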
CycleAdapt cyclically adapts two networks, a human mesh reconstruction network (HMRNet) and a human motion denoising network (MDNet), given a test video.
Google Gemini, a multimodal AI by Google DeepMind, processes text, audio, images, and more. Gemini performs strongly on AI benchmarks, is optimized for a range of devices, and has been tested for safety and bias in line with responsible AI practices.
Video ReTalking edits real-world talking-head videos according to input audio, producing high-quality, lip-synced output.
Face-swapping tools swap faces in photos and videos.