Synthesizing Obama: Learning Lip Sync from Audio

Created on 2022-05-08T02:55:43-05:00

Uses LSTMs to predict mouth shapes from audio.

The LSTMs don't output a mouth directly. They output a "mouth PCA" feature set: the coefficients obtained by running principal component analysis on the actual mouth features, which gives a smaller set of variables that can be mapped back to an approximate mouth shape.
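
The PCA step above can be sketched roughly as follows. This is a minimal NumPy illustration, not the paper's code: the "mouth features" are synthetic random data, the sizes (200 frames, 36 features, 5 components) are arbitrary, and the names `fit_pca`, `encode`, and `decode` are made up for the example.

```python
import numpy as np

def fit_pca(shapes, n_components):
    """Return the mean shape and the top principal directions (rows)."""
    mean = shapes.mean(axis=0)
    centered = shapes - mean
    # Rows of vt are principal directions, ordered by explained variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return mean, vt[:n_components]

def encode(shapes, mean, basis):
    """Project mouth features onto the PCA basis (the 'mouth PCA' features)."""
    return (shapes - mean) @ basis.T

def decode(coeffs, mean, basis):
    """Map PCA coefficients back to approximate mouth features."""
    return coeffs @ basis + mean

# Synthetic stand-in for per-frame mouth features (assumption, not real data):
# 200 frames, 36 values per frame, driven by 5 hidden factors plus small noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 5))
mixing = rng.normal(size=(5, 36))
shapes = latent @ mixing + 0.01 * rng.normal(size=(200, 36))

mean, basis = fit_pca(shapes, n_components=5)
coeffs = encode(shapes, mean, basis)   # what a sequence model would be trained to predict
recon = decode(coeffs, mean, basis)    # approximate mouth features recovered from coefficients
```

In a setup like this, the sequence model would regress `coeffs` from audio features frame by frame, and `decode` would turn its predictions back into mouth features.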