Speech driven lip synthesis using viseme based hidden markov models

Created on 2022-05-25T10:29:21-05:00

Vector Quantization: audio features clustered in to a fixed number of prototypes and incoming sound is matched to the closest prototype.

Direct Estimation: directly compute the output features with some math formula.

Visemes: shape of the mouth, used when certain sounds are voiced