FaceXHuBERT: Text-less Speech-driven E(X)pressive 3D Facial Animation Synthesis Using Self-Supervised Speech Representation Learning

Haque, Kazi Injamamul; Yumak, Zerrin

doi:https://doi.org/10.48550/arXiv.2303.05416

FaceXHuBERT: Text-less Speech-driven E(X)pressive 3D Facial Animation Synthesis Using Self-Supervised Speech Representation Learning

DSpace/Manakin Repository

FaceXHuBERT: Text-less Speech-driven E(X)pressive 3D Facial Animation Synthesis Using Self-Supervised Speech Representation Learning

Haque, Kazi Injamamul; Yumak, Zerrin

(2023) Utrecht University Repository

(Preprint)

Abstract

This paper presents FaceXHuBERT, a text-less speech-driven 3D facial animation generation method that allows to capture personalized and subtle cues in speech (e.g. identity, emotion and hesitation). It is also very robust to background noise and can handle audio recorded in a variety of situations (e.g. multiple people speaking). Recent ... read more

Download/Full Text

Open Access version via Utrecht University Repository

Preprint

Keywords: cs.CV, cs.AI, I.2.0; I.3.0; I.5.0

DOI: https://doi.org/10.48550/arXiv.2303.05416

Publisher: arXiv

Note: 13 pages, 4 figures, code included

See more statistics about this item