×
Feb 25, 2024 · In this paper, we propose AVI-Talking, an Audio-Visual Instruction system for expressive Talking face generation. This system harnesses the ...
Missing: expressions/ url? q=https://arxiv.org/html/2402.16124v1
Apr 11, 2024 · To address this gap, we introduce Audio Dialogues: a multi-turn dialogue dataset containing 163.8k samples for general audio sounds and music.
Missing: expressions/ q=https://arxiv.org/html/2402.16124v1
Mar 19, 2024 · We propose a novel audio-visual method for compound expression recognition. Our method relies on emotion recognition models that fuse modalities ...
Missing: url? q=https://arxiv.org/html/2402.16124v1
Aug 26, 2022 · This paper presents MuLan: a first attempt at a new generation of acoustic models that link music audio directly to unconstrained natural ...
Missing: expressions/ q=https://arxiv.org/html/2402.16124v1
Feb 26, 2024 · This study investigates the inclusion of emotion as a novel contextual cue within AVSE, hypothesizing that incorporating emotional understanding ...
Feb 28, 2024 · In this paper, we delve into the problem of AR-MOT from the perspective of audio-video fusion and audio-video tracking. ... expressions and visual ...
Feb 14, 2024 · Over the past few years, arXiv has made good progress in making our website more accessible according to W3C WAI guidelines. While this ...
Missing: expressions/ url? q=https://arxiv.org/html/2402.16124v1
Apr 16, 2024 · More recent efforts have expanded the scope to include a broader array of facial expressions and head movements derived from audio inputs.
In order to show you the most relevant results, we have omitted some entries very similar to the 8 already displayed. If you like, you can repeat the search with the omitted results included.