Back

VOCASET

4D Face Dataset with Voice Animation

VOCASET

VOCASET is a 4D face dataset with about 29 minutes of high-fidelity 3D scans captured at 60 fps (with a 3dMD head scanner) and synchronized audio. In total, the dataset contains audio-4D scan pairs captured from 6 female and 6 male subjects. For each subject, the dataset contains 40 sequences of English spoken sentences, each of length three to five seconds. Publicly available are raw scanner data (i.e. raw audio-4D scan pairs), registered data (i.e. in FLAME topology), and unposed data (i.e. registered data where effects of global rotation, translation, and head rotation around the neck are removed).

Try V7 now
->
View author website
Task
3D Reconstruction / Photogrammetry
Annotation Types
3D Point Cloud
Items
Classes
Labels
Models using this dataset
Last updated on 
October 31, 2023
Licensed under 
Research Only
Blog
Learn about machine learning and latests advancements in AI.
Read More
Playbooks
Discover how to optimize AI for your business.
Learn more
Case Studies
Discover how V7 empowers AI industry greats.
Explore now
Webinars
Explore AI topics, gain insights, and learn from experts.
Watch now