Converts streamed audio to facial blendshapes for real-time lip-syncing and facial performances.
NVIDIA Audio2Face-3D is a microservice that animates a 3D character's facial features to match any audio track, whether for a game, film, or real-time digital assistant. This model is designed for commercial use.
NVIDIA Audio2Emotion is embedded within Audio2Face and is designed to automatically recognize the emotions in human speech. Its predictions drive the Audio2Face avatar’s facial expressions so that the animation looks more natural.
EULA information is available here. Customer will use the Software exclusively for authorized purposes, consistent with the Agreement’s terms and all applicable laws, regulations and the rights of others.
Architecture Type:
Network Architecture
Input Type(s): Audio
Input Format: .wav
Input Parameters: 2D: (Tuning Parameters and Audio)
Other Properties Related to Input: Supported sampling rates: 16 kHz, 22.05 kHz, 44.1 kHz. All audio is resampled to 16 kHz. There is no maximum audio length.
Output Type(s):
Output Format: Custom Protobuf Format
Output Parameters: 2D: Custom Protobuf Format
Other Properties Related to Output: N/A
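The input constraints above (.wav input, three supported sampling rates, everything resampled to 16 kHz) can be illustrated with a small client-side sketch. This is not the microservice's own code: the function names (`read_mono_pcm16`, `check_rate`, `resample_linear`) are hypothetical, the restriction to mono 16-bit PCM is an assumption made to keep the example short, and linear interpolation is used only as a stand-in because the card does not say which resampling method the service applies.

```python
import io
import struct
import wave

# Sampling rates the model card lists as supported inputs.
SUPPORTED_RATES_HZ = {16_000, 22_050, 44_100}
# The card states that all audio is resampled to 16 kHz.
TARGET_RATE_HZ = 16_000


def read_mono_pcm16(wav_bytes: bytes) -> tuple[int, list[int]]:
    """Return (sample_rate, samples) for a mono 16-bit PCM WAV payload.

    Assumption: mono 16-bit PCM only, to keep this sketch small.
    """
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        if wav.getnchannels() != 1 or wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles mono 16-bit PCM")
        rate = wav.getframerate()
        frames = wav.readframes(wav.getnframes())
    samples = list(struct.unpack(f"<{len(frames) // 2}h", frames))
    return rate, samples


def check_rate(rate: int) -> None:
    """Raise ValueError if the rate is not one the card lists as supported."""
    if rate not in SUPPORTED_RATES_HZ:
        raise ValueError(f"unsupported sampling rate: {rate} Hz")


def resample_linear(samples: list[int], src_rate: int,
                    dst_rate: int = TARGET_RATE_HZ) -> list[float]:
    """Naive linear-interpolation resampler (illustrative stand-in).

    It applies no anti-aliasing filter, so it is not production quality.
    """
    if src_rate == dst_rate or not samples:
        return [float(s) for s in samples]
    n_out = len(samples) * dst_rate // src_rate
    out = []
    for i in range(n_out):
        pos = i * src_rate / dst_rate      # position in the source signal
        lo = int(pos)
        hi = min(lo + 1, len(samples) - 1)
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out
```

A caller would typically run `check_rate` on the value returned by `read_mono_pcm16` before sending audio. The resampling step is shown only to make the 16 kHz target concrete, since the card indicates the service resamples incoming audio itself.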
Runtime Engine(s):
Supported Hardware Microarchitecture Compatibility:
Audio2Face:
Audio2Emotion:
Data Collection Method by dataset:
Labeling Method by dataset:
Properties (Quantity, Dataset Descriptions, Sensor(s)):
Engine: TensorRT
Test Hardware: A100
NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.
For more detailed information on ethical considerations for this model, please see the Model Card++ Explainability, Bias, Safety & Security, and Privacy Subcards here.
Please report security vulnerabilities or NVIDIA AI Concerns here.
AI models generate responses and outputs based on complex algorithms and machine learning techniques, and those responses or outputs may be inaccurate or indecent. By testing this model, you assume the risk of any harm caused by any response or output of the model. Please do not upload any confidential information or personal data. Your use is logged for security.