nvidia/audio2face-3d
Converts streamed audio to facial blendshapes for realtime lipsyncing and facial performances.
Model Overview
Description
NVIDIA Audio2Face-3D is a microservice that animates a 3D character's face to match any audio track, whether for a game, film, or real-time digital assistant. This model is designed for commercial use.
NVIDIA Audio2Emotion is embedded within Audio2Face and automatically recognizes the emotions in human speech. These predictions drive the Audio2Face avatar's facial expressions, making the performance even more natural.
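The service consumes audio as a stream. As a rough illustration, a client might read a WAV file and emit fixed-size PCM chunks. This is a minimal sketch only: the chunk size and the plain Python generator below are assumptions for illustration, not the service's actual streaming protocol.

```python
import wave

# 100 ms of samples at 16 kHz; an assumed chunk size, not a service requirement
CHUNK_FRAMES = 1600

def stream_wav_chunks(path: str):
    """Yield raw PCM chunks from a WAV file, the way a client might
    feed audio to a streaming endpoint."""
    with wave.open(path, "rb") as wav:
        while True:
            chunk = wav.readframes(CHUNK_FRAMES)
            if not chunk:
                break
            yield chunk
```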
Licenses
EULA information is available here. Customer will use the Software exclusively for authorized purposes, consistent with the Agreement's terms and all applicable laws, regulations, and the rights of others.
Model Architecture
Architecture Type:
- Audio2Face: CNN
- Audio2Emotion: Transformer
Network Architecture:
- Audio2Face: wav2vec2.0
- Audio2Emotion: wav2vec2.0
Input
Input Type(s): Audio
Input Format: .wav
Input Parameters: 2D (tuning parameters and audio)
Other Properties Related to Input: Supported sampling rates: 16 kHz, 22.05 kHz, and 44.1 kHz; all audio is resampled to 16 kHz (see the preprocessing sketch below). There is no maximum audio length.
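Since all audio is resampled to 16 kHz internally, clients can resample up front. A minimal sketch, assuming the third-party librosa and soundfile packages (not part of the service):

```python
import librosa
import soundfile as sf

def prepare_audio(in_path: str, out_path: str, target_sr: int = 16_000) -> None:
    """Downmix a WAV file to mono and resample it to 16 kHz PCM."""
    # librosa.load resamples to the requested rate and downmixes to mono
    audio, _ = librosa.load(in_path, sr=target_sr, mono=True)
    sf.write(out_path, audio, target_sr, subtype="PCM_16")
```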
Output
Output Type(s):
- Audio2Face: Blendshape coefficients representing 3D facial animation over time
- Audio2Emotion: Emotion probability coefficients representing 1D emotion values over time
Output Format: Custom Protobuf Format (see the illustrative sketch after this list)
Output Parameters: 2D
Other Properties Related to Output: N/A
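The exact message schema is NVIDIA's custom protobuf format and is not reproduced here. Purely to illustrate the shape of the data, the hypothetical structure below models what a decoded frame carries: per-frame blendshape weights from Audio2Face and per-frame emotion probabilities from Audio2Emotion. All field names are invented for illustration.

```python
from dataclasses import dataclass, field

@dataclass
class AnimationFrame:
    """Hypothetical stand-in for one decoded output frame; the real
    service uses a custom protobuf schema not shown here."""
    time_sec: float                                              # timestamp on the audio timeline
    blendshapes: dict[str, float] = field(default_factory=dict)  # e.g. {"jawOpen": 0.42}
    emotions: dict[str, float] = field(default_factory=dict)     # e.g. {"joy": 0.81, "neutral": 0.12}

def dominant_emotion(frame: AnimationFrame) -> str:
    # Return the highest-probability emotion label for this frame.
    return max(frame.emotions, key=frame.emotions.get)
```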
Software Integration
Runtime Engine(s):
- DeepStream-7.1
Supported Hardware Microarchitecture Compatibility:
- NVIDIA Ampere
- NVIDIA Hopper
- NVIDIA Ada Lovelace
- NVIDIA Pascal
- NVIDIA Turing
Preferred/Supported Operating System(s):
- Linux
- WSL
Model Versions:
Audio2Face:
- Mark v2.3
- Claire v2.3
- James v2.3
Audio2Emotion:
- v1.0
Training and Evaluation Dataset
Data Collection Method by dataset:
- Audio2Face: Human
- Audio2Emotion: Automated
Labeling Method by dataset:
- Human
Properties (Quantity, Dataset Descriptions, Sensor(s)):
- Audio2Face: Multi-speaker English audio from microphone, resampled to 16 kHz, across multiple audio types and frequency ranges.
- Audio2Emotion: Multi-speaker English audio from microphone, resampled to 16 kHz, across multiple audio types and frequency ranges. The training data comprises multiple datasets, including RAVDESS, CREMA-D, JL, and Lindy & Rodney. Total quantity: ~18,000 samples.
Inference
Engine: TensorRT
Test Hardware: A100
Ethical Considerations
NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.
For more detailed information on ethical considerations for this model, please see the Model Card++ Explainability, Bias, Safety & Security, and Privacy Subcards here.
Please report security vulnerabilities or NVIDIA AI Concerns here.
Disclaimer
AI models generate responses and outputs based on complex algorithms and machine learning techniques, and those responses or outputs may be inaccurate or indecent. By testing this model, you assume the risk of any harm caused by any response or output of the model. Please do not upload any confidential information or personal data. Your use is logged for security.