
Detect and track speaker identities across video frames.
You may upload video, data, and audio files, which we will use for the sole purpose of providing you with this demo experience. We retain this information for the duration of the demo experience. For more information about our data processing practices, see our Privacy Policy. By clicking "Upload File" or "Upload Video", you consent to our collection, recording, and use of such information and agree to the NVIDIA API Trial Terms of Service.
Upload a pre-generated JSON file of diarization data to align known speaker segments with the video. To generate diarization data from an audio stream, refer to NVIDIA RIVA ASR Services.
GOVERNING TERMS: This trial service is governed by the NVIDIA API Trial Terms of Service. Use of the models is governed by the NVIDIA Open Model License. Additional Information: MIT.