NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

nvidia

Active Speaker Detection

DownloadableFree Endpoint

Detect and track speaker identities across video frames.

broadcast-loggingdubbinglocalizationnvidia ai for mediaspeaker detection
Get API Key
API ReferenceAPI Reference
Accelerated by DGX Cloud

Input

Drop files here.mp4
.mp4

You may upload a video, data and audio files, which we will use for the sole purpose of providing you with this demo experience. We retain this information for the duration of the demo experience. For more information about our data processing practices, see our Privacy Policy. By clicking "Upload File" or "Upload Video", you consent to our collection, recording, and use of such information and the NVIDIA API Trial Terms of Service.

Diarization File
.json

Upload a pre-generated JSON file to align known speaker segments with the video. For generating diarization data from an audio stream, refer to NVIDIA RIVA ASR Services.

Using free API for development

GOVERNING TERMS: This trial service is governed by the NVIDIA API Trial Terms of Service. Use of the models is governed by the NVIDIA Open Model License. Additional Information: MIT.

Output

Click Run to process video