State-of-the-art accuracy and speed for English transcriptions.
Riva uses gRPC APIs. Instructions below demonstrate usage of parakeet-ctc-0_6b-asr model using Python gRPC client.
You will need a system with Git and Python 3+ installed.
$ pip install -r https://raw.githubusercontent.com/nvidia-riva/python-clients/main/requirements.txt $ pip install --force-reinstall git+https://github.com/nvidia-riva/python-clients.git
Download Python client code by cloning Python Client Repository.
$ git clone https://github.com/nvidia-riva/python-clients.git
Open a command terminal and execute below command to transcribe audio. Make sure you have a speech file in 16-bit Mono format in WAV/OGG/OPUS container. If you have generated the API key, it will be auto-populated in the command.
$ python python-clients/scripts/asr/transcribe_file.py \ --server grpc.nvcf.nvidia.com:443 --use-ssl \ --metadata function-id "d8dd4e9b-fbf5-4fb0-9dba-8cf436c8d965" \ --metadata "authorization" "Bearer $API_KEY_REQUIRED_IF_EXECUTING_OUTSIDE_NGC" \ --language-code en-US \ --input-file <path_to_audio_file>
Riva uses gRPC APIs. Proto files can be downloaded from Riva gRPC Proto files and compiled to target language using Protoc compiler. Example Riva clients in C++ and Python languages are provided below.