nvidia/parakeet-ctc-1.1b-asr
Record-setting accuracy and performance for English transcription.
Getting Started
Riva uses gRPC APIs. The instructions below demonstrate how to use the parakeet-ctc-1_1b-asr model with the Python gRPC client.
Prerequisites
You will need a system with Git and Python 3+ installed.
Install Riva Python Client
$ pip install -r https://raw.githubusercontent.com/nvidia-riva/python-clients/main/requirements.txt
$ pip install --force-reinstall git+https://github.com/nvidia-riva/python-clients.git
Download Python Client
Download the Python client code by cloning the Python client repository.
$ git clone https://github.com/nvidia-riva/python-clients.git
Run Python Client
Open a terminal and run the command below to transcribe audio. Make sure you have a speech file in 16-bit mono format in a WAV, OGG, or OPUS container. If you have generated an API key, it will be auto-populated in the command.
$ python python-clients/scripts/asr/transcribe_file.py \
    --server grpc.nvcf.nvidia.com:443 --use-ssl \
    --metadata function-id "1598d209-5e27-4d3c-8079-4751568b1081" \
    --metadata "authorization" "Bearer $API_KEY_REQUIRED_IF_EXECUTING_OUTSIDE_NGC" \
    --language-code en-US \
    --input-file <path_to_audio_file>
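The same request can also be made from Python code instead of the script. The following is a minimal sketch, not the official usage: it first checks the 16-bit mono requirement for WAV input with the standard-library `wave` module (OGG and OPUS files are not checked), then sends the file through the `riva.client` package installed earlier. The helper names `is_16bit_mono_wav`, `build_metadata`, and `transcribe` are hypothetical, as is the `API_KEY` environment variable. The `riva.client` calls follow the python-clients repository as best understood here, and whether the hosted endpoint accepts offline (non-streaming) requests is an assumption; verify both against the repository before relying on them.

```python
import os
import wave

# Values taken from the command above.
SERVER = "grpc.nvcf.nvidia.com:443"
FUNCTION_ID = "1598d209-5e27-4d3c-8079-4751568b1081"


def is_16bit_mono_wav(path):
    """Return True if `path` is a WAV file with one channel and 16-bit samples."""
    try:
        with wave.open(path, "rb") as wav:
            return wav.getnchannels() == 1 and wav.getsampwidth() == 2
    except (wave.Error, EOFError):
        # Not a readable WAV file (for example OGG/OPUS, or an empty file).
        return False


def build_metadata(function_id, api_key):
    """Mirror the two --metadata key/value pairs from the command line."""
    return [
        ["function-id", function_id],
        ["authorization", f"Bearer {api_key}"],
    ]


def transcribe(path, api_key):
    """Send one audio file for offline recognition and return the transcript."""
    # Imported lazily so the helpers above work without the Riva client installed.
    import riva.client

    auth = riva.client.Auth(
        use_ssl=True,
        uri=SERVER,
        metadata_args=build_metadata(FUNCTION_ID, api_key),
    )
    asr = riva.client.ASRService(auth)
    config = riva.client.RecognitionConfig(language_code="en-US", max_alternatives=1)
    with open(path, "rb") as fh:
        response = asr.offline_recognize(fh.read(), config)
    # Join the top alternative of each result segment.
    return " ".join(
        r.alternatives[0].transcript for r in response.results if r.alternatives
    )


# Example usage (assumes the API key is exported as API_KEY):
#   if is_16bit_mono_wav("sample.wav"):
#       print(transcribe("sample.wav", os.environ["API_KEY"]))
```

Checking the format locally avoids a network round trip for a file the service would reject. The script shown above remains the documented path.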
Support for gRPC clients in other languages
Riva uses gRPC APIs. Proto files can be downloaded from Riva gRPC Proto files and compiled to the target language using the protoc compiler. Example Riva clients in C++ and Python are provided below.
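As an illustration of the compile step, here is a minimal sketch that generates Python stubs with the `grpc_tools` package (installed via `pip install grpcio-tools`), which bundles protoc. The helper names `build_protoc_args` and `compile_protos` are hypothetical, and the proto path in the usage comment is an assumed layout; adjust it to match the proto files you actually downloaded.

```python
def build_protoc_args(proto_root, out_dir, proto_files):
    """Build the argument list that grpc_tools.protoc.main expects."""
    return [
        "grpc_tools.protoc",             # argv[0] placeholder, ignored by protoc
        f"-I{proto_root}",               # directory that imports are resolved against
        f"--python_out={out_dir}",       # message classes (*_pb2.py)
        f"--grpc_python_out={out_dir}",  # service stubs (*_pb2_grpc.py)
        *proto_files,
    ]


def compile_protos(proto_root, out_dir, proto_files):
    """Run protoc and return its exit code (0 on success)."""
    # Imported lazily so build_protoc_args works without grpcio-tools installed.
    from grpc_tools import protoc

    return protoc.main(build_protoc_args(proto_root, out_dir, proto_files))


# Example usage (paths are assumptions, not taken from this page):
#   compile_protos("common", "generated", ["riva/proto/riva_asr.proto"])
```

For other target languages, the standalone protoc compiler takes the same `-I` and input arguments with a different output flag, such as `--cpp_out` for C++.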