
Copyright © 2026 NVIDIA Corporation

26 results for

    NVIDIA
    Downloadable

    audio2face-3d

    Converts streamed audio to facial blendshapes for real-time lip-syncing and facial performances.
    Model
    Digital Humans
    9mo
    NVIDIA
    Downloadable

    canary-1b-asr

    Multilingual model supporting speech-to-text recognition and translation.
    Model
    Automatic Speech Recognition
    6.31K
    11mo
    NVIDIA
    Free Endpoint

    gliner-pii

    GLiNER PII detects Personally Identifiable Information in text.
    Model
    PII Detection
    177K
    1mo
    NVIDIA
    Free Endpoint

    magpie-tts-flow

    Expressive and engaging text-to-speech, generated from a short audio sample.
    Model
    TTS
    829
    8mo
    NVIDIA
    Downloadable

    magpie-tts-multilingual

    Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.
    Model
    TTS
    42.34K
    9mo
    NVIDIA
    Free Endpoint

    magpie-tts-zeroshot

    Expressive and engaging text-to-speech, generated from a short audio sample.
    Model
    TTS
    1.24K
    9mo
    NVIDIA
    Downloadable

    maisi

    MAISI is a pre-trained volumetric (3D) CT Latent Diffusion Generative Model.
    Model
    Image Generation
    1.18K
    1y
    NVIDIA
    Downloadable

    megatron-1b-nmt

    Enable smooth global interactions in 36 languages.
    Model
    Neural machine translation
    11mo
    Mistral AI
    Free Endpoint

    mistral-7b-instruct-v0.2

    This LLM follows instructions, completes requests, and generates creative text.
    Model
    chat
    274K
    10mo
    NVIDIA
    Downloadable

    nemotron-asr-streaming

    Real-time speech recognition for English.
    Model
    Automatic Speech Recognition
    12.67K
    2w
    NVIDIA
    Free Endpoint

    nemotron-voicechat

    Nemotron 3 Voicechat
    Model
    English
    5.1K
    2w
    NVIDIA
    Free Endpoint

    nv-dinov2

    NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.
    Model
    computer vision
    1.18M
    1y
    NVIDIA
    Free Endpoint

    nv-grounding-dino

    Grounding DINO is an open-vocabulary, zero-shot object detection model.
    Model
    Object Detection
    4.98K
    1y
    NVIDIA
    Downloadable

    parakeet-1.1b-rnnt-multilingual-asr

    High accuracy and optimized performance for transcription in 25 languages.
    Model
    Automatic Speech Recognition
    7.63K
    11mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-asr

    State-of-the-art accuracy and speed for English transcriptions.
    Model
    ASR
    3.97K
    9mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-es

    Accurate and optimized Spanish-English transcriptions with punctuation and word timestamps.
    Model
    ASR
    70
    6mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-vi

    Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
    Model
    ASR
    879
    6mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-cn

    Record-setting accuracy and performance for Mandarin-English transcriptions.
    Model
    ASR
    5.88K
    6mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-tw

    Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
    Model
    ASR
    288
    5mo
    NVIDIA
    Downloadable

    parakeet-ctc-1.1b-asr

    Record-setting accuracy and performance for English transcription.
    Model
    ASR
    64.37K
    9mo
    NVIDIA
    Downloadable

    parakeet-tdt-0.6b-v2

    Accurate and optimized English transcriptions with punctuation and word timestamps.
    Model
    ASR
    2.3K
    8mo
    NVIDIA
    Free Endpoint

    retail-object-detection

    EfficientDet-based object detection network to detect 100 specific retail objects from an input video.
    Model
    Object Detection
    511
    1y
    NVIDIA
    Downloadable

    riva-translate-1.6b

    Enable smooth global interactions in 36 languages.
    Model
    Neural machine translation
    4.08K
    9mo
    NVIDIA
    Free Endpoint

    riva-translate-4b-instruct-v1_1

    Translation model for 12 languages with support for few-shot example prompts.
    Model
    nvidia nim
    252K
    3mo