NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

23 results for

Filters

  • Free Endpoint
    10
  • Partner Endpoint
    8
  • Download Available
    14
  • Retrieval Augmented Generation
    5
  • Text-to-Embedding
    4
  • Code Generation
    2
  • Image-to-Text
    2
  • Speech-to-Text
    2
  • Together AI
    5
  • Deep Infra
    4
  • Fireworks AI
    4
  • Bitdeer AI
    2
  • CoreWeave
    2
  • NVIDIA
    8
  • Mistral AI
    3
  • Igenius
    2
  • Meta
    2
  • Microsoft
    2
  • NVIDIA
    Downloadable

    magpie-tts-multilingual

    Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.
    Model
    TTS
    33.54K
    8mo
    NVIDIA
    Downloadable

    parakeet-1.1b-rnnt-multilingual-asr

    High accuracy and optimized performance for transcription in 25 languages
    Model
    Automatic Speech Recognition
    30.51K
    10mo
    NVIDIA
    Free Endpoint

    llama-3.1-nemotron-safety-guard-8b-v3

    Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs
    Model
    content moderation
    609K
    4mo
    Sarvamai
    Downloadable

    sarvam-m

    Multilingual, hybrid-reasoning model optimized for Indian language tasks, programming, mathematical reasoning capabilities.
    Model
    coding
    551K
    7mo
    Mistral AI
    Free Endpoint

    magistral-small-2506

    High performance reasoning model optimized for efficiency and edge deployment
    Model
    coding
    4.44M
    8mo
    Mistral AI
    Downloadable

    mistral-small-24b-instruct

    Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.
    Model
    chat
    605K
    8mo
    Utter-project
    Downloadable

    eurollm-9b-instruct

    State-of-the-art, multilingual model tailored to all 24 official European Union languages.
    Model
    chat
    4.34K534K
    9mo
    Z.ai
    Free Endpoint

    glm-4.7

    GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
    Model
    Tool Calling
    15.91M
    1mo
    Opengpt-x
    Downloadable

    teuken-7b-instruct-commercial-v0.4

    Multilingual 7B LLM, instruction-tuned on all 24 EU languages for stable, culturally aligned output.
    Model
    sovereign ai
    534K
    7mo
    OpenAI
    Downloadable

    whisper-large-v3

    Robust Speech Recognition via Large-Scale Weak Supervision.
    Model
    ASR
    60.14K
    11mo
    Meta
    DownloadableFree Endpoint

    llama-4-scout-17b-16e-instruct

    A multimodal, multilingual 16 MoE model with 17B parameters.
    Model
    language generation
    64.95K
    8mo
    Google
    Downloadable

    gemma-3-1b-it

    A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications
    Model
    chat
    4.45K550K
    10mo
    NVIDIA
    Downloadable

    llama-3.2-nv-rerankqa-1b-v2

    Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.
    Model
    nemo retriever
    179K
    8mo
    NVIDIA
    Free Endpoint

    llama-3_2-nemoretriever-300m-embed-v1

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Model
    Text-to-Embedding
    97.31K
    7mo
    NVIDIA
    Downloadable

    llama-3_2-nemoretriever-300m-embed-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Model
    Text-to-Embedding
    86.62K
    5mo
    Meta
    Free Endpoint

    llama-4-maverick-17b-128e-instruct

    A general purpose multimodal, multilingual 128 MoE model with 17B parameters.
    Model
    chat
    3.96M
    8mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-1b-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Model
    Text-to-Embedding
    523K
    2w
    Mistral AI
    Free Endpoint

    mistral-small-3.1-24b-instruct-2503

    Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses
    Model
    chat
    2.22M
    10mo
    Microsoft
    Free Endpoint

    phi-3.5-mini-instruct

    Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
    Model
    chat
    8.64M
    1y
    Microsoft
    Downloadable

    phi-4-mini-instruct

    Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
    Model
    chat
    2.82M
    10mo
    Igenius
    Free Endpoint

    italia_10b_instruct_16k

    Multilingual LLM with emphasis on European languages supporting regulated use cases including financial services, government, heavy industry
    Model
    chat
    535K
    10mo
    NVIDIA
    Downloadable

    llama-3.2-nv-embedqa-1b-v2

    Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.
    Model
    nemo retriever
    6.45M
    8mo
    Igenius
    Free Endpoint

    colosseum_355b_instruct_16k

    NVIDIA DGX Cloud trained multilingual LLM designed for mission critical use cases in regulated industries including financial services, government, heavy industry
    Model
    chat
    81.25K
    10mo
    Items per page
    of 1 pages