NVIDIA

Copyright © 2026 NVIDIA Corporation

12 results for

Filters

  • Free Endpoint
    2
  • Partner Endpoint
    11
  • Download Available
    10
  • Code Generation
    6
  • Deep Infra
    8
  • Together AI
    5
  • CoreWeave
    2
  • GMI Cloud
    2
  • Digital Ocean
    1
  • Meta
    4
  • Mistral AI
    3
  • NVIDIA
    3
  • Moonshotai
    1
  • Qwen
    1

    Mistral AI
    Deprecation in 1d, Free Endpoint

    magistral-small-2506

    High performance reasoning model optimized for efficiency and edge deployment
    Model
    coding
    1.23M
    10mo
    Moonshotai
    Deprecation in 2d, Free Endpoint

    kimi-k2-instruct

    State-of-the-art open mixture-of-experts model with strong reasoning, coding, and agentic capabilities
    Model
    coding
    10.35M
    9mo
    NVIDIA
    Downloadable

    llama-3.1-nemotron-nano-8b-v1

    Leading reasoning and agentic AI accuracy model for PC and edge.
    Model
    math
    1.03M
    10mo
    NVIDIA
    Downloadable

    llama-3.3-nemotron-super-49b-v1

    High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
    Model
    math
    3.42M
    9mo
    NVIDIA
    Downloadable

    llama-3.3-nemotron-super-49b-v1.5

    High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
    Model
    math
    3.06M
    9mo
    Mistral AI
    Downloadable

    mixtral-8x22b-instruct-v0.1

    An MOE LLM that follows instructions, completes requests, and generates creative text.
    Model
    Advanced Reasoning
    2.21M
    9mo
    Mistral AI
    Downloadable

    mixtral-8x7b-instruct-v0.1

    An MOE LLM that follows instructions, completes requests, and generates creative text.
    Model
    Advanced Reasoning
    648K
    9mo
    Meta
    Downloadable

    llama-3.3-70b-instruct

    Advanced LLM for reasoning, math, general knowledge, and function calling
    Model
    Instruction following
    11.73M
    11mo
    Meta
    Downloadable

    llama-3.1-8b-instruct

    Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    23.03M
    10mo
    Meta
    Downloadable

    llama-3.2-1b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    chat
    28.38K · 422K
    11mo
    Meta
    Downloadable

    llama-3.2-3b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    18.42K · 1.13M
    11mo
    Qwen
    Deprecation in 2d, Downloadable

    qwen2.5-coder-32b-instruct

    Advanced LLM for code generation, reasoning, and fixing across popular programming languages.
    Model
    code completion
    2.58M
    10mo