Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

8 models (filtered by label: Coding)

Sarvamai

sarvam-m

Multilingual, hybrid-reasoning model optimized for Indian language tasks, programming, and mathematical reasoning.

coding

421K

7mo

Mistral AI

magistral-small-2506

High-performance reasoning model optimized for efficiency and edge deployment.

coding

3.41M

7mo

Qwen

qwq-32b

Powerful reasoning model that can achieve significantly enhanced performance on downstream tasks, especially hard problems.

coding

3.32M

8mo

DeepSeek AI

deepseek-r1-distill-llama-8b

Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.

Distillation

4.14M

8mo

DeepSeek AI

deepseek-r1-distill-qwen-32b

Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.

coding

2.44K · 4.17M

9mo

DeepSeek AI

deepseek-r1-distill-qwen-14b

Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.

coding

2.07K · 3.78M

9mo

DeepSeek AI

deepseek-r1-distill-qwen-7b

Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.

coding

2.21K · 4.11M

9mo

Tiiuae

falcon3-7b-instruct

Instruction-tuned LLM achieving SoTA performance on reasoning, math, and general knowledge tasks.

coding

489K

9mo
