Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters (1)
9 models
Sort By
dateCreated:DESC
Most Recent
NVIDIA
Downloadable
llama-nemotron-embed-1b-v2
Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
Text-to-Embedding
+2
1.86M
1mo
NVIDIA
Downloadable
llama-nemotron-embed-vl-1b-v2
Multimodal question-answer retrieval representing user queries as text and documents as images.
nemo retriever
+3
9.31M
2mo
NVIDIA
Downloadable
llama-3_2-nemoretriever-300m-embed-v2
Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
Text-to-Embedding
+2
137
6mo
NVIDIA
Free Endpoint
llama-3_2-nemoretriever-300m-embed-v1
Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
Text-to-Embedding
+2
361K
8mo
NVIDIA
Deprecation in 5d
Downloadable
llama-3.2-nemoretriever-1b-vlm-embed-v1
Multimodal question-answer retrieval representing user queries as text and documents as images.
nemo retriever
+3
194K
9mo
NVIDIA
Downloadable
llama-3.2-nv-embedqa-1b-v2
Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.
nemo retriever
+3
2.37M
9mo
NVIDIA
Downloadable
nv-embedqa-e5-v5
English text embedding model for question-answering retrieval.
Embedding
+4
12.15M
9mo
NVIDIA
Free Endpoint
nv-embed-v1
Generates high-quality numerical embeddings from text inputs.
Non-Commercial Use Only
+2
4.48M
9mo
BAAI
Downloadable
bge-m3
Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.
Embeddings
+3
11.99M
1y
Items per page
24
1
1
of 1 pages