NVIDIA
Explore
Models
Blueprints
GPUs
Docs

Deploy Models Now with NVIDIA NIM

Optimized inference for the world’s leading models
Free serverless APIs for development
Accelerated by DGX Cloud
Self-Host on your GPU infrastructure
Continuous vulnerability fixes
DiscoverModelsBlueprintsGPUsDocsForums

workstations

  • Run on RTX
  • Run on Spark

models

  • Reasoning
  • Vision
  • Visual Design
  • Retrieval
  • Speech
  • Biology
  • Simulation
  • Climate & Weather
  • Safety & Moderation

industries

  • Automotive
  • Financial Services
  • Gaming
  • Healthcare
  • Industrial
  • Robotics

Financial Services

Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2025 NVIDIA Corporation

Accelerate Financial Workflows With NVIDIA Technology

Developer examples designed for quick-start AI development in financial services, including artifacts like Docker containers and Jupyter Notebooks, allowing for fast deployment with tools like Docker compose and Brev Launchable.

nvidiaQuantitative Portfolio Optimization

Enable fast, scalable, and real-time portfolio optimization for financial institutions.

Blueprintalgorithmic tradingLaunchablecuoptdeveloper examplefinancial servicesportfolio optimization

nvidiaAI Model Distillation for Financial Data

Distill and deploy domain-specific AI models from unstructured financial data to generate market signals efficiently—scaling your workflow with the NVIDIA Data Flywheel Blueprint for high-performance, cost-efficient experimentation.

Nemotronalgorithmic tradingLaunchableblueprintNVIDIA AIdata flywheeldeveloper examplefinancial servicesllmnim

nvidiaFinancial Fraud Detection

Detect and prevent sophisticated fraudulent activities for financial services with high accuracy.

BlueprintFinancial ServicesFraud DetectionGNNPaymentsLaunchableNVIDIA AI

Explore NVIDIA Blueprints

Comprehensive reference workflows that accelerate application development and deployment, featuring NVIDIA acceleration libraries, APIs, and microservices for AI agents, digital twins, and more.

Enterprise

nvidiaBuild an AI Agent for Enterprise Research

Build a custom enterprise research assistant powered by state-of-the-art models that process and synthesize multimodal data, enabling reasoning, planning, and refinement to generate comprehensive reports.

BlueprintLlama NemotronNIMNeMo RetrieverReasoningRetrieval-Augmented GenerationEnterpriseLaunchableNVIDIA AI
Enterprise

nvidiaRefine AI Agents through Continuous Model Distillation with Data Flywheels

Build a data flywheel, with NVIDIA NeMo microservices, that continuously optimizes AI agents for latency and cost — while maintaining accuracy targets.

BlueprintData FlywheelNIMNeMo microservicesEnterpriseLaunchableNVIDIA AI
Enterprise

nvidiaBuild an Enterprise RAG Pipeline Blueprint

Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.

BlueprintNIMNeMo RetrieverNemotronRetrieval-Augmented GenerationEnterpriseLaunchableNVIDIA AI
Enterprise

nvidiaAI Weather Analytics with Earth-2

Develop AI powered weather analysis and forecasting application visualizing multi-layered geospatial data.

AI Weather PredictionBlueprintClimate ScienceEarth-2EnterpriseNVIDIA AIWeather Simulation
Enterprise

nvidiaBuild a Video Search and Summarization (VSS) Agent

Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A

BlueprintEnterpriseLaunchableNVIDIA AIchatgenerative AIvideo-to-textvision

Chat With Your Industry Domain Expertise

Leverage retrieval-augmented generation to ground large language models in your proprietary data.

Run Anywhere

nvidianvidia-nemotron-nano-9b-v2

High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.

reasoningthinking budget
Run Anywhere

metallama3-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

ChatLanguage GenerationLarge Language modelsText-to-TextCode Generation
Run Anywhere

nvidiacuopt

World-record accuracy and performance for complex route optimization.

Route Optimization
Run Anywhere

qwenqwen3-next-80b-a3b-thinking

80B parameter AI model with hybrid reasoning, MoE architecture, support for 119 languages.

ReasoningText-to-Text
Run Anywhere

nvidiaparakeet-1.1b-rnnt-multilingual-asr

High accuracy and optimized performance for transcription in 25 languages

ASRMultilingualNVIDIA NIMStreamingSpeech-to-Text
Run Anywhere

openaigpt-oss-120b

Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within 80GB GPU.

mathchatreasoningtext-to-text
Run Anywhere

nvidiaaudio2face-3d

Converts streamed audio to facial blendshapes for realtime lipsyncing and facial performances.

Audio-to-FaceDigital HumansNVIDIA NIMSpeech-to-Animation
Run Anywhere

hivedeepfake-image-detection

Advanced AI model detects faces and identifies deep fake images.

AI safetyContent moderationcomputer visiondeep fake detection