Try NVIDIA NIM APIs

State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.

13.91M

2mo

Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more

10.59M

2mo

Open reasoning model with 256K context window, native INT4 quantization and enhanced tool use.

2.83M

2mo

Follow-on version of Kimi-K2-Instruct with longer context window and enhanced reasoning capabilities.

10.27M

5mo

Excels in agentic coding and browser use and supports 256K context, delivering top results.

2.89M

6mo