
Transform your scene idea into ready-to-use 3D assets using Llama 3.1 8B, NVIDIA SANA, and Microsoft TRELLIS.

FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.

English text embedding model for question-answering retrieval.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

State-of-the-art model for language understanding, reasoning, and text generation.

Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.


State-of-the-art accuracy and speed for English transcriptions.

Enhance speech by correcting common audio degradations to create studio-quality speech output.

Create high-quality images using FLUX.1 in ComfyUI, guided by 3D.

FLUX.1 is a state-of-the-art suite of image generation models.

FLUX.1-schnell is a distilled image generation model that produces high-quality images quickly.