The ability to connect real-world data to reasoning models unlocks new opportunities to improve the efficiency and accuracy of various clinical development processes, including R&D, literature review, protocol generation, clinical trial screening, and pharmacovigilance. In addition, the ability to create tools for LLMs to use gives agents the ability to interact with the domain-specific services that researchers use, such as domain specific models or APIs. Reasoning LLMs enable state-of-the-art planning, validation, and insight generation capabilities during the research process, while the agentic framework AgentIQ accelerates the production-ready agent workflows, and their unique data sources can be easily composed together to create specialized, production-ready agentic workflows. The underlying NIMs run securely and privately on any compute, thus protecting the privacy of a clinical development organization’s valuable real-world data. The combination of these capabilities in Biomedical AI-Q Research Agent increases the number and quality of insights generated from specialized and often proprietary biomedical data sources, improves the traceability and interpretability of experiments and research, decreases human hours spent generating and validating reports for regulatory and downstream knowledge building, and improves the ability of leaders and researchers to make informed decisions based on the knowledge generated.
Adapt Reasoning Agents to Scientific Workflows
Ease of development
Advanced semantic query
Fast reasoning
Users may have to wait 5–10 minutes for the instance to start, depending on cloud availability.
NVIDIA Container ToolKit
GPU Driver - 530.30.02 or later
CUDA version - 12.6 or later
The biomedical research agent blueprint supports the following hardware and system configurations:
Use | Service(s) | Recommended GPU* |
---|---|---|
Nemo Retriever Microservices for multi-modal document ingest | graphic-elements, table-structure, paddle-ocr, nv-ingest, embedqa | 1 x H100 80GB* 1 x A100 80GB |
Reasoning Model for Report Generation and RAG Q&A Retrieval | llama-3.3-nemotron-super-49b-v1 with a FP8 profile | 1 x H100 80 GB* 2 x A100 80GB |
Instruct Model for Report Generation | llama-3.3-70b-instruct | 2 x H100 80GB* 4 x A100 80GB |
Generative Model for Small Molecule Drug Development | nvcr.io/nim/nvidia/molmim:1.0.0 | Single Ampere/L40 GPU with at least 3 GB memory (doc) |
Generative Model for Molecular Docking | nvcr.io/nim/mit/diffdock:2.1.0 | 1 x H100 80GB 1 x A100 40GB 1 x A6000 48GB 1 x A10 24GB 1 x L40S 48GB (doc) |
Total | Entire Biomedical AIQ Research Agent | 5 x H100 80GB* 8 x A100 80GB |
This blueprint can be run entirely with hosted NVIDIA NIM Microservices, see https://build.nvidia.com/ for details.
NVIDIA believes Trustworthy AI is a shared responsibility, and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their supporting model team to ensure the models meet requirements for the relevant industry and use case and addresses unforeseen product misuse. For more detailed information on ethical considerations for the models, please see the Model Card++ Explainability, Bias, Safety & Security, and Privacy Subcards. Please report security vulnerabilities or NVIDIA AI Concerns here.
Use of the models in this blueprint is governed by the NVIDIA AI Foundation Models Community License.
GOVERNING TERMS: The NVIDIA Biomedical AI-Q Research Agent Developer Blueprint and Biomedical AI-Q Research Agent Brev launchable are governed by the Apache 2.0 License. The remaining software and materials are governed by the NVIDIA Software License Agreement and Product-Specific Terms for NVIDIA AI Products; except as follows: (a) the models, other than the Llama-3.3-Nemotron-Super-49B-v1 model, are governed by the NVIDIA Community Model License; (b) the Llama-3.3-Nemotron-Super-49B-v1 model is governed by the NVIDIA Open Model License Agreement; (c) the NeMo Retriever extraction is governed by the Apache 2.0 license, and (d) data from the RCSB Protein Data Bank is governed by CC0 1.0 Universal.
ADDITIONAL INFORMATION: For NVIDIA Retrieval QA Llama 3.2 1B Reranking v2 model, NeMo Retriever Graphic Elements v1 model, and NVIDIA Retrieval QA Llama 3.2 1B Embedding v2: Llama 3.2 Community License Agreement. For Llama-3.3-70b-Instruct model, Llama 3.3 Community License Agreement. Built with Llama.
Build advanced AI agents within the biomedical domain using the AI-Q Blueprint and the BioNeMo Virtual Screening Blueprint