NVIDIA Developer Blog→ original

NVIDIA Released AI-Q for Deep Research in Agent Frameworks

NVIDIA launched AI-Q — an extension for agents (Claude Code, Codex) that adds deep research capabilities via a separate backend. The system processes requests i

AI-processed from NVIDIA Developer Blog; edited by Hamidun News
NVIDIA Released AI-Q for Deep Research in Agent Frameworks
Source: NVIDIA Developer Blog. Collage: Hamidun News.
◐ Listen to article

NVIDIA introduced AI-Q — a system for adding specialized deep research capabilities to agent frameworks. Instead of embedding research capabilities into general-purpose agents, AI-Q separates this function into a dedicated backend, allowing orchestrators like Claude Code to focus on workflow management.

Why Regular Agents Fall Short

Frameworks like Claude Code, Codex, and LangChain Deep Agents work great as orchestrators: they manage sessions, tool chains, execute code, and understand developer intent. But when it comes to serious research — synthesizing multiple documents, creating well-reasoned reviews, analysis with source citations — complexity skyrockets. Embedding this logic into an agent is inefficient and cumbersome.

How AI-Q Works

AI-Q operates as a separate skill — an additional capability that an agent can invoke. The system includes a complete research pipeline: intent classification, user dialogue for clarification, surface-level search for quick answers, and deep analysis for multi-source synthesis.

  • Request classification — determining whether a quick reference or full research is needed
  • Human clarification — if the question is unclear, the system asks clarifying questions
  • Surface-level search — quickly finding answers for simple queries
  • Deep analysis — synthesizing multiple sources and providing citations

Quality is assessed against standards like FreshQA and Deep Research Bench — NVIDIA uses real benchmarks to ensure the system delivers reliable results.

Enterprise Data Built-In

Here's a cool feature: AI-Q supports authenticated MCP (Model Context Protocol) servers as data sources. This means agents can research internal company documents without exposing them externally. NVIDIA has provided three authentication patterns: open servers, service accounts for shared enterprise data, and bearer tokens to preserve user identity.

"The complete research pipeline, including classification,

clarification, surface-level and deep analysis, is provided as a high-level capability"

Flexible Deployment

Organizations can deploy AI-Q on their own infrastructure — Docker Compose on a developer laptop or Helm in a regulated data center. Sensitive source data stays internal; only cited results go outside. This is critical for companies that don't want to send internal documents to the cloud.

What This Means

AI-Q demonstrates a 2026 trend: component specialization instead of universality. Agents become better when they can delegate complex tasks to specialized tools. For developers, this means deep research is now built into the ecosystem rather than the LLM itself — more reliable, more transparent, with its own sources.

ZK
Hamidun News
AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

What do you think?
Loading comments…