Updated Agents & RAG Architecture: Scalable AI Framework

Background

In 2023, Andreessen Horowitz published a fantastic architecture on building and deploying RAG-based LLM solutions (link here).

Inspired by their work, we’ve developed and iterated on the framework to better support the integration of both RAG and Agents in modern Generative AI applications.

Updated Reference Architecture for Agents & RAG

Below is our Updated Reference Architecture for Agents & RAG, designed to meet today’s requirements for flexibility, scalability, and interactive AI solutions.

Key Changes and Why?

Expanded the LLM APIs and Hosting and introduced Agents

Agents from Marketplace – Introduced a dedicated section for marketplace agents (e.g., OpenAI) to facilitate plug-and-play integration.
Custom Agents – Added custom agent support to enable tailored workflows and specific use cases.
Added Custom Models – Added a block to allow organizations to build and deploy custom models.

Internal/External Systems

Agents will interact directly with both internal and external systems, represented with a new block for seamless integration.

Orchestration

Enhanced the orchestration layer to include Routing, ensuring agents can operate efficiently across multiple workflows.

Interactive Experiment Playground

Generalized the playground to accommodate all configurations and experimentation by power users.
Replaced the “Prompt-few shot examples” label with Power Users, reflecting their expanded role beyond just prompts.

Legend

Updated the legend to include Interactions/Actions, representing any interaction type—be it scripts, function calls, or API invocations.

What’s Next

Let us know if you’d like more details! We can expand on any section, or you can simply ask your favourite LLM to customize this content for your needs.

Read More: A Practical Guide to RAG for Business Leaders

The Agents are coming. Are you ready to deploy yours?

At LUMIQ, we can help you deploy your own “army” of Agents. Want to know how? Reach out to us at [email protected].

[Updated] Reference Architecture for Agents & RAG

Background

Updated Reference Architecture for Agents & RAG

Key Changes and Why?

What’s Next

The Agents are coming. Are you ready to deploy yours?

Leave a comment

You May Also Like

The Looming Failure of LLM Investments: What You’re Missing

AI Knowledge Workers: LiteCone Launch Transforms Insurance at AWS AI Conclave 2025