Architecture

A hybrid RAG system combining retrieval and generation for grounded conversations

System Overview

MindVault combines Azure Foundry (for semantic retrieval) with Claude API (for generation) to create conversations that are both knowledgeable and grounded in source materials.

Rather than fine-tuning models or storing parameters, MindVault uses in-context learning: each query retrieves relevant documents, which are then passed to Claude for thoughtful generation. This keeps the system transparent and cost-efficient.

The Hybrid RAG Flow

1

User Query

User sends a message in the chat interface

2

Frontend to API

Request routed to FastAPI backend with bot ID and message history

3

Semantic Retrieval

Query embedded and searched in Azure Foundry for relevant documents

4

Context Assembly

Retrieved documents combined with system prompt and conversation history

5

Claude Generation

Prompt sent to Claude API for coherent, in-character response

6

Response Delivery

Generated response + source documents returned to frontend

The 26 Bots

Each bot has a unique system prompt, data sources, and personality. The bots span five categories:

Literary (8 bots)

Characters from classic novels: Frankenstein's Creature, Sherlock Holmes, Captain Nemo, Alice, Dracula, The Time Traveller, Dr. Jekyll, The Invisible Man

Philosophers (4 bots)

Historical thinkers: Marcus Aurelius, Sun Tzu, Nietzsche, Machiavelli

Experts (10 bots)

Specialized knowledge: Mythologist, Battlefield Historian, Cryptid Hunter, Ancient Engineer, Plague Doctor, Codebreaker, Alchemist, Cartographer, Dream Interpreter, War Correspondent

NASA (3 bots)

Space exploration: Space Guide, Mission Control, Asteroid Tracker (with live NASA APIs)

Meta (1 bot)

The Architect: Knowledge about MindVault itself

Cost Model

MindVault is designed to be cost-efficient with transparent pricing.

Component Cost per 1000 ops Monthly (typical)
Claude API (prompt + completion) $0.80 (Sonnet 3.5) - $6.00 (Opus) $5-15
Azure Foundry (embedding + retrieval) $0.02 per embedding, $0.002 per query $0.50-2
Frontend hosting (static) - $3-5
FastAPI backend (Azure Container Apps) - $0.50-3
Total - $9-25

Costs scale linearly with usage. At 100 messages per day average, monthly costs stay under $20. Using Sonnet instead of Opus cuts costs by ~60%.

Tech Stack

Frontend

  • Astro 4 (static SSG)
  • React islands (ChatPanel)
  • Tailwind CSS
  • 26 custom bot themes

Backend

  • FastAPI (Python)
  • Azure Container Apps
  • Azure Key Vault (secrets)
  • Managed identity (RBAC)

Data

  • Azure Foundry (vector DB)
  • Semantic search
  • Project organization
  • ~2.5M tokens indexed

AI

  • Claude API (Sonnet/Opus)
  • System prompts per bot
  • In-context learning
  • Message history tracking

Design Philosophy

MindVault reflects several core design choices:

  • Transparency: Sources are always shown. Users know where information comes from.
  • Character Integrity: Each bot stays in character using carefully crafted system prompts.
  • Grounded Responses: Answers draw from real sources, not hallucinations.
  • Cost Efficiency: Semantic search + in-context learning vs. fine-tuning.
  • Scalability: Serverless architecture with managed identity and container orchestration.
  • Beautiful UX: 26 distinct themes, one for each bot, creating immersive experiences.

Built By

MindVault was built by Ian Kelley as a portfolio project demonstrating full-stack AI systems design.

The architecture prioritizes clarity, elegance, and maintainability over complexity.