Dilip Singh logo
🤖

AImind Agent Hub

Unified Multi-Agent AI Platform

Client
Hureka Technologies Inc
Industry
AI / Enterprise Automation
Duration
12 months
Role
Lead AI Architect
4
Agent Types
40%
Cost Reduction
Qdrant
Vector DB
WebSocket
Streaming

Overview

A centralized multi-agent platform where Email, Voice, Chat, and Action agents share a single Qdrant-powered RAG brain — with per-tenant knowledge bases, multi-LLM support, and real-time streaming.

The Challenge

Clients needed multiple AI agents (email support, voice calls, web chat, automated actions) but duplicating knowledge bases across agents was expensive and inconsistent. Agents needed shared context, real-time streaming responses, and strict per-tenant data isolation across a growing customer base.

The Solution

Designed the Shared RAG Brain pattern: all agents read from a single Qdrant collection per tenant. Built FastAPI backend with WebSocket streaming, Celery workers for async email processing, and a common BaseAgent interface with specialized system prompts per agent type. Implemented namespace isolation for Qdrant collections, Redis keys, and Celery queues.

Results

  • 4 agent types (Email, Voice, Chat, Action) sharing one RAG knowledge base
  • Real-time streaming responses via FastAPI WebSocket
  • Per-tenant Qdrant collections with HNSW indexing
  • Embedding cache in Redis reducing inference costs by 40%
  • LangFuse observability across all agent interactions

Tech Stack

Next.js 14FastAPIQdrantPostgreSQLRedisCeleryDockerWebSocket