DevChoreo — System Architecture

AI-Powered RAG Assistant for the WSO2 Choreo Platform

👤 Web Browser · HTTPS
FRONTEND React 18 + Vite + Tailwind CSS
💬 Chat UI App.jsx · ChatInput.jsx
📝 Message Renderer Markdown · Code · Sources
📊 Mermaid Diagrams Interactive Zoom · Fullscreen
📂 Sidebar Multi-Chat · Search · CRUD
⚡ SSE Streaming Progressive Token Display
💾 localStorage History · Theme · Summary
BACKEND API GATEWAY FastAPI · Python 3.12+ · Uvicorn ASGI
/api/ask/stream
/api/ask
/api/ask_graph
/api/ingest/github
/api/ingest/org
/api/webhook/github
/api/health
/metrics
CORS Middleware · Metrics Middleware · Lazy Initialization
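The /api/ask/stream endpoint is described as delivering tokens progressively over SSE. The following is a minimal sketch of how such a stream could be framed on the wire. The function names sse_event and stream_tokens, the "token" payload key, and the closing "done" event are illustrative assumptions; none of them come from the DevChoreo codebase.

```python
import json
from typing import Iterable, Iterator, Optional


def sse_event(data: dict, event: Optional[str] = None) -> str:
    """Frame one payload as a Server-Sent Events message.

    Hypothetical helper: the payload shape is an assumption.
    """
    lines = []
    if event is not None:
        lines.append(f"event: {event}")
    # json.dumps escapes newlines inside strings, so one data: line suffices.
    lines.append(f"data: {json.dumps(data)}")
    # A blank line terminates each SSE message.
    return "\n".join(lines) + "\n\n"


def stream_tokens(tokens: Iterable[str]) -> Iterator[str]:
    """Yield one SSE message per token, then a final 'done' event."""
    for token in tokens:
        yield sse_event({"token": token})
    yield sse_event({}, event="done")


if __name__ == "__main__":
    for frame in stream_tokens(["Hel", "lo"]):
        print(repr(frame))
```

In a FastAPI handler, a generator like this could plausibly be wrapped in a StreamingResponse with media_type="text/event-stream"; whether DevChoreo does so is not stated in the diagram.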
CORE SERVICES
🧠 AI & RAG Pipeline
LLMService GPT-4 Chat · ada-002 Embeddings · Streaming
ConversationMemoryManager Token Tracking · Smart Summarization · 75% Trigger
ContextManager Query → Embed → Vector Search → Retrieve
DiagramDetectionService Mermaid Type Detection · Prompt Enhancement
RAGGraph LangGraph · LangChain · Multi-step Reasoning
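ConversationMemoryManager is listed with token tracking and a 75% summarization trigger. One way such a trigger might work is sketched below. Everything except the 0.75 ratio is an assumption: the class name MemorySketch, the 8000-token budget, the four-characters-per-token estimate, and the keep_recent setting are hypothetical stand-ins, and the summarize callback represents whatever LLM call the real service makes.

```python
from dataclasses import dataclass, field
from typing import Callable, List


def estimate_tokens(text: str) -> int:
    """Rough heuristic (about 4 characters per token), not a real tokenizer."""
    return max(1, len(text) // 4)


@dataclass
class MemorySketch:
    max_tokens: int = 8000        # hypothetical context budget
    trigger_ratio: float = 0.75   # the 75% trigger named in the diagram
    keep_recent: int = 2          # hypothetical: messages kept verbatim
    summary: str = ""
    messages: List[str] = field(default_factory=list)

    def used_tokens(self) -> int:
        """Estimated tokens held by the summary plus all stored messages."""
        used = sum(estimate_tokens(m) for m in self.messages)
        if self.summary:
            used += estimate_tokens(self.summary)
        return used

    def should_summarize(self) -> bool:
        return self.used_tokens() >= self.trigger_ratio * self.max_tokens

    def add(self, message: str,
            summarize: Callable[[str, List[str]], str]) -> bool:
        """Store a message; fold older ones into the summary if over budget.

        Returns True when summarization ran.
        """
        self.messages.append(message)
        if not self.should_summarize():
            return False
        split = max(0, len(self.messages) - self.keep_recent)
        older, recent = self.messages[:split], self.messages[split:]
        if not older:
            return False
        self.summary = summarize(self.summary, older)
        self.messages = recent
        return True
```

The summarize callback receives the previous summary and the messages being retired, so the running summary can be extended rather than rebuilt from scratch.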
📦 Data Processing
IngestionService GitHub → Chunk → Embed → Store · SHA Dedup
DocumentChunker 3000-char Chunks · 200-char Overlap · Pre-split
GitHubService API Client · Org Bulk Ingestion · Webhooks
MarkdownProcessor Clean · Normalize · Remove Images
ImageProcessingService Google Vision OCR · Text Extraction
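DocumentChunker is described as producing 3000-character chunks with a 200-character overlap. A minimal sliding-window sketch under those two parameters follows. The function name chunk_text is hypothetical, and the "Pre-split" step mentioned in the diagram is not modeled here because its behavior is not specified.

```python
from typing import List


def chunk_text(text: str, chunk_size: int = 3000, overlap: int = 200) -> List[str]:
    """Split text into fixed-size character windows that share `overlap` chars.

    Illustrative only; the real DocumentChunker may split on other boundaries.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap  # how far the window advances each time
    chunks: List[str] = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the window already reached the end of the text
        start += step
    return chunks
```

With the default settings each window advances 2800 characters, so the last 200 characters of one chunk reappear at the start of the next and a sentence cut at a boundary stays retrievable from at least one chunk.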
✅ Quality & Validation
URLValidator Concurrent HTTP HEAD · Caching · 404 Prevention
URLGroundingService Link Correctness · Registry Lookup
ChoreoRepoRegistry 30+ Components · Monorepo URL Auto-Fix
LLMRepoMatcher Intelligent Repo Search · Context Matching
VectorClient PyMilvus SDK · Cosine Search · Batch Insert
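VectorClient is listed as performing cosine search through the PyMilvus SDK. The sketch below shows only the metric itself, computed in plain Python over an in-memory dictionary; it makes no PyMilvus calls and is not how the service queries Milvus. The names cosine_similarity and top_k are hypothetical.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_a * norm_b)


def top_k(query: Sequence[float],
          vectors: Dict[str, List[float]],
          k: int = 3) -> List[Tuple[str, float]]:
    """Return the k stored keys most similar to the query, best first."""
    scored = [(cosine_similarity(query, v), key) for key, v in vectors.items()]
    scored.sort(reverse=True)
    return [(key, score) for score, key in scored[:k]]
```

In the system described here the stored vectors would be 1536-dimensional ada-002 embeddings and the ranking would be delegated to Milvus; the brute-force loop above is only meant to make the similarity measure concrete.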
EXTERNAL CLOUD SERVICES
🤖 Azure OpenAI GPT-4 Chat Completions · text-embedding-ada-002 · 1536 Dimensions
🗄️ Milvus Cloud Zilliz Serverless · Cosine Similarity · Vector Storage
🐙 GitHub API Repository Content · Webhooks · Org Operations
👁️ Google Vision Cloud OCR API · Image Text Extraction
OBSERVABILITY
Prometheus 23+ Metrics
Grafana 8 Panels
Alertmanager 7 Alert Rules
Structured Logs JSON · Rotation
🐳 Docker · Python 3.11-slim · Non-root Container
☁️ Choreo Platform · WSO2 · API Gateway
Startup · <5s · Lazy Initialization
🔒 Security · Choreo Secrets · .env
📡 Streaming · SSE · First Token 1-2s