ArchitectureBackendSystem Design

System Architecture: End-to-End Flow of a Single Question

Follow one question through every box in our backend — classifier, memory, LLM, YouTube, response, feedback, eval.

Rohit Mehta

5 April 2025

11 min read

The full path

User → API Gateway → Classifier → Memory → LLM → YouTube → Response → Feedback → Eval (judge model).

Every record carries a tenant_id. Institutions get their own row-level-secured slice, custom system prompts, and per-tenant analytics.

Every request emits a trace with spans for each box above. p95 end-to-end is currently 2.8s.