At a Glance

| Layer | Components | Stack |
| --- | --- | --- |
| Edge | Auth, rate limiting, routing, caching | Edge workers, distributed KV, SQLite at the edge, global CDN |
| Application | Business logic, inference, agent orchestration | Python, FastAPI |
| Data | Relational store, cache, object storage | Managed Postgres, Redis, object storage |

Edge Plane

The only plane that touches the public internet.

| Component | Purpose |
| --- | --- |
| Frontend | React 19 + Vite 6 SPA with immutable static assets |
| Gateway | First point of contact for all API traffic: auth, rate limiting, request routing |
| Auth | JWT issuance and verification; refresh tokens stored in distributed KV |
| Distributed KV | Read-heavy cache: feature flags, rate-limit counters, public config |
| SQLite at the edge | Small relational datasets: feature flags history, search autocomplete |
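
As an illustration of the gateway's rate limiting backed by counters in the KV store, here is a minimal fixed-window sketch. It is not the gateway's actual implementation: the class name, the choice of a fixed-window algorithm, and the in-memory dict standing in for the distributed KV are all assumptions made for the example.

```python
import time
from typing import Dict, Optional, Tuple


class FixedWindowLimiter:
    """Fixed-window rate limiter (hypothetical; a dict stands in for the KV store)."""

    def __init__(self, limit: int, window_seconds: int) -> None:
        self.limit = limit
        self.window_seconds = window_seconds
        # Assumed layout: (client_id, window index) -> request count.
        # A real KV store would expire old windows with a TTL; this dict never evicts.
        self._counters: Dict[Tuple[str, int], int] = {}

    def allow(self, client_id: str, now: Optional[float] = None) -> bool:
        """Return True if the request fits in the current window's budget."""
        now = time.time() if now is None else now
        key = (client_id, int(now // self.window_seconds))
        count = self._counters.get(key, 0)
        if count >= self.limit:
            return False
        self._counters[key] = count + 1
        return True
```

With `FixedWindowLimiter(limit=2, window_seconds=60)`, a client's third request inside the same 60-second window is rejected, and the budget resets when the next window starts. Fixed windows are simple but allow short bursts across a window boundary; a sliding window or token bucket would smooth that at the cost of more state per client.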

Application Plane

Stateless replicas behind an internal load balancer.

| Service | Purpose |
| --- | --- |
| FastAPI Hub | Primary API: user accounts, billing, tenant management |
| Inference Worker | Routes prompts to the cheapest provider within the latency budget |
| Search Worker | Multi-provider search with classification and caching |
| Chat Worker | Conversational AI with streaming responses |
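
The Inference Worker's routing rule (cheapest provider within the latency budget) can be sketched as a filter followed by a minimum. The `Provider` fields, the cost unit, and the use of a p95 latency estimate are assumptions for illustration; the real worker may weigh other signals.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Provider:
    """Hypothetical provider record; field names and units are assumed."""
    name: str
    cost_per_1k_tokens: float
    p95_latency_ms: float


def pick_provider(
    providers: Iterable[Provider], latency_budget_ms: float
) -> Optional[Provider]:
    """Return the cheapest provider whose latency fits the budget, or None."""
    eligible = [p for p in providers if p.p95_latency_ms <= latency_budget_ms]
    if not eligible:
        return None
    # Cheapest first; ties are broken in favor of the faster provider.
    return min(eligible, key=lambda p: (p.cost_per_1k_tokens, p.p95_latency_ms))
```

Returning `None` when no provider fits the budget leaves the fallback policy (fail fast, relax the budget, or queue) to the caller.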

Data Plane

| Store | Purpose |
| --- | --- |
| Managed Postgres | Tenant content, audit logs, user uploads |
| Managed Redis | Application cache, session state |
| Object Storage | User uploads, backups |

Service-to-service calls are authenticated with signed JWTs and carried over mTLS. Data at rest is encrypted.
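
To make the signed-JWT requirement concrete, the following standard-library sketch signs and verifies a compact JWT with HMAC-SHA256 (HS256). The algorithm, the function names, and the claim values are assumptions; the document does not say which signing scheme the services use, and a production system would more likely use a maintained JWT library with asymmetric keys. mTLS is a transport-level concern and is not shown.

```python
import base64
import hashlib
import hmac
import json
import time
from typing import Optional


def _b64url(data: bytes) -> str:
    """Base64url-encode without padding, as the JWT format requires."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def _b64url_decode(text: str) -> bytes:
    """Decode base64url text, restoring the stripped padding."""
    return base64.urlsafe_b64decode(text + "=" * (-len(text) % 4))


def sign_service_token(claims: dict, secret: bytes) -> str:
    """Build a compact HS256 JWT (hypothetical helper, not a library API)."""
    header = {"alg": "HS256", "typ": "JWT"}
    signing_input = ".".join(
        _b64url(json.dumps(part, separators=(",", ":")).encode("utf-8"))
        for part in (header, claims)
    )
    signature = hmac.new(secret, signing_input.encode("ascii"), hashlib.sha256).digest()
    return f"{signing_input}.{_b64url(signature)}"


def verify_service_token(token: str, secret: bytes, now: Optional[float] = None) -> dict:
    """Check signature, algorithm, and expiry; return the claims or raise ValueError."""
    parts = token.split(".")
    if len(parts) != 3:
        raise ValueError("malformed token")
    header_b64, payload_b64, signature_b64 = parts
    expected = hmac.new(
        secret, f"{header_b64}.{payload_b64}".encode("ascii"), hashlib.sha256
    ).digest()
    # Constant-time comparison avoids leaking how many bytes matched.
    if not hmac.compare_digest(expected, _b64url_decode(signature_b64)):
        raise ValueError("bad signature")
    if json.loads(_b64url_decode(header_b64)).get("alg") != "HS256":
        raise ValueError("unexpected algorithm")
    claims = json.loads(_b64url_decode(payload_b64))
    now = time.time() if now is None else now
    if "exp" in claims and now >= claims["exp"]:
        raise ValueError("token expired")
    return claims
```

A caller would attach the token from `sign_service_token` to each internal request, and the receiving service would call `verify_service_token` before doing any work. The sketch checks only the signature, algorithm, and `exp`; validating `iss` and `aud` against an allow-list would be a natural next step.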