Distributed Runtime Flow

Dynavera behaves like a streaming agentic system rather than a simple CRUD app. Runtime responsibility is split into three buckets.

1) MCP Surface (Django-side tool layer)

This is the tool-facing layer that lets the model request structured actions such as retrieval and session updates.

Typical tool intents:

Conceptually, this layer translates model tool calls into standard Django queries and vector lookups.

The orchestrator lives in the WebSocket runtime and coordinates each user request lifecycle.

Typical interaction path:

This is the central control plane for session continuity, tool usage, and response streaming.

The GPU service is designed as a passive inference engine:

Using OpenAI-style request/response patterns keeps integration predictable.

Component	Typical Path / Endpoint	Role
MCP Surface	Internal Django tool handlers (and/or MCP endpoint)	Data/tool translation
Orchestrator	`apps/onboarding/consumers/`	Coordination + streaming
GPU Inference	`gpu_server.py` HTTP endpoints	Generation + embeddings