Why agents do not call models directly

February 18, 2026· 1 min readAgentic Quant Platform

The operational reason for routing spec-driven agents through AgentRuntime.

Agentic Quant PlatformSystems DesignLocal FirstDevelopment Timeline

Why this mattered

Direct model calls are tempting until you need telemetry, cost caps, guardrails, and reproducibility.

This belongs in the development timeline because Agentic Quant Platform is not a single feature. It is a local-first quant research and trading platform with FastAPI, Celery, Postgres, Iceberg, DuckDB, MLflow, Redis-backed RAG, strategy factories, agents, bots, streaming, and paper trading. The project only became useful once its infrastructure decisions were written down well enough to be repeated.

Design decision

AgentRuntime turns a YAML spec into an auditable run with model choice, context, tools, and outputs in one place.

The practical stack around this decision includes Python, FastAPI, Celery, Redis, Postgres, SQLAlchemy, Alembic, Iceberg, DuckDB, MLflow, LiteLLM, CrewAI, LangGraph, vectorbt-pro, Kafka, Flink, Next.js. I try to keep the interfaces small: configuration describes intent, runtime code owns behavior, and operational notes explain what a future maintainer should check first.

What I would repeat

That is the difference between a clever agent and an operated agent.

The repeatable pattern is to make the boring path explicit. For this project that means clear repository boundaries, documented setup, predictable deployment commands, and enough observability to know whether the system is healthy or merely quiet.

Reader takeaway

If you are building something similar, start with the workflow you need to repeat every week. Then add only the platform pieces that make that workflow easier to recover, explain, and extend.

Paper trading needs a risk loop

Mar 17, 2026

Why paper sessions share broker, data, queue, strategy, and risk concepts with live trading.

Inspiration rehydration as engineering discipline

Mar 14, 2026

How external open-source ideas are adapted into platform-native assets.

The web UI as a research control plane

Mar 11, 2026

How the Next.js UI organizes agents, RAG, data, ML, bots, and streaming into workflows.

Progress events are part of user trust

Mar 8, 2026

How task progress shapes the usability of long-running research workflows.