Pipelines without Spark

April 4, 2026· 1 min readAgentic Assistants

How Kedro-style pipelines keep assistant data processing understandable.

Agentic AssistantsSystems DesignLocal FirstDevelopment Timeline

Why this mattered

A local framework should not need a distributed engine for every transformation.

This belongs in the development timeline because Agentic Assistants is not a single feature. It is a local-first assistant framework with a CLI, FastAPI and WebSocket server, MCP bridge, Next.js control panel, indexing, scoped retrieval, knowledge bases, pipelines, discovery, and training workflows. The project only became useful once its infrastructure decisions were written down well enough to be repeated.

Design decision

The pipeline layer uses graph concepts, pluggable runners, and clear artifacts so processing can scale from a laptop to heavier backends later.

The practical stack around this decision includes Python, Poetry, Click, FastAPI, WebSockets, MCP, LanceDB, Chroma, DuckDB, Polars, PyArrow, CrewAI, LangChain, LangGraph, Ollama, MLflow, OpenTelemetry, Next.js, Docusaurus. I try to keep the interfaces small: configuration describes intent, runtime code owns behavior, and operational notes explain what a future maintainer should check first.

What I would repeat

That keeps the first experience small while leaving room for orchestration.

The repeatable pattern is to make the boring path explicit. For this project that means clear repository boundaries, documented setup, predictable deployment commands, and enough observability to know whether the system is healthy or merely quiet.

Reader takeaway

If you are building something similar, start with the workflow you need to repeat every week. Then add only the platform pieces that make that workflow easier to recover, explain, and extend.

Template assets for repeatable assistants

Apr 28, 2026

How templates make assistant deployment repeatable across projects.

Training and alignment in local workflows

Apr 25, 2026

How LoRA, QLoRA, RLHF, and DPO fit into the assistant project as optional capability.

Docusaurus docs and the Next control panel

Apr 22, 2026

Why the assistant project keeps documentation and operations in separate frontends.

Observability for assistant runs

Apr 19, 2026

Why traces, MLflow, and run metadata belong in local assistant workflows.