Build RAG pipelines, agents, and graph workflows in Python.
Async-native · Streaming-first · 2 core dependencies.
Why SynapseKit
Designed for engineers who want full control without writing everything from scratch.
Every API is async/await from the ground up, not a sync codebase retrofitted with async. Sync wrappers are included for scripts and notebooks.
Token-level streaming is the default, not an afterthought. Works identically across all 13 LLM providers.
numpy and rank-bm25 only. Every other capability is behind an optional extra. Install what you need.
13 LLM providers and 5 vector stores behind the same API. Swap providers without rewriting a single line.
RAG pipelines, agents, and graph nodes are interchangeable. Wrap anything as anything.
No hidden chains, no callbacks, no global state. Every step is plain Python you can read and override.
Explore the docs
From a 3-line quickstart to production graph workflows.
Retrieval-augmented generation with streaming, BM25 reranking, conversation memory, and token tracing.
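The retrieve-then-generate shape of such a pipeline can be sketched in plain Python. This is an illustration of the pattern only, not the SynapseKit API; the term-overlap scorer below is a toy stand-in for BM25.

```python
# Minimal retrieve-then-generate sketch (illustrative; not the SynapseKit
# API). Term-overlap scoring stands in for BM25 reranking.

def tokenize(text: str) -> set[str]:
    return set(text.lower().split())

def retrieve(query: str, docs: list[str], k: int = 1) -> list[str]:
    q = tokenize(query)
    # Rank documents by how many query terms they share.
    ranked = sorted(docs, key=lambda d: len(q & tokenize(d)), reverse=True)
    return ranked[:k]

def build_prompt(query: str, docs: list[str]) -> str:
    context = "\n".join(docs)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

docs = [
    "SynapseKit ships sync wrappers for notebooks.",
    "Token streaming is enabled by default.",
]
prompt = build_prompt(
    "Is streaming on by default?",
    retrieve("streaming default", docs),
)
```

A real pipeline would stream the model's answer token by token and append each turn to conversation memory; the retrieval and prompt-assembly steps stay the same.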
Read docs →

ReAct loop for any LLM. Native function calling for OpenAI, Anthropic, Gemini, and Mistral. 16 built-in tools, fully extensible.
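The ReAct loop itself is a small state machine: the model proposes an action, the runtime executes the matching tool, and the observation is fed back until the model emits a final answer. A bare sketch of that loop, with a stubbed model and a hypothetical `add` tool standing in for real ones:

```python
# Bare ReAct-style loop (pattern sketch, not SynapseKit's agent API).
import re

TOOLS = {"add": lambda a, b: a + b}  # hypothetical tool registry

def fake_llm(transcript: str) -> str:
    # Stand-in for a real model call: once an observation is present,
    # the "model" answers; before that, it requests a tool.
    if "Observation:" in transcript:
        return "Final: 7"
    return "Action: add(3, 4)"

def run_agent(question: str, max_steps: int = 5) -> str:
    transcript = f"Question: {question}"
    for _ in range(max_steps):
        reply = fake_llm(transcript)
        if reply.startswith("Final:"):
            return reply.removeprefix("Final:").strip()
        # Parse "Action: name(arg, arg)" and execute the tool.
        m = re.match(r"Action: (\w+)\((.*)\)", reply)
        name, args = m.group(1), [int(x) for x in m.group(2).split(",")]
        result = TOOLS[name](*args)
        transcript += f"\n{reply}\nObservation: {result}"
    return "gave up"

answer = run_agent("What is 3 + 4?")
```

With native function calling, the regex parsing disappears: the provider returns structured tool calls directly, but the loop shape is identical.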
Read docs →

DAG-based async pipelines. Parallel execution, conditional routing, typed state, fan-out/fan-in, SSE streaming, event callbacks, human-in-the-loop.
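Fan-out/fan-in over shared state can be sketched with plain `asyncio` (SynapseKit's graph API may look different): two branch nodes run concurrently, then a join step merges their partial updates.

```python
# Fan-out/fan-in sketch with plain asyncio (illustrative only).
import asyncio

async def branch_a(state: dict) -> dict:
    await asyncio.sleep(0.01)          # simulate I/O, e.g. an LLM call
    return {"a": state["x"] * 2}

async def branch_b(state: dict) -> dict:
    await asyncio.sleep(0.01)
    return {"b": state["x"] + 1}

async def run_graph(state: dict) -> dict:
    # Fan out: both branches receive the same input state concurrently.
    results = await asyncio.gather(branch_a(state), branch_b(state))
    for partial in results:            # fan in: merge partial updates
        state |= partial
    return state

state = asyncio.run(run_graph({"x": 3}))
```

Because each node only returns a partial state update, the merge step is the single place where writes happen, which keeps parallel branches free of shared-mutation races.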
Read docs →

OpenAI, Anthropic, Ollama, Gemini, Cohere, Mistral, Bedrock, Azure, Groq, DeepSeek, OpenRouter, Together, Fireworks — all behind one interface.
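The one-interface-many-providers idea can be sketched with a structural `Protocol`. The `complete` method name and the toy provider below are illustrative assumptions, not SynapseKit's real surface:

```python
# Provider-agnostic calling sketch via a structural Protocol
# (hypothetical interface; method names are assumptions).
from typing import Iterator, Protocol

class LLM(Protocol):
    def complete(self, prompt: str) -> Iterator[str]: ...

class EchoProvider:
    """Toy provider that streams the prompt back token by token."""
    def complete(self, prompt: str) -> Iterator[str]:
        yield from prompt.split()

def generate(llm: LLM, prompt: str) -> str:
    # Caller code never names a provider: anything exposing the same
    # `complete` signature can be swapped in without edits here.
    return " ".join(llm.complete(prompt))

out = generate(EchoProvider(), "hello world")
```

Streaming-first falls out naturally: since `complete` yields tokens, callers can either iterate as tokens arrive or join them for a blocking result.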
Read docs →

InMemory, ChromaDB, FAISS, Qdrant, Pinecone. One interface for all backends. Swap without rewriting.
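A swappable vector store reduces to a backend exposing the same `add`/`search` pair; the names and the in-memory cosine-similarity backend below are a sketch under that assumption, not the actual SynapseKit interface.

```python
# Minimal in-memory vector store sketch (assumed interface).
import math

class InMemoryStore:
    def __init__(self) -> None:
        self._items: list[tuple[str, list[float]]] = []

    def add(self, text: str, vec: list[float]) -> None:
        self._items.append((text, vec))

    def search(self, vec: list[float], k: int = 1) -> list[str]:
        def cos(a: list[float], b: list[float]) -> float:
            # Cosine similarity: dot product over the product of norms.
            dot = sum(x * y for x, y in zip(a, b))
            return dot / (math.hypot(*a) * math.hypot(*b))
        ranked = sorted(self._items, key=lambda it: cos(it[1], vec),
                        reverse=True)
        return [text for text, _ in ranked[:k]]

store = InMemoryStore()
store.add("streaming docs", [1.0, 0.0])
store.add("agent docs", [0.0, 1.0])
hit = store.search([0.9, 0.1])[0]
```

Swapping to a persistent backend then means changing only the constructor line, since retrieval code depends on the interface, not the implementation.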
Read docs →

Complete reference for every public class and method in SynapseKit.
Read docs →