Smart Routing Butler

One interface, every model at your command. Your local AI routing butler.

Quick Start · Features · Configuration · Security

Smart Routing Butler is a 100% self-hosted, OpenAI-compatible API smart router purpose-built for AI agents (OpenClaw, Cursor, Continue, etc.) and developer tools. It automatically balances cost, latency, and quality — connect to a single endpoint and seamlessly dispatch requests across cloud LLMs and local models.

📑 Table of Contents

💡 Why Smart Routing Butler?
🔀 Comparison with Alternatives
✨ Core Features
📸 UI Preview
🎯 Rule Creation — Three Ways
🔌 OpenAI-Compatible Local Proxy
🔍 Routing Layers Deep Dive
🏗️ Architecture Overview
🚀 Quick Start (Self-Hosted)
⚙️ Configuration Summary
📂 Repository Structure
🛠️ Development & Health Checks
🗺️ Roadmap
⚖️ Open-Source Governance
🛡️ Security & Privacy
🤝 Contributing
📜 License & Disclaimer
🙏 Acknowledgments

💡 Why Smart Routing Butler?

When using AI agents (OpenClaw, Cursor, Continue, etc.) and IDE-assisted coding daily, we constantly hit these pain points:

Steep API costs — Whether it's a simple spell check or complex architecture design, tools always use default models which may not at the right price.
Rigid global config — No way to assign the right model per task type (code completion, long-form summarization, multi-step reasoning).
Black-box fragility — Routing logic is opaque; when a single model provider goes down, the entire agent workflow collapses.

Smart Routing Butler turns "which model to use" into a policy-driven, hot-reloadable configuration problem. It acts as your local proxy layer, intercepts all LLM requests, and intelligently dispatches them based on your rules and semantic understanding.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.github/workflows		.github/workflows
compose		compose
contracts		contracts
dashboard		dashboard
docs/images		docs/images
proxy		proxy
router		router
scripts		scripts
.env.example		.env.example
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
README.zh-CN.md		README.zh-CN.md
SECURITY.md		SECURITY.md
docker-compose.release.yml		docker-compose.release.yml
docker-compose.yml		docker-compose.yml

Dimension	Typical Cloud API Gateway	Smart Routing Butler
Integration	Requires dedicated plugins, browser extensions, or SDK wrappers per tool	Standard OpenAI-compatible endpoint — any tool that supports `base_url` + API key works instantly. No plugins needed.
Data privacy	Traffic routed through third parties — leak risk	100% self-hosted, data stays on your local network
Routing logic	Platform black-box, no user control	L0–L3 white-box, transparent, configurable, explainable
Rule customization	Limited or no user-defined rules	Full visual editor + natural language + AI wizard for custom routing rules
Compliance	Dependent on vendor terms, region-locked	Deploy on your own network, meets the strictest enterprise requirements
Cost control	Platform fees or fixed monthly charges	Zero platform fees, route on-demand to maximize free/cheap model value

Field	Value
Rule name	Coding Rule
Priority	900 (high)
Condition	Task type = `coding`
Target model	`Alibaba/qwen3-coder-plus`
Fallback	`Alibaba/qwen3.5-plus`

Endpoint	Method	Description
`/v1/chat/completions`	`POST`	Chat completions (streaming and non-streaming)
`/v1/models`	`GET`	List all available models (includes a synthetic `auto` model for smart routing)

Condition	Description
`taskType`	Auto-detected task category (coding, translation, analysis, math, creative, chat, summarization, general)
`keywords`	Case-insensitive substring match on the last user message
`tokenCount`	Estimated token count within a min/max range
`maxCost`	Input cost per million tokens <= threshold
`maxLatency`	Provider average latency <= threshold
`providerHealth`	Provider health status matches

Dependency	Description
Docker & Compose	One-command orchestration of `proxy` / `router` / `dashboard` / `postgres` / `redis`
Ollama (optional)	For L3 local model arbitration; containers access the host via `host.docker.internal:11434`

Category	Entry Point
Global & ports	`.env.example`, `compose/ports.env`
Proxy / routing	`PYTHON_ROUTER_URL`, `OLLAMA_URL`, `ARCH_ROUTER_MODEL`, `ROUTING_ENABLE_L2` / `L3`, etc.
Dashboard & auth	`BETTER_AUTH_URL`, `BETTER_AUTH_SECRET`, `DATABASE_URL`, `PROXY_URL`
Pre-built images	`GHCR_OWNER`, `SMARTROUTER_IMAGE_TAG`

Directory	Description
`proxy/`	Node.js proxy: OpenAI-compatible API, L0/L1 cache & rules, SSE
`router/`	FastAPI: semantic routing, caching, L3 integration
`dashboard/`	Next.js: rules, providers, logs, settings
`contracts/`	Inter-service contracts

Document	Description
LICENSE	MIT License
CODE_OF_CONDUCT.md	Community standards based on Contributor Covenant 2.1
CONTRIBUTING.md	Contribution workflow, IP policy, and coding standards
SECURITY.md	Vulnerability reporting & responsible disclosure

Folders and files

Latest commit

History

Repository files navigation

Smart Routing Butler

💡 Why Smart Routing Butler?

🔀 Comparison with Alternatives

✨ Core Features

📸 UI Preview

🎯 Rule Creation — Three Ways to Build Your Routing Strategy

1. Custom Rule Editor

2. Natural Language Rule Generator

3. AI Rule Wizard (Guided Questionnaire)

🔌 OpenAI-Compatible Local Proxy

How it works

What happens under the hood

🔍 Routing Layers Deep Dive

L0 — Exact Cache

L0.5 — Semantic Cache

L1 — User-Defined Rule Engine

L2 — Semantic Route

L3 — Local Model Arbitration (Arch-Router)

Fallback

🏗️ Architecture Overview

🚀 Quick Start (Self-Hosted)

Prerequisites

Deploy in 3 Minutes

npm Dependencies for Local Development

⚙️ Configuration Summary

📂 Repository Structure

🛠️ Development & Health Checks (Maintainers)

🗺️ Roadmap

⚖️ Open-Source Governance

🛡️ Security & Privacy

🤝 Contributing

📜 License & Disclaimer

🙏 Acknowledgments

📚 Further Reading

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages