#

model-routing

Here are 319 public repositories matching this topic...

CommonstackAI / UncommonRoute

Automatic LLM router — 82% cost savings, 79.4% accuracy, 93.4% pass rate. Drop-in OpenAI proxy.

agent router ai openai cost-optimization llm anthropic model-routing

Updated May 12, 2026
Python

NadirRouter / NadirClaw

Open-source LLM router & AI cost optimizer. Routes simple prompts to cheap/local models, complex ones to premium — automatically. Drop-in OpenAI-compatible proxy for Claude Code, Codex, Cursor, OpenClaw. Saves 40-70% on AI API costs. Self-hosted, no middleman.

Updated Jun 12, 2026
Python

codeking-ai / cligate

Private AI assistant, AI agent, and unified model proxy for Claude Code, Codex CLI, Gemini CLI & OpenClaw. Skills, MCP, tools, channels, tasks,model routing, accounts, keys, logs, dashboard.

Updated Jun 23, 2026
JavaScript

greynewell / infermux

Route inference across providers.

Updated Feb 17, 2026
Go

openfreerouter / freerouter

Free, self-hosted AI model router. OpenRouter / ClawRouter alternative using your own API keys. 14-dimension classifier routes to the right model (Anthropic/OpenAI/Kimi) automatically. No middleman, no markup. Built for OpenClaw.

self-hosted openai cost-optimization anthropic llm-proxy llm-router ai-proxy model-routing openclaw openrouter-alternative ai-model-router

Updated Feb 14, 2026
TypeScript

fengzhizi715 / OpenVitamin

OpenVitamin is a local-first AI execution platform that unifies Agents, Workflows, and multi-model inference into a single programmable system — designed for building real, production-grade AI applications.

agent workflow ai multi-model execution-engine ai-agents rag fastapi ai-platform local-first onnxruntime llm llama-cpp local-ai ai-platforms agent-orchestration openai-compatible agent-runtime model-routing

Updated May 15, 2026
Python

nshkrdotcom / trinity_coordinator

TRINITY in Elixir (An Evolved LLM Coordinator): route LLM calls via a small-model hidden-state router + Axon coordination head, with Thinker/Worker/Verifier orchestration and policy loop for acceptance-driven completion.

Updated May 29, 2026
Elixir

guanbear / OctoClaw

Auto-delegation, status truth, and cost-aware model routing for OpenClaw agents

slack automation typescript ai-agents feishu llmops agent-orchestration subagents model-routing openclaw

Updated Jun 18, 2026
TypeScript

syrin-labs / syrin-harness

The Python Harness for Production AI Multi-Agent Systems

budgeting multi-agent memory-management observability harness multi-agent-systems rag multimodal-agent agentic-rag context-engineering model-routing harness-engineering

Updated May 18, 2026
Python

heyxiaoc / not-fade-away

在自己的 Mac 上搭一套常驻、自愈、走订阅、墙内也能用的自托管 AI 伴侣 · 人看版讲思路，机看版给完整规格 · 文 / 小C & Grace

tutorial self-hosted claude llm anthropic ai-companion claude-code model-routing fable-5 claude-fable

Updated Jun 24, 2026
Python

Aaryan-Kapoor / ModelGate-Hackathon

🏆Winning Project | ModelGate is a contract-aware AI control plane that ingests customer contracts, extracts SLA/privacy/routing constraints, and generates an OpenAI-compatible endpoint that automatically routes every request to the optimal model. Simple queries go to cheap models. Complex queries go to premium ones.

reinforcement-learning ai hackathon model nextjs routing openai lora quantization cost-optimization fine-tuning fastapi openai-api llm llamacpp gguf grpo openai-compatible model-routing

Updated Mar 22, 2026
TypeScript

tzachbon / claude-model-router-hook

Claude Code hooks that auto-switch model tier based on task complexity

Updated Mar 13, 2026
Python

joeseesun / qiaomu-llm-mcp

把多模型 Provider、本地密钥和 HeavySkill 讨论统一成 MCP 网关 | Local MCP gateway for multi-provider LLM routing, secrets, and HeavySkill discussions.

python mcp multi-model codex zai llm ai-workflow deepseek claude-code model-routing

Updated Jun 19, 2026
Python

megeezy / Chameleon

Stateless LLM runtime that dynamically routes, loads, executes, and unloads models per request with bounded VRAM caching and intelligent model selection.

systems-programming llm generative-ai ai-infrastructure latency-optimization model-routing vram-optimization model-scheduling

Updated Apr 12, 2026
Rust

claude-router

0xrdan / claude-router

Intelligent model orchestration for Claude Code - routes queries to optimal Claude model (Haiku/Sonnet/Opus) based on complexity. It also includes many more features. If this project is working well for you and would like to support me, just help spread the word. Thanks!

claude cost-optimization llm anthropic claude-code model-routing

Updated Jan 26, 2026
Python

RagavRida / mmcp

Multi-Model Collaboration Pipeline — orchestrate AI models as a DAG. RL routing, multi-verifier voting, agent mesh, self-improving. Works with OpenAI, Anthropic, Gemini, DeepSeek. npm install mmcp-core | pip install mmcp-core

Updated Jun 3, 2026
TypeScript

kalibr-ai / kalibr-sdk-python

Stop overpaying to run your agents. Kalibr routes every request to lower-cost model and tool paths without degrading performance.

Updated Jun 3, 2026
Python

walidboulanouar / maestro

Open-source Fugu: the open-source LLM orchestration brain.

self-hosted fugu trinity sakana openrouter ollama ai-gateway llm-router llm-orchestration openai-compatible claude-code model-routing anthropic-compatible sakana-fugu open-source-fugu frugalgpt

Updated Jun 23, 2026
TypeScript

Ruthwik000 / tokenfirewall

Scalable LLM cost enforcement middleware for Node.js with budget protection and multi-provider support

nodejs middleware typescript gemini openai budget cost-control llm anthropic token-counter model-routing automatic-failover ai-cost-management gssoc26

Updated May 27, 2026
TypeScript

xorbitsai / xrouter-llm

A prompt-aware LLM router that predicts which models can complete each request, then selects the cheapest capable one: 52.4% lower cost and +1.7 pts completion on our tested dataset.

ai model-selection ai-agents cost-optimization llm llmops openrouter llm-router llm-routing model-routing prompt-routing

Updated Jun 24, 2026
Python

Improve this page

Add a description, image, and links to the model-routing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the model-routing topic, visit your repo's landing page and select "manage topics."