GitHub - try-works/role-model: role-model is a protocol for assigning the right model for the right job. Use local and cloud AI together, or route between sever

Source: https://github.com/try-works/role-model

![Image 1: Role Model](https://github.com/try-works/role-model/blob/main/docs/public/role-model-hero.png)

`role-model` is an open protocol for capability-aware AI routing, plus a reference router that implements that protocol.

It gives a router a durable contract for describing **what a request needs**, **what an endpoint can do**, **what policy allows**, and **why a final routing decision was made**.

![Image 2: role-model runtime overview](https://github.com/try-works/role-model/blob/main/docs/public/images/runtime-overview.png)

What role-model does

[](https://github.com/try-works/role-model#what-role-model-does)

Every AI workload eventually faces the same question: _which model should handle this request?_ The answer depends on task type, required capabilities, cost, latency, and whether the model is running locally or in the cloud. `role-model` makes that decision explicit, explainable, and portable.

At a high level, `role-model` separates AI routing into a few stable pieces:

1. **Requests** describe task type, required capabilities, modalities, tool needs, and constraints. 2. **Endpoint identities and profiles** describe concrete routable endpoints rather than abstract model names. 3. **Routing policy** applies hard denies, preferences, budgets, and tie-break rules. 4. **Observability artifacts** record the decision, trace, usage, and observed performance.

The reference router supports **hybrid routing** across three deployment shapes:

- **Local / Local** - route between local models (e.g., llama-swap peers) based on role, task, and capability

- **Local / Cloud** - route between a local model and a cloud provider based on cost, latency, and task difficulty

- **Cloud / Cloud** - route between cloud providers (e.g., OpenAI, DeepSeek, Moonshot) based on capability and economics

This means a single runtime can serve a quick chat request from a fast local model, route a complex coding task to a capable cloud model, and fall back to a cheaper cloud endpoint when the primary is degraded, all under one explainable routing contract.

Install the runtime

[](https://github.com/try-works/role-model#install-the-runtime) For end users, prefer the packaged standalone runtime over a source build.

macOS and Linux

[](https://github.com/try-works/role-model#macos-and-linux)

curl -fsSL https://raw.githubusercontent.com/try-works/role-model/main/scripts/install.sh | sh

The installer downloads the latest GitHub Release archive, installs it under `~/.local/share/role-model-router/<version>/<target>/`, and exposes a `role-model-router` launcher in `~/.local/bin`.

Windows

[](https://github.com/try-works/role-model#windows)

irm https://raw.githubusercontent.com/try-works/role-model/main/scripts/install.ps1 | iex

The installer downloads the latest GitHub Release archive, installs it under `%LOCALAPPDATA%\Programs\RoleModelRouter\<version>\<target>\`, and creates a `role-model-router.cmd` launcher.

Manual downloads

[](https://github.com/try-works/role-model#manual-downloads) If you do not want to use installer scripts, download the matching archive from GitHub Releases.

| Platform | Archive | Launch | | --- | --- | --- | | Windows | `role-model-router-win32-x64.zip` | `Role-Model.bat` or `role-model-runtime.exe` | | macOS x64 | `role-model-router-darwin-x64.tar.gz` | `role-model-runtime` | | macOS arm64 | `role-model-router-darwin-arm64.tar.gz` | `role-model-runtime` | | Linux x64 | `role-model-router-linux-x64.tar.gz` | `role-model-runtime` |

Installation for Pi

[](https://github.com/try-works/role-model#installation-for-pi) The `pi-role-model` package connects Pi to an externally running Role-Model runtime.

Start the Role-Model runtime first, then install the public Pi package:

pi install npm:@try-works/pi-role-model

For local checkout testing from this repository, install the package directly:

pi install ./packages/pi-role-model

Inside Pi, run:

``` /role-model setup /role-model doctor /role-model alias list /role-model alias choose /role-model alias use <alias> /role-model requests /role-model explain latest ```

By default the package connects to `http://127.0.0.1:3456` and registers Role-Model as the `role-model` provider using `/api/role-model/downstream/openai`. Set `ROLE_MODEL_ENDPOINT` before starting Pi to use a different local runtime. Remote endpoints require explicit trusted `allowRemote` behavior, and runtimes that report `authentication.required` fail closed unless a future supported token source is configured. For local development installs and the full command reference, see `packages/pi-role-model/README.md`.

Develop from source

[](https://github.com/try-works/role-model#develop-from-source)

Prerequisites

[](https://github.com/try-works/role-model#prerequisites)

- **Node.js 24** (required for `node:sqlite` and SEA support)

- **pnpm 10.x** (via `corepack enable`)

- **Go 1.24+** (for llama-swap vendor binary and Windows launcher)

corepack enable corepack pnpm install

Smoke test

[](https://github.com/try-works/role-model#smoke-test)

corepack pnpm run smoke

For a fuller walkthrough, see `docs/public/quickstart.md`.

Development build

[](https://github.com/try-works/role-model#development-build) Run the bridge and UI in development mode (separate processes):

Terminal 1: bridge server

cd role-model-router/apps/runtime-host-bridge corepack pnpm exec tsx scripts/start-for-qa.ts

Terminal 2: UI dev server

cd role-model-router/apps/runtime-ui corepack pnpm exec react-router dev --port 5173 --host 127.0.0.1

Then open `http://127.0.0.1:5173` in your browser.

Production build (all platforms)

[](https://github.com/try-works/role-model#production-build-all-platforms) Build the UI and package the SEA runtime:

Build UI static files

corepack pnpm --filter @role-model-router/runtime-ui run build

Package the bridge as a single executable

corepack pnpm run runtime:package-sea

Output: `role-model-router/dist/release/<platform-arch>/role-model-runtime`

Windows desktop launcher

[](https://github.com/try-works/role-model#windows-desktop-launcher) Build a complete Windows package with dedicated browser window:

1. Build UI

corepack pnpm --filter @role-model-router/runtime-ui run build

2. Package bridge SEA runtime

corepack pnpm run runtime:package-sea

3. Build Go launcher

cd role-model-router/apps/launcher go build -o ../../dist/release/win32-x64/role-model-launcher.exe main.go

4. Bundle UI files

cp -r ../runtime-ui/build/client ../../dist/release/win32-x64/

Then double-click `role-model-launcher.exe` in `dist/release/win32-x64/`. It will:

- Start the bridge server on port 3456

- Open Microsoft Edge in app mode (dedicated window)

- Serve the UI directly from the bridge (no separate dev server)

Documentation

[](https://github.com/try-works/role-model#documentation) | Read this | If you want | | --- | --- | | `docs/public/README.md` | the docs hub | | `docs/public/introduction.md` | what role-model is and why it exists | | `docs/public/quickstart.md` | a real end-to-end smoke run | | `docs/public/concepts/how-role-model-works.md` | the system flow | | `docs/public/concepts/protocol-overview.md` | the protocol surface | | `docs/public/concepts/routing-overview.md` | how routing decisions happen | | `protocol/README.md` | canonical schemas and fixtures | | `role-model-router/README.md` | reference router packages and runtime apps | | `docs/protocol/routing-policy.md` | routing policy reference | | `docs/protocol/taxonomy-v1.md` | taxonomy V1 groups, roles, tasks, and Pi classification | | `docs/protocol/roles.md` | role metadata reference | | `docs/protocol/tasks.md` | task metadata reference | | `docs/operations/02-ci-and-release-flow.md` | CI, release automation, and workflow ownership | | `CHANGELOG.md` | release history |

Acknowledgements

[](https://github.com/try-works/role-model#acknowledgements) `role-model` builds on the work of several open-source projects:

- **llama-swap** - the vendored local model lifecycle manager that handles process supervision, request forwarding, and model swapping for local endpoints

- **LiteLLM** - the unified LLM API abstraction whose provider catalog, model metadata, and pricing data inform the routing-compatible provider inventory

License

[](https://github.com/try-works/role-model#license)

This repository is licensed under `BUSL-1.1` with a project-specific Additional Use Grant. Internal production use, evaluation, development, modification, and non-production redistribution are permitted under the root license. Hosted or managed third-party services, paid product embedding, and third-party commercialization require a separate commercial license.

See LICENSE for the full terms. Contributions require acceptance of the Contributor License Agreement before they can be merged.

Only individual contributions are accepted. Please do not submit work owned by an employer, client, company, or other entity unless you personally have the right to contribute it under this project's terms.