AWS Model Runner

Self-hosted, cost-optimized AI coding assistant with automatic 3-tier model routing on AWS Bedrock.

                                              AWS Bedrock (eu-west-2)
OpenCode --> NadirClaw (localhost) --+--> Qwen3 Coder 30B  (budget)
                |                   +--> Qwen3 Coder 480B (mid)
         Classifies prompt          |
         complexity (~10ms)         |    OpenRouter
                |                   +--> Qwen 3.6 Plus 1M (premium)
         +------+------+                 Free in preview
         v      v      v
       Simple   Mid   Complex/Large ctx

NadirClaw classifies each prompt's complexity using a local sentence-embedding model and routes it to the cheapest model that can handle it. Bedrock models handle budget/mid tiers; Qwen 3.6 Plus (1M context, free preview) on OpenRouter handles complex tasks and large-context overflow (>256K tokens).

Why

No throttling -- pay-per-use, no message caps or peak-hour limits
50% cheaper than Claude Max 5x at heavy usage (~55 vs ~110 EUR/mo)
Multi-region -- route models to different AWS regions (eu-west-2, us-west-1, etc.) via per-model endpoint overrides
Model flexibility -- 6+ models, swap anytime
Open source -- NadirClaw + OpenCode, fully auditable
Private -- VPC endpoint + Tailscale, traffic never hits public internet

Quick Start (Docker — recommended)

# 1. Install
pixi install

# 2. Deploy AWS infra (VPC, Bedrock endpoint, IAM)
cd infra && cdk deploy ModelRunnerNetwork ModelRunnerBedrock --require-approval never

# 3. Start NadirClaw in Docker (binds to 127.0.0.1:4000 only)
pixi run -e dev up

# 4. Verify
pixi run -e dev verify

Then connect OpenCode (uses global config at ~/.config/opencode/opencode.json):

opencode

VS Code users: The container starts automatically when you open this project (via .vscode/tasks.json).

Without Docker

mkdir -p ~/.nadirclaw && cp config/nadirclaw.env ~/.nadirclaw/.env
pixi run -e dev start

See SETUP_GUIDE.md for the full step-by-step walkthrough.

Models (multi-region)

Tier	Model	Provider	Context	Cost ($/M in/out)
Budget	Qwen3 Coder 30B	Bedrock (eu-west-2)	32K	$0.15 / $0.60
Mid	Qwen3 Coder 480B	Bedrock (eu-west-2)	256K	$0.45 / $1.80
Premium	Qwen 3.6 Plus	OpenRouter	1M	Free (preview)

Also available as fallbacks: Kimi K2.5, DeepSeek V3.2, GLM-4.7, GLM-5, MiniMax M2.1.

Cost (Solo Developer, incl. 20% VAT)

Usage	Monthly (EUR)
Light (30 req/day)	~30-40
Medium (80 req/day)	~55-75
Heavy (200 req/day)	~80-120

See AWS_Coding_Model_Cost_Security_Analysis.md for the full cost and security analysis with benchmark correlations.

Commands

# Docker (recommended)
pixi run -e dev up         # Start NadirClaw container (127.0.0.1:4000)
pixi run -e dev down       # Stop container
pixi run -e dev logs       # Follow container logs

# Direct (without Docker)
pixi run -e dev start      # Start NadirClaw router directly

# Shared
pixi run -e dev verify     # Test all 3 model tiers
pixi run -e dev report     # Cost report by model
pixi run -e dev savings    # How much you saved vs all-premium
pixi run -e dev dashboard  # Live terminal dashboard

Project Structure

aws-model-runner/
  Dockerfile                 # NadirClaw container image
  docker-compose.yml         # Localhost-only binding, healthcheck, AWS creds mount
  config/nadirclaw.env       # NadirClaw 3-tier routing config (Bedrock + OpenRouter)
  .vscode/tasks.json         # Auto-start container on folder open
  scripts/
    setup.py                 # Automated setup wizard
    verify_connection.py     # End-to-end connection test
  infra/                     # AWS CDK (Python)
    stacks/
      network_stack.py       # VPC + private subnets
      bedrock_stack.py       # VPC endpoint + IAM role
      tailscale_stack.py     # Tailscale subnet router (optional)
  docs/
    AWS_Coding_Model_Cost_Security_Analysis.md  # Full analysis
    SETUP_GUIDE.md             # Step-by-step recipe

Security

VPC endpoint for Bedrock -- traffic stays on AWS backbone
Tailscale subnet router -- encrypted WireGuard tunnel (optional)
IMDSv2 enforced -- SSRF protection on EC2 instances
iptables blocking 169.254.169.254 from forwarded traffic
IAM least-privilege -- bedrock:InvokeModel restricted to 3 model ARNs
CloudWatch prompt logging disabled -- prompts are not stored
NadirClaw runs on localhost only -- Docker binds to 127.0.0.1:4000, not exposed to the network

Known Issues

NadirClaw optimizer strips tool fields (fix: c37cad8) -- optimize_messages() rebuilds messages as {"role", "content"} only, dropping tool_calls and tool_call_id. This breaks all tool-use conversations. Fix: set NADIRCLAW_OPTIMIZE=off in config/nadirclaw.env.
NadirClaw content: null on tool-call messages (fix: c37cad8) -- When an assistant message has only tool_calls (no text), text_content() returns "" but a falsy check converts it to None. Bedrock/Mantle rejects content: null. Fix: patched in Dockerfile with sed.
LiteLLM missing some Bedrock models (litellm#24993) -- some newer Bedrock models not in LiteLLM's registry; the bedrock/converse/ route doesn't support tools. Fix: use Bedrock Mantle endpoint (openai/ prefix) instead.
OpenCode injects all skill descriptions into system prompt (opencode#13188) -- With ~1,500 skills installed, the system prompt grows to ~600KB (~150K tokens), exceeding model context windows. Fix: move skills out of ~/.config/opencode/skills/ and ~/.claude/skills/antigravity/; use @zenobius/opencode-skillful for lazy loading.
docker compose restart does not reload config/nadirclaw.env -- use docker compose down && docker compose up -d instead.

See SETUP_GUIDE.md for detailed troubleshooting.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.claude		.claude
.github/workflows		.github/workflows
.vscode		.vscode
config		config
docs		docs
infra		infra
scripts		scripts
src		src
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.mcp.json		.mcp.json
.opencodeignore		.opencodeignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
pixi.lock		pixi.lock
pixi.toml		pixi.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AWS Model Runner

Why

Quick Start (Docker — recommended)

Without Docker

Models (multi-region)

Cost (Solo Developer, incl. 20% VAT)

Commands

Project Structure

Security

Known Issues

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AWS Model Runner

Why

Quick Start (Docker — recommended)

Without Docker

Models (multi-region)

Cost (Solo Developer, incl. 20% VAT)

Commands

Project Structure

Security

Known Issues

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages