CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies
Never stop coding. The free AI gateway: one endpoint, 160+ providers, zero downtime. Smart 4-tier auto-fallback (Subscription → API → Cheap → Free), prompt compression (saves 15-75% of tokens), a 3-level proxy for geo-blocks, MCP Server (29 tools), A2A Protocol, 10 multi-modal APIs, and Desktop/Android/PWA apps.
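As a rough illustration of the tier-fallback idea described above, here is a minimal Python sketch. The tier names match the description, but `call_provider`, `ProviderError`, and the control flow are invented stand-ins, not the gateway's actual API.

```python
# Minimal sketch of 4-tier auto-fallback: try tiers in priority order and
# fall through on failure. call_provider/ProviderError are hypothetical.

class ProviderError(Exception):
    """Raised when a tier's provider fails or is rate-limited."""

def call_provider(tier: str, prompt: str) -> str:
    # Stand-in for an upstream LLM call; simulate the first two tiers failing.
    if tier in ("Subscription", "API"):
        raise ProviderError(f"{tier} tier unavailable")
    return f"[{tier}] response to: {prompt!r}"

def complete(prompt: str) -> str:
    for tier in ("Subscription", "API", "Cheap", "Free"):
        try:
            return call_provider(tier, prompt)
        except ProviderError:
            continue  # fall through to the next, cheaper tier
    raise RuntimeError("all tiers exhausted")

print(complete("summarize this diff"))  # served by the "Cheap" tier here
```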
The Context Optimization Layer for LLM Applications
The context layer for AI coding agents. Reduces token waste in Cursor, Claude Code, Copilot, Windsurf, Codex, Gemini & more by 60–95% (up to 99% on cached reads). Shell Hook + MCP Server · 49 tools · 10 read modes · 90+ patterns · Single Rust binary
Sharper context. Fewer tokens. Open-source middleware for Claude Code.
Find the ghost tokens. Fix them. Survive compaction. Avoid context quality decay.
Working memory for Claude Code - persistent context and multi-instance coordination
Up to 71.5x fewer tokens per session on Claude Code with Obsidian + Graphify. Persistent memory, codebase knowledge graphs, and chat import pipeline. 🇧🇷 PT-BR included.
Stop Claude Code from burning through your quota in 20 minutes. Auto-rotates oversized sessions and preserves context.
Intelligent token optimization for Claude Code, achieving 95%+ token reduction through caching, compression, and smarter tool use
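To make the caching part of that claim concrete, here is a toy Python sketch of one such mechanism: repeated reads of identical content return a one-line stub instead of re-entering the context. The cache policy and stub format are assumptions for illustration, not this project's implementation.

```python
import hashlib
import os
import tempfile

_seen: dict[str, str] = {}  # content hash -> path of first read

def cached_read(path: str) -> str:
    content = open(path, encoding="utf-8").read()
    digest = hashlib.sha256(content.encode()).hexdigest()
    if digest in _seen:
        # A repeat read costs a short stub instead of the whole file.
        return f"[cached: {os.path.basename(path)} sha256={digest[:8]}]"
    _seen[digest] = path
    return content

with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as f:
    f.write("some large file\n" * 1000)
print(len(cached_read(f.name)))  # full content on the first read
print(cached_read(f.name))       # one-line stub on the second read
```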
Reusable setup prompts for optimizing Claude Code documentation. Achieve 90% token savings on any project in 5 minutes.
Entroly-Daemon: Self-Evolving Daemon. Compress 2M-token repos into a razor-sharp Principal Engineer's context. 85–99% fewer tokens, 100% accuracy retention (verified by live API benchmarks). Built for Cursor, Claude Code, Opus, Codex, GPT & Custom Providers.
An MCP server that executes Python code in isolated rootless containers with optional MCP server proxying. Implementation of Anthropic's and Cloudflare's ideas for reducing MCP tool definitions context bloat.
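A minimal sketch of the isolation idea follows, assuming a rootless `podman` install and the `python:3.12` image; the MCP request/response plumbing is omitted, and this is not the server's actual code.

```python
import subprocess

def run_sandboxed(code: str, timeout: float = 10.0) -> str:
    """Execute model-generated Python in a throwaway, network-less container."""
    proc = subprocess.run(
        ["podman", "run", "--rm", "--network", "none", "-i",
         "python:3.12", "python", "-"],   # `python -` reads code from stdin
        input=code, capture_output=True, text=True, timeout=timeout,
    )
    return proc.stdout + proc.stderr

print(run_sandboxed("print(2 ** 64)"))
```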
Production-ready modular Claude Code framework with 30+ commands, token optimization, and MCP server integration. Achieves 2-10x productivity gains through systematic command organization and hierarchical configuration.
Generate a compact codebase index for AI assistants — saves 50K+ tokens per conversation
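The general shape of such an index is easy to sketch. The following Python example (my own illustration, not this tool's format) emits one line per file listing its top-level definitions, which is usually orders of magnitude smaller than the source itself.

```python
import ast
import pathlib

def index_repo(root: str) -> str:
    """One line per .py file: path plus its top-level classes/functions."""
    lines = []
    for path in sorted(pathlib.Path(root).rglob("*.py")):
        try:
            tree = ast.parse(path.read_text(encoding="utf-8"))
        except (SyntaxError, UnicodeDecodeError):
            continue
        names = [node.name for node in tree.body
                 if isinstance(node, (ast.FunctionDef,
                                      ast.AsyncFunctionDef,
                                      ast.ClassDef))]
        lines.append(f"{path}: {', '.join(names) or '(no top-level defs)'}")
    return "\n".join(lines)

print(index_repo("."))  # a compact map the assistant can navigate from
```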
Your agents are guessing at APIs. Give them the actual agent-native spec. 1500+ ready-to-use API skills. Compile any API spec into a lean, agent-native format, 10× smaller. OpenAPI, GraphQL, AsyncAPI, Protobuf, Postman.
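To show what "compiling" a spec down to an agent-friendly form can mean, here is a hand-rolled Python sketch that flattens a fragment of an OpenAPI document into one line per operation; the output format is invented, not this project's schema.

```python
# A toy OpenAPI fragment; real specs carry far more detail per operation.
spec = {
    "paths": {
        "/users": {
            "get": {"summary": "List users",
                    "parameters": [{"name": "limit", "in": "query"}]},
            "post": {"summary": "Create a user"},
        },
        "/users/{id}": {"get": {"summary": "Fetch one user"}},
    }
}

# Flatten to one terse line per operation: most of the token savings come
# from dropping schemas, examples, and prose the agent rarely needs up front.
for path, ops in spec["paths"].items():
    for method, op in ops.items():
        params = ", ".join(p["name"] for p in op.get("parameters", []))
        line = f"{method.upper()} {path}"
        if params:
            line += f" ?{params}"
        print(f"{line} - {op.get('summary', '')}")
```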
CLI proxy that reduces LLM token usage by 60-90%. Declarative YAML filters for Claude Code, Cursor, Copilot, Gemini. rtk alternative in Go.
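A minimal Python sketch of the declarative-filter idea follows. The rule schema is invented for illustration (rtk and this tool define their own YAML formats): each rule drops output lines matching a pattern before they reach the model.

```python
import re

# Hypothetical drop rules, standing in for the tools' real YAML schemas.
RULES = [
    {"match": r"^\s*(Compiling|Fresh) ", "action": "drop"},  # cargo build noise
    {"match": r"^warning: unused",       "action": "drop"},
    {"match": r"node_modules/",          "action": "drop"},  # deep JS stack frames
]

def filter_output(text: str) -> str:
    kept = [line for line in text.splitlines()
            if not any(re.search(r["match"], line)
                       for r in RULES if r["action"] == "drop")]
    return "\n".join(kept)

SAMPLE = """\
   Compiling serde v1.0.200
warning: unused variable: `x`
error[E0308]: mismatched types
  --> src/main.rs:4:9
"""
print(filter_output(SAMPLE))  # only the two error lines survive
```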
Compress LLM context to save tokens and reduce costs
Config-driven CLI tool that compresses command output before it reaches an LLM context
A smart context filter that removes noise, improves responses, and reduces token usage up to 90%