DeepSeek-native agent framework: Cache-First Loop, R1 Thought Harvesting, Tool-Call Repair. TypeScript + Ink TUI.
Updated Apr 30, 2026 - TypeScript
The Multi-Agent Reasoning framework creates an interactive chatbot in which AI agents collaborate through structured reasoning and Swarm Integration to produce optimal answers. By simulating a team that discusses, debates, and refines responses, it supports complex problem solving and more precise results. Now with Prompt Caching to reduce latency and costs.
Anthropic Claude API wrapper for Go
Independent research on Claude Code internals, Claude Agent SDK, and related tooling.
Automatic prompt caching for Claude Code. Cuts token costs by up to 90% on repeated file reads, bug fix sessions, and long coding conversations - zero config.
🚀 Autocache - Intelligent Anthropic API Cache Proxy. Automatically injects cache_control fields into Claude API requests to reduce costs by up to 90% and latency by up to 85%. Works as a transparent drop-in replacement for popular AI platforms like n8n, Flowise, Make.com, LangChain, and LlamaIndex; no code changes required.
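A proxy like this rewrites the request body before forwarding it. A minimal sketch of the injection step, assuming the Anthropic Messages API body shape and the simple heuristic of marking the last system block as cacheable (the types and function name here are illustrative, not Autocache's actual code):

```typescript
// Illustrative sketch: add Anthropic's cache_control marker to the last
// system block so the whole system prefix becomes cacheable.
// Types loosely mirror the Messages API body; not Autocache's real code.
type ContentBlock = {
  type: string;
  text?: string;
  cache_control?: { type: "ephemeral" };
};

interface MessagesBody {
  model: string;
  system?: ContentBlock[];
  messages: { role: "user" | "assistant"; content: string | ContentBlock[] }[];
}

function injectCacheControl(body: MessagesBody): MessagesBody {
  // The system prompt is identical across turns, so marking its last
  // block tells the API to cache everything up to and including it.
  if (body.system && body.system.length > 0) {
    body.system[body.system.length - 1].cache_control = { type: "ephemeral" };
  }
  return body;
}
```

Because the rewrite touches only the body, a proxy can apply it transparently between any client and the API, which is what makes the "no code changes" claim plausible.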
Build production-ready apps for GPT using Node.js & TypeScript
Global, unlimited persistent memory for Claude Code agents. Context-activated hints injected automatically via hooks using scatter-gather map-reduce.
Interact with Anthropic and Anthropic-compatible chat completion APIs in a simple and elegant way. Supports vision, prompt caching, and more.
Agentic-AI framework w/o the headaches
Prompt caching to save dollars on generative AI API usage.
Cache-aware orchestration for LLM agents. Fork helpers that share cached prefixes, detect cache breaks, and cut token costs by 38%+.
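Sharing cached prefixes comes down to never mutating the common head of a conversation: a cache break is any edit before the point where two histories diverge. A small sketch of the detection idea, with hypothetical helper names that are not this project's API (turns are assumed to compare by exact string equality):

```typescript
// Illustrative helpers (hypothetical names): forked agents stay
// cache-friendly as long as they only append after the shared prefix
// of serialized conversation turns.
function sharedPrefixLength(a: string[], b: string[]): number {
  let i = 0;
  while (i < a.length && i < b.length && a[i] === b[i]) i++;
  return i;
}

// A cache break occurs when `next` edits a turn inside `prev`'s prefix:
// every cached token after the divergence point is invalidated.
function isCacheBreak(prev: string[], next: string[]): boolean {
  return sharedPrefixLength(prev, next) < prev.length;
}
```

Appending turns leaves isCacheBreak false, while rewriting an early turn (say, editing the system prompt mid-run) flips it to true; that is the condition a cache-aware orchestrator wants to catch before dispatching a fork.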
A curated list of strategies, tools, papers, and resources for reducing LLM token costs and improving efficiency in production.
FastAPI proxy that strips volatile fields from OpenClaw requests to dramatically improve llama-server KV cache hit rates (~22× faster prompt eval)
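Stripping volatile fields helps because llama-server's KV cache reuses work only for exact token prefixes, so any field that changes per request (a timestamp, a request ID) invalidates everything serialized after it. A hedged sketch of the normalization step; the volatile key list is an assumption for illustration, not OpenClaw's actual schema:

```typescript
// Illustrative sketch: recursively drop per-request fields so repeated
// requests serialize to byte-identical prefixes and hit the llama-server
// KV cache. The key list is an assumption, not OpenClaw's schema.
const VOLATILE_KEYS = new Set(["timestamp", "request_id", "session_id", "nonce"]);

function stripVolatile(value: unknown): unknown {
  if (Array.isArray(value)) return value.map(stripVolatile);
  if (value !== null && typeof value === "object") {
    const out: Record<string, unknown> = {};
    for (const [k, v] of Object.entries(value)) {
      if (!VOLATILE_KEYS.has(k)) out[k] = stripVolatile(v);
    }
    return out;
  }
  return value; // primitives pass through unchanged
}
```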
Makes using local AI as easy and lightweight as a crab eating cheese
Agent Skill that caches LLM image descriptions as XMP metadata inside image files, reducing token usage by ~92% on repeated reads. Works with 30+ compatible agents.
91 production-proven patterns for building AI agents, extracted from a 512K-line codebase. Covers agentic loops, tool systems, permissions, MCP, prompt caching, multi-agent orchestration.
Self-hosted AI agent teams inside your messaging apps.
Save 60-90% on LLM token costs with intelligent memory compression for multi-agent systems
🚀 Comprehensive Claude API cost optimization toolkit - reduce costs by 50-95%