Introduction

This manual covers codescout: an MCP server that gives AI coding agents IDE-grade code navigation, optimized for token efficiency.

The Problem

When an AI coding agent tries to understand a codebase with conventional file tools, it faces a mismatch between what the tools produce and what the task actually requires.

Consider a routine task: "find all callers of authenticate_user and check whether they handle the error case." With standard tools, the agent has a few options:

grep — returns every line containing the string, including comments, string literals, documentation, and test fixtures. Disambiguation is the agent's problem.
cat — dumps the entire file when the agent needs one function. A 1,000- line module floods the context for a 30-line function.
find — locates files by name, but has no awareness of what is inside them.

None of these tools understand code structure. They operate on bytes and lines, not symbols, definitions, or references. The result is that agents burn most of their context window on navigation overhead: reading full files to find one function, re-reading the same module multiple times from different entry points, asking questions they already answered two tool calls ago.

The downstream effects compound:

Shallow understanding. When an agent can only see fragments at a time, it builds an incomplete picture and fills gaps with plausible-sounding guesses.
Hallucinated edits. Functions that do not exist, arguments in the wrong order, return types copied from the wrong overload.
Constant course-correction. The human has to re-read the agent's output, identify what it got wrong, and re-explain the structure it missed.

The tools are structurally blind. Every coding agent using only file primitives runs into this wall, regardless of model capability.

The Solution

codescout exposes the same information an IDE uses — symbol definitions, references, call hierarchies, type information, git history — through a standard MCP interface that any agent can call.

It is a Rust binary that runs alongside your coding agent. The agent sends MCP tool calls; codescout delegates to the right backend (LSP server, tree-sitter, git, embedding index) and returns structured, compact results.

Four pillars:

LSP Navigation (9 tools, 9 languages)

The Language Server Protocol is how IDEs answer questions like "where is this defined?" and "who calls this?". codescout runs LSP servers on your behalf and exposes their answers as agent-friendly tools.

find_symbol — locate any symbol by name across the project
list_symbols — the outline of a file or directory: classes, functions, structs, in tree form
find_references — all callers/usages of a given symbol
replace_symbol — replace a function body by name, not by line number
remove_symbol — delete a named symbol entirely
insert_code — add code relative to a named symbol (position: "before" or position: "after")
goto_definition — jump to the definition of a symbol at a given line
hover — type info and documentation for a symbol at a given position
rename_symbol — rename across the entire codebase via LSP

Supported languages: Rust, Python, TypeScript/JavaScript, Go, Java, Kotlin, C/C++, C#, Ruby.

Semantic Search (3 tools)

Sometimes you know the concept but not the name. Semantic search finds code by meaning using embeddings, not keywords.

semantic_search — "authentication middleware", "retry with exponential backoff", "how errors are serialized" — returns ranked code chunks. The optional scope parameter restricts search to project code, a specific library, or all sources.
index_project — build or incrementally update the embedding index (smart change detection via git diff → mtime → SHA-256 fallback)
index_status — show index stats: file count, chunk count, embedding model, last update time, and optional per-file drift scores

The embedding backend is configurable: OpenAI, Ollama, or any compatible endpoint.

For git history and diffs, use run_command with shell git commands (e.g. run_command("git log src/auth.rs") or run_command("git diff HEAD")).

Persistent Memory (1 tool)

Agents are stateless across sessions by default. codescout provides a lightweight key-value store backed by markdown files in .codescout/memories/.

memory — unified dispatch tool: action: "read" / "write" / "list" / "delete" for the file store; "remember" / "recall" / "forget" for natural-language semantic memory

Use this to record decisions, gotchas, and conventions so the agent picks them up on the next session without re-discovery.

Library Navigation (1 tool)

Navigate third-party dependency source code without leaving your agent workflow. Libraries auto-register when LSP goto_definition returns paths outside the project root.

list_libraries — see all registered libraries and their status (use index_project with a library path to build a semantic index for it)

The Rest

Beyond these pillars: 6 file operation tools (directory listing, file reading, pattern search, file search, file creation, find-and-replace editing), 2 workflow tools (project onboarding, shell commands), 2 config tools, and 5 GitHub tools — 29 tools total.

Token Efficiency by Design

Every tool defaults to the most compact representation that is still useful. Full bodies are available via detail_level: "full". Paginated results use offset and limit. Tools never dump unbounded output.

The design follows two modes:

Exploring (default) — names and locations, capped at 200 items. Low token cost. Right for orientation.
Focused — full detail, paginated. Use once you know what you are looking at.

When results overflow the cap, the tool tells you how to narrow the query rather than silently truncating. You get guidance, not garbage.

Who This Manual Is For

This manual is written for three audiences.

Operators

You are setting up codescout for a team or configuring it to work with Claude Code, Cursor, or another MCP-capable agent. You need to understand installation, the MCP configuration format, embedding backend options, and which LSP servers to install for your languages.

Start here: Installation, then Project Configuration.

End-User Developers

You are a developer using Claude Code (or another agent) with codescout already set up. You want to understand what the tools do and when to reach for each one, so you can ask the agent better questions and interpret its reasoning.

Start here: Progressive Disclosure and Tool Selection, then browse the Tool Reference for the categories you use most.

Contributors

You want to add a language, write a new tool, or swap in a different embedding backend. You need to understand the internal architecture: the Tool trait, the LSP client, the embedding pipeline, the output guard system.

Start here: Architecture, then Adding Languages and Writing Tools.

How to Read This Manual

The manual is organized into three sections:

User Guide — everything you need to install, configure, and use codescout. Reads linearly for first-time setup; use it as a reference once you are familiar.

Tool Reference — one page per tool category. Each page covers what the tools do, their parameters, output format, and when to prefer them over alternatives. You do not need to read this cover to cover; look up the category you need.

Development — architecture internals, extension guides, and troubleshooting. Oriented toward contributors and operators debugging unexpected behavior.

Get Started

Installation — build the binary, register the MCP server, install LSP servers
Your First Project — onboarding, indexing, and your first tool calls
Routing Plugin — the plugin that ensures Claude always reaches for codescout tools

A Quick Example

Here is what a concrete agent interaction looks like with codescout versus without it.

Without codescout — the agent uses read_file on auth.rs (850 lines), scans for authenticate_user, reads the function, then uses grep for callers, gets 23 hits including test fixtures, reads three more files to disambiguate, and still misses that the error type changed in a recent refactor.

With codescout:

list_symbols("src/auth.rs")
  → authenticate_user [fn, line 142], SessionStore [struct, line 12], ...

find_references("authenticate_user", "src/auth.rs")
  → middleware/auth_guard.rs:88, handlers/login.rs:34, handlers/api.rs:201

run_command("git log src/auth.rs")
  → 3 days ago: "refactor: change AuthError to return structured payload"

find_symbol("AuthError", include_body=true)
  → enum AuthError { ... } with full definition

Four targeted calls. The agent sees the symbol tree, the exact call sites, the relevant git history, and the type definition — without reading a single full file. That is the difference codescout makes.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduction

The Problem

The Solution

LSP Navigation (9 tools, 9 languages)

Semantic Search (3 tools)

Persistent Memory (1 tool)

Library Navigation (1 tool)

The Rest

Token Efficiency by Design

Who This Manual Is For

Operators

End-User Developers

Contributors

How to Read This Manual

Get Started

A Quick Example

FilesExpand file tree

introduction.md

Latest commit

History

introduction.md

File metadata and controls

Introduction

The Problem

The Solution

LSP Navigation (9 tools, 9 languages)

Semantic Search (3 tools)

Persistent Memory (1 tool)

Library Navigation (1 tool)

The Rest

Token Efficiency by Design

Who This Manual Is For

Operators

End-User Developers

Contributors

How to Read This Manual

Get Started

A Quick Example