quant-cache

English | 日本語

An economic cache decision framework for CDN operators.

quant-cache evaluates cache policies through an economic objective function ($/period) that unifies latency savings, origin cost reduction, and freshness penalties into a single metric. It reveals hidden costs that hit-rate-only evaluation misses — for example, GDSF achieves the highest hit rate but scores negative on economic objective due to stale content penalties.

quant-cache is not a replacement for eviction policies like SIEVE or S3-FIFO. It is a decision and evaluation layer that answers:

Which objects are economically worth caching?
How does your cache policy perform when freshness costs are accounted for?
How close is your greedy heuristic to the mathematical optimum?

Key Finding

Evaluated across 20 synthetic traces (Zipf α=0.6, 500 objects, 50k requests):

Policy	Objective$ (mean)	Hit% (mean)	CostSavings$ (mean)
SIEVE	392.57	36.4%	361.74
S3-FIFO	380.48	35.1%	350.79
LRU	324.90	35.0%	349.66
GDSF	-133.19	44.1%	562.64

GDSF has the highest hit rate and cost savings, but its economic objective is deeply negative because it caches high-update-rate objects that incur stale penalties. This kind of insight is invisible without an explicit economic model.

Quick Start

cargo build --workspace --release

# Generate a synthetic trace
qc generate --num-objects 10000 --num-requests 1000000 --output trace.csv

# Compare policies with economic evaluation
qc compare --input trace.csv --capacity 50000000 --preset ecommerce

# Import real CDN logs (CloudFront, Cloudflare, or Fastly)
qc import --provider cloudfront --input access.log --output trace.csv
qc import --provider cloudflare --input els.ndjson --output trace.csv
qc import --provider fastly --input realtime.ndjson --output trace.csv

# Optimize: find the economically optimal cache set
qc optimize --input trace.csv --output policy.json --capacity 50000000 --preset ecommerce

# Search for the best policy configuration (grid, SA, or QUBO)
qc policy-search --input trace.csv --capacity 50000000 --method sa --output best-ir.json

# Calibrate economic parameters
qc calibrate --train train.csv --validation val.csv --capacity 50000000

End-to-End: Trace → CDN Config

# 1. Import logs (or generate synthetic)
qc import --provider cloudfront --input access.log --output trace.csv

# 2. Search for the best policy configuration
qc policy-search --input trace.csv --capacity 50000000 \
  --preset ecommerce --method sa --output best-policy.json

# 3. Evaluate the policy on the trace
qc policy-eval --input trace.csv --policy best-policy.json --preset ecommerce

# 4. Generate optimizer scores for admission gate
qc optimize --input trace.csv --output scores.json \
  --capacity 50000000 --preset ecommerce

# 5. Compile to any CDN target + validate
qc compile --policy best-policy.json --scores scores.json \
  --target cloudflare --output cloudflare-config.json --validate
qc compile --policy best-policy.json --scores scores.json \
  --target cloudfront --output cloudfront-config.json --validate
qc compile --policy best-policy.json --scores scores.json \
  --target fastly --output fastly-config.json --validate
qc compile --policy best-policy.json --scores scores.json \
  --target akamai --output akamai-config.json --validate

# 6. Compare all CDN outputs side-by-side
qc compile-compare --policy best-policy.json --scores scores.json

# 7. Pre-deploy safety check
qc deploy-check --input trace.csv --policy best-policy.json --preset ecommerce

What It Does

1. Economic Scoring

For each cached object, compute expected economic benefit over time window T:

benefit  = E[requests] × (latency_saving × λ_latency + origin_cost)
freshness_cost = E[requests] × P(stale) × stale_penalty    (TTL-Only model)
net_benefit = benefit - freshness_cost

Two scoring versions:

V1 (frequency): all requests assumed to hit if cached
V2 (reuse-distance): p_hit = exp(-rd_p50 / cache_capacity_objects) — discounts by temporal locality

2. Replay Evaluation

Replay traces against 7 policies (LRU, GDSF, SIEVE, S3-FIFO, Belady, EconSieve, EconS3FIFO) and measure both traditional metrics (hit rate, byte hit rate) and economic objective.

3. Bounded Optimality

Solve the 0-1 knapsack with GreedySolver (O(n log n)) and verify against ExactIlpSolver. Observed optimality gap: median 0.01%, p95 0.72% (n=1000, 50 cases).

4. Parametric Validation

2,880+ parameter sweep validates invariants across Zipf α, capacity ratio, object count, update rate, and scoring version:

Belady hit ratio ≥ all online policies
Solver respects capacity constraints
No NaN/Inf propagation
Objective monotone in capacity
Deterministic results

CLI Commands

Command	Description
`qc import`	Convert CDN logs to canonical trace CSV (CloudFront, Cloudflare, Fastly)
`qc generate`	Generate synthetic traces with configurable distributions
`qc optimize`	Find economically optimal cache set (greedy, ILP, or SA solver)
`qc simulate`	Replay trace against a saved policy
`qc compare`	Compare LRU, GDSF, SIEVE, S3-FIFO, Belady side-by-side
`qc calibrate`	Auto-tune economic parameters using train/validation split
`qc policy-eval`	Evaluate PolicyIR configurations on traces
`qc policy-search`	Search backend/admission/bypass/prewarm space (grid/SA/QUBO)
`qc compile`	Generate deployment config + validate (Cloudflare/CloudFront/Fastly/Akamai)
`qc deploy-check`	Pre-deploy safety check (LRU/SIEVE comparison + thresholds)
`qc compile-compare`	Compile same PolicyIR to all 4 providers and compare

Baselines

Policy	Type	Source
LRU	Online eviction	Classic
GDSF	Online eviction (cost-aware)	Cao & Irani, 1997
SIEVE	Online eviction (lazy promotion)	Zhang et al., NSDI 2024 (Best Paper)
S3-FIFO	Online eviction (3-queue)	Yang et al., SOSP 2023
Belady	Offline oracle (future knowledge)	Belady, 1966
EconSieve	SIEVE + economic admission gate	quant-cache
EconS3FIFO	S3-FIFO + economic admission gate	quant-cache
EconomicGreedy	Offline knapsack selection	quant-cache (Dantzig, 1957)
ExactILP	Offline optimal	HiGHS solver
SA/QUBO	Offline quadratic (co-access interactions)	quant-cache

Presets

Preset	Use Case	λ_latency ($/ms)	Stale Penalty
`ecommerce`	Product pages, catalogs	0.00005	High ($0.10/event)
`media`	Video/image streaming	0.00001	Low ($0.001/event)
`api`	REST APIs, auth tokens	0.0001	InvalidationOnUpdate

Architecture

quant-cache/
├── crates/
│   ├── qc-model/      Data types, configs, presets, economic parameters
│   ├── qc-solver/     Scorer trait (V1/V2), Greedy/ILP/SA solvers, QUBO DSL, calibration
│   ├── qc-simulate/   Replay engine, 7+ baseline policies, synthetic generator, co-access
│   └── qc-cli/        CLI (11 commands), 3 log parsers, 4 CDN compilers
├── docs/              Design documents, roadmap, related work (29 papers)
└── CHANGELOG.md       Release history

~8,200 lines of source + ~5,100 lines of tests across 30 test suites.

Testing

cargo test --workspace                            # 160+ tests (smoke tier, <30s)
cargo test --release --workspace -- --ignored     # full parametric validation (2,880+ combos)
cargo clippy --all-targets -- -D warnings         # lint
cargo fmt --check                                 # format

Roadmap

Version	Focus	Status
V1.0	Economic knapsack + trace replay	Done
V1.1	CloudFront log import	Done
V1.5	Belady oracle, calibration	Done
V1.6	Reuse-distance scoring (V2)	Done
V2.0	Quadratic SA, co-access interactions	Done
Phase B	Policy IR + IR evaluator	Done
Phase C	Policy search (grid/SA/QUBO)	Done
Phase D	4 CDN deployment scaffolds	Done
Phase E	Parametric validation + QUBO DSL	Done

Academic Context

quant-cache is grounded in 29 surveyed papers spanning classical algorithms (Belady, GDSF), modern eviction (SIEVE, S3-FIFO, TinyLFU), ML approaches (LRB, CACHEUS), optimization theory (Dantzig knapsack, Lucas QUBO), and production systems (CacheLib).

See docs/related-work.md for the full survey.

Contributing

See CONTRIBUTING.md.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 77 Commits
.github		.github
analysis		analysis
crates		crates
data		data
docs		docs
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.ja.md		README.ja.md
README.md		README.md
SECURITY.md		SECURITY.md
clippy.toml		clippy.toml
rustfmt.toml		rustfmt.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

quant-cache

Key Finding

Quick Start

End-to-End: Trace → CDN Config

What It Does

1. Economic Scoring

2. Replay Evaluation

3. Bounded Optimality

4. Parametric Validation

CLI Commands

Baselines

Presets

Architecture

Testing

Roadmap

Academic Context

Contributing

License

About

Uh oh!

Releases 7

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

quant-cache

Key Finding

Quick Start

End-to-End: Trace → CDN Config

What It Does

1. Economic Scoring

2. Replay Evaluation

3. Bounded Optimality

4. Parametric Validation

CLI Commands

Baselines

Presets

Architecture

Testing

Roadmap

Academic Context

Contributing

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 7

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages