Framework Injection Benchmark Suite

Components

fi_generator.py — Generates injectable frameworks for any domain
benchmark.py — Full PE vs CE vs FI benchmark (10 domains x 3 conditions x N runs)
validate.py — Quick validation (3 domains x 1 run)

Usage

# Generate a framework
python fi_generator.py --domain "corporate law M&A due diligence"

# Quick validation (9 API calls)
python validate.py

# Full benchmark (90+ API calls)
python benchmark.py --domains all --runs 3

Methodology

Three conditions compared:

PE (Prompt Engineering): simple task instruction
CE (Context Engineering): task + expert context + structure
FI (Framework Injection): complete 5-type injectable framework + task

Evaluation: LLM-as-judge on 6 criteria (1-5 scale):

Domain Accuracy
Reasoning Depth
Completeness
Actionability
Hallucination Check
Professional Tone

Reference

Gomes, R. A. (2026). From Commands to Cognition: Digital Craftsmanship and the Framework Injection Paradigm. DOI: 10.5281/zenodo.19344789

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
frameworks		frameworks
results		results
LICENSE		LICENSE
README.md		README.md
benchmark.py		benchmark.py
benchmark_hard.py		benchmark_hard.py
fi_generator.py		fi_generator.py
fi_generator_v2.py		fi_generator_v2.py
validate.py		validate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Framework Injection Benchmark Suite

Components

Usage

Methodology

Reference

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Framework Injection Benchmark Suite

Components

Usage

Methodology

Reference

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages