@evalops

EvalOps

EvalOps is an AI testing and monitoring platform that helps engineering teams ship reliable AI features with confidence.

Popular repositories

  1. cognitive-dissonance-dspy Public

    A multi-agent LLM system for detecting and resolving cognitive dissonance.

    Python 269 20

  2. dspy-micro-agent Public

    Minimal agent runtime built with DSPy modules and a thin Python loop. Includes CLI, FastAPI server, and eval harness with OpenAI/Ollama support.

    Python 65 6

  3. founder-email-optimizer Public

    DSPy-powered email optimization for startup founders: drop in your 3 best emails, get optimized outreach for new leads

    Python 38 1

  4. orbit-agent Public

    A brutally honest "high‑orbit" startup advisor you can text or run from the CLI. Built with DSPy, it provides opinionated, YC-style advice and financial tools for founders.

    Python 16

  5. nimbus Public

    Self-hosted CI infrastructure optimized for AI evaluation workloads. Run evals on bare metal with Firecracker isolation, built-in observability, and zero cloud egress costs

    Python 7

  6. bandit_dspy Public

    A DSPy library for security-aware LLM development using Bandit.

    Python 5 1

Repositories

Showing 10 of 36 repositories
  • tokio-hotel Public

    Ephemeral, cancellable task rooms with scoped resources for Tokio runtimes

    Rust 0 MIT 0 0 0 Updated Dec 6, 2025
  • keep Public

    PoC zero-trust access stack with Google SSO, Envoy, OPA, and device attestation

    Go 0 0 0 8 Updated Nov 24, 2025
  • nimbus Public

    Self-hosted CI infrastructure optimized for AI evaluation workloads. Run evals on bare metal with Firecracker isolation, built-in observability, and zero cloud egress costs

    Python 7 0 0 1 Updated Nov 10, 2025
  • cursor-triplane Public

    Cursor-style tri-plane RL stack: Firecracker microVM envs, Ray inference, FP8 MoE trainer

    Python 0 0 0 0 Updated Nov 8, 2025
  • agent-pm Public

    Agent PM: OpenAI Agents-powered product management orchestrator with automated PRDs, tickets, and comms.

    Python 3 MIT 1 0 0 Updated Oct 21, 2025
  • provenance Public

    Agent provenance & risk analytics API: FastAPI service with Semgrep detection, Redis, analytics dashboards

    Python 0 0 0 0 Updated Oct 17, 2025
  • cognitive-dissonance-dspy Public

    A multi-agent LLM system for detecting and resolving cognitive dissonance.

    Python 269 20 0 0 Updated Oct 14, 2025
  • meta-secdev-agent Public

    Static analysis meta agent for security and tenant isolation workflows

    Python 0 0 0 0 Updated Oct 13, 2025
  • github-ona Public

    Organizational Network Analysis tool for GitHub repositories - analyze collaboration patterns through PR and review data

    Python 0 0 0 0 Updated Oct 11, 2025
  • env-drift-detector Public

    Detect environment variable drift across monorepo services using OpenAI Codex SDK - catches naming inconsistencies, missing vars, and undocumented secrets

    TypeScript 1 0 0 0 Updated Oct 8, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.
