adversarial-testing

Star

Here are 37 public repositories matching this topic...

sherifkozman / the-red-council

Star

LLM Adversarial Security Arena — Jailbreak → Detect → Defend → Verify

security gemini red-team llm langchain adversarial-testing

Updated Mar 17, 2026
Python

jhlee0409 / elenchus-mcp

Sponsor

Star

Elenchus MCP Server - Adversarial verification system for code review

nodejs typescript ai mcp static-analysis code-review claude code-verification llm anthropic model-context-protocol mcp-server adversarial-testing

Updated Jan 29, 2026
TypeScript

stchakwdev / Gaslight_EVAL

Star

AI safety evaluation framework testing LLM epistemic robustness under adversarial self-history manipulation

python ai-safety openrouter llm-evaluation adversarial-testing alignment-research epistemic-robustness

Updated Dec 18, 2025
Python

alejandrosaenz117 / bonfires-marketplace

Star

A marketplace of Claude Code plugins for adversarial security and architectural code review.

security architecture code-review threat-modeling security-review claude-code adversarial-testing plugin-marketplace

Updated Feb 28, 2026

zakky8 / llm-jailbreak-taxonomy

Star

Systematic LLM jailbreak taxonomy — 40 attack patterns, 10 categories, empirical evaluation across 4 frontier models. AI safety research with responsible disclosure.

taxonomy jailbreak alignment ai-safety security-testing responsible-disclosure jailbreak-detection adversarial-attacks red-teaming ai-security model-robustness adversarial-ml prompt-injection red-teaming-tools llm-security llm-evaluation llm-jailbreaks ai-red-teaming adversarial-testing

Updated Mar 15, 2026
Jupyter Notebook

vibheksoni / jailbench

Star

Benchmark LLM jailbreak resilience across providers with standardized tests, adversarial mode, rich analytics, and a clean Web UI.

Updated Aug 12, 2025
Python

tasumermaf / the-adversary

Star

Agent-driven adversarial paper audit framework

python ai-agents scientific-writing research-tools adversarial-testing paper-audit

Updated Mar 17, 2026
Python

mcptrust / mcp-adversarial-suite

Star

Adversarial MCP server benchmark suite for testing tool-calling security, drift detection, and proxy defenses

security benchmark mcp red-team security-testing ai-security llm-security tool-calling model-context-protocol adversarial-testing

Updated Dec 27, 2025
JavaScript

yangyihe0305-droid / llm-red-team-research

Star

Systematic exploration of LLM alignment boundaries through logical stress testing

nlp machine-learning alignment language-models ai-safety security-research red-teaming llm prompt-engineering adversarial-testing

Updated Mar 9, 2026
Shell

inaciovasquez2020 / urf-application-stress-test

Star

Description URF Application Stress Test — adversarial and scalability tests for Unified Rigidity Framework applications, validating limits under load, noise, and edge cases.

reproducible-research scalability stress-testing formal-verification robustness adversarial-testing unified-rigidity-framework systems-validation

Updated Feb 24, 2026
Shell

YaswanthGhanta / llm-logical-integrity-benchmark

Star

Adversarial testing of LLMs on constraint satisfaction deadlocks

reinforcement-learning gemini grok claude hallucination prompt-engineering chain-of-thought chatgpt rlhf qwen llm-evaluation sycophancy deepseek safety-alignment ai-red-teaming kimi-k2 adversarial-testing

Updated Jan 27, 2026

anotherben / claude-enterprise-skills

Star

9-stage enterprise development pipeline for Claude Code. TDD, adversarial testing, mechanical verification. Any stack.

Updated Mar 14, 2026
Shell

alpha-one-index / ai-red-teaming-index

Sponsor

Star

Comprehensive AI red teaming index: tools, frameworks, benchmarks, datasets, and vulnerability leaderboards for LLM safety and adversarial testing.

Updated Mar 16, 2026
HTML

adeolasopade / AI-Security-Audit-Cryptocurrency-Exchange-

Star

Identified critical AI governance gaps: no adversarial testing, undocumented third-party models, and missing incident response. Delivered roadmap to secure high-risk KYC and transaction monitoring systems against evolving threats.

cryptocurrency-exchange nist-ai-rmf iso-42001 adversarial-testing ai-security-audit

Updated Mar 14, 2026

mcp-tool-shop-org / mcp-stress-test

Star

Red team toolkit for stress-testing MCP security scanners — find detection gaps before attackers do

python security mcp stress-testing fuzzing red-team ai-safety testing-framework security-testing llm llm-security model-context-protocol mcp-server adversarial-testing

Updated Mar 18, 2026
Python

NathanMaine / garak-compliance-probes

Star

Compliance-focused vulnerability probes for NVIDIA garak, targeting LLMs in regulated industries (CMMC, NIST, HIPAA, DFARS)

nist nvidia compliance hipaa red-teaming cmmc vulnerability-testing llm-security garak adversarial-testing

Updated Feb 17, 2026
Python

North-Shore-AI / crucible_adversary

Star

Adversarial testing and robustness evaluation for the Crucible framework

machine-learning elixir otp research ai beam reliability robustness security-testing adversarial-examples adversarial-attacks red-teaming ensemble-methods statistical-testing model-robustness llm adversarial-testing nshkr-crucible

Updated Dec 29, 2025
Elixir

light-research / solana-sim-engine

Star

LLM-powered fuzzing and adversarial testing framework for Solana programs. Generates intelligent attack scenarios, builds real transactions, and reports vulnerabilities with CWE classifications.

smart-contracts fuzzing solana adversarial-testing

Updated Jan 19, 2026
Python

leenathomas01 / doctrine-of-externalization

Star

A governance doctrine for AI systems based on explicit oversight. Externalizes trust and uncertainty into auditable, adversarial, and constrainable layers. A design framework, not an implementation guide.

provenance systems-architecture explainability ai-governance adversarial-testing risk-containment ai-safety-design oversight-framework capability-gating

Updated Mar 1, 2026

nulone / pytest-adversarial

Star

Generate adversarial pytest tests using LLM. Tries to find edge cases in your Python code.

python testing ai pytest openai test-generation llm adversarial-testing

Updated Jan 22, 2026
Python

Improve this page

Add a description, image, and links to the adversarial-testing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the adversarial-testing topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adversarial-testing

Here are 37 public repositories matching this topic...

sherifkozman / the-red-council

jhlee0409 / elenchus-mcp

stchakwdev / Gaslight_EVAL

alejandrosaenz117 / bonfires-marketplace

zakky8 / llm-jailbreak-taxonomy

vibheksoni / jailbench

tasumermaf / the-adversary

mcptrust / mcp-adversarial-suite

yangyihe0305-droid / llm-red-team-research

inaciovasquez2020 / urf-application-stress-test

YaswanthGhanta / llm-logical-integrity-benchmark

anotherben / claude-enterprise-skills

alpha-one-index / ai-red-teaming-index

adeolasopade / AI-Security-Audit-Cryptocurrency-Exchange-

mcp-tool-shop-org / mcp-stress-test

NathanMaine / garak-compliance-probes

North-Shore-AI / crucible_adversary

light-research / solana-sim-engine

leenathomas01 / doctrine-of-externalization

nulone / pytest-adversarial

Improve this page

Add this topic to your repo