21Lochan / 3DMark-Pro-Benchmark-Core

The Ultimate 3DMark Professional Guide 2026 - GPU Benchmarking and Performance Testing

gpu-benchmark 3dmark gaming-benchmark pro-software-2026 windows-software-2026 trial-software 3dmark-pro 3dmark-2026 ul-3dmark directx12-test

Updated May 29, 2026
HTML

brookclimberjerk / 3DMark-Professional-cracked

3DMark Professional -- GPU and gaming PC benchmark suite with DirectX 12 tests, ray-tracing benchmark, stress tests, and detailed score reports. Pro version with full features. Trial available. Compatible with Windows 10/11 (64-bit). Updated 2026.

gpu-benchmark 3dmark activation-tools gaming-benchmark pro-software-2026 windows-software-2026 trial-software 3dmark-pro 3dmark-2026 ul-3dmark directx12-test

Updated May 29, 2026

doloMing / gpu-cpu-stress-tests

This is a user-friendly Python codebase for stress testing NVIDIA GPUs, Intel CPUs, and AMD CPUs in various modes.

stress-testing nvidia-gpu cpu-benchmark gpu-testing intel-cpu amd-cpu gpu-benchmark cpu-testing

Updated Mar 2, 2025
Python

stlin256 / CUDABurner

A stress and benchmark utility for NVIDIA GPUs. Measures performance across various precisions (FP64, FP32, TF32, FP16, INT8) and monitors real-time vitals like power, temperature, and clock speeds.

benchmarking performance sparsity cpp hpc cuda nvidia stress-testing fp16 int8 gpu-benchmark fp32 fp64 fp8 fp4 tf32 tf16

Updated Dec 12, 2025
C++

crosschainer / e-peen

Benchmark CPU, GPU, storage, and RAM using Python.

benchmarking benchmark numpy pyopencl cpu-benchmark gpu-benchmark

Updated Jul 5, 2024
Python

morpheuslord / PCBench

PCBench is a versatile Python-based system performance benchmarking tool designed to empower users with insights into their hardware's capabilities. Whether you're a tech enthusiast, a PC gamer, or a developer optimizing your code, PCBench provides comprehensive benchmarking for both CPUs and GPUs.

benchmarking cpu hardware gpu monitoring-tool cpu-benchmark gpu-benchmark

Updated Sep 19, 2023
Python

MD-Zayed-Al-Sajed / Real-Time-GPU-Physics-Rendering-Benchmark-Tool

High-performance GPU benchmarking tool built with Vulkan, CUDA, and ImGui — featuring real-time physics simulation, custom rendering, and modular engine architecture.

graphics vulkan realtime imgui cuda gpu-benchmark cplusplus-

Updated Jun 9, 2025
C++

CoreDev-HUB / StressGPU

🌌 High-performance WebGL Stress Test. Advanced Raymarching fractal engine with real-time RGB shading and kernel injection.

javascript html webgl stress-test raymarching gpu-benchmark coredev-hub

Updated May 10, 2026
HTML

zzxqasdcc / Kernel-V8-GPU-Architecture-Stress-Test

Kernel-V8 is a high-performance GPU benchmarking engine built on the WebGL2 API. By rendering a complex 8th-order Mandelbulb fractal in real time, it generates intense arithmetic workloads to evaluate the stability, thermal throttling, and peak compute throughput of modern graphics hardware.

graphics stress-test webgl2 performance-testing mandelbulb raymarching gpu-benchmark

Updated Jan 12, 2026
JavaScript

amitgambhir / llm-inference-bench

Platform-agnostic benchmark harness for LLM inference endpoints. Measures TTFT, throughput, and failure rate against any OpenAI-compatible /v1/completions API (vLLM, SGLang, Baseten, RHOAI, …) and recommends a vLLM config grounded in real benchmark data.

benchmark mlops gpu-benchmark ttft openai-api inference-benchmark kserve llm fp8 llm-serving vllm llm-inference openshift-ai sglang nvidia-l4

Updated May 23, 2026
Python

yasser1-0 / FP16-vs-FP32-A-GPU-Lab-in-Frames

🎬 Explore GPU training efficiency with FP32 vs FP16 in this modular lab, utilizing Tensor Core acceleration for deep learning insights.

performance-engineering deep-learning reproducible-research cuda pytorch fp16 cupy mixed-precision nsight gpu-benchmark nvtx fp32 tensor-core

Updated Feb 20, 2026
Python

parvez86 / DeepLearning

Implementing and Visualizing Deep Learning Models

deep-learning keras cnn classification ann bert data-augmentation word-embedding gensim-word2vec gpu-benchmark tensorflow2

Updated Jun 9, 2023
Jupyter Notebook

Dartayous / FP16-vs-FP32-A-GPU-Lab-in-Frames

A reproducible GPU benchmarking lab that compares FP16 vs FP32 training on MNIST using PyTorch, CuPy, and Nsight profiling tools. This project blends performance engineering with cinematic storytelling—featuring NVTX-tagged training loops, fused CuPy kernels, and a profiler-driven README that narrates the GPU’s inner workings frame by frame.

performance-engineering deep-learning reproducible-research cuda pytorch fp16 cupy mixed-precision nsight gpu-benchmark nvtx fp32 tensor-core

Updated Apr 25, 2026
Python

ChharithOeun / wsl-benchmark

GPU vs CPU performance benchmarking for PyTorch and JAX. Works on AMD ROCm, DirectML, CUDA, MPS, CPU. Optimized for RX 5700 XT in WSL2.

python windows benchmark machine-learning performance rocm amd-gpu gpu-benchmark wsl2 directml

Updated Apr 19, 2026
Python

vasimr / llm_gpu_benchmarks

Benchmark scaled LLM inference and training.

benchmark transformers pytorch distributed-training gpu-benchmark image-inference ml-benchmark

Updated Dec 2, 2025
Python

ahmadrezarazian / OpenCL_MultiDevice_Bandwidth_Analyzer

OpenCL benchmarking tool to measure host-device bandwidth and kernel global memory throughput across GPUs and CPUs.

opencl parallel-computing memory-bandwidth gpu-benchmark gpu-validation compute-benchmark

Updated Mar 16, 2026
C

Tennisee-data / benchHUB

benchHUB is a Python-based project to parse, aggregate, and visualize system and performance benchmarks. It includes a Streamlit dashboard to display and compare results.

mac benchmarking data-science machine-learning hardware leaderboard gpu-computing leaderboards performance-testing gpu-benchmark fastapi streamlit benchmarking-utility apple-silicon cpu-benchmarks gpu-benchmarking

Updated May 26, 2026
Python

capetron / ptg-gpu-bench

GPU benchmark suite for AI inference workloads. Test throughput, latency, and power efficiency across NVIDIA, AMD, and Apple Silicon. By Petronella Technology Group.

pytorch nvidia performance-testing mlx gpu-monitoring gpu-benchmark apple-silicon ai-inference llm vllm ai-infrastructure

Updated Apr 14, 2026
Python

paulplee / ppb-mcp

Poor Paul's Benchmark as an MCP server. Queryable GPU inference data — quantization, throughput, VRAM, concurrent users — for Claude Desktop, Cursor, Windsurf, Cline, and any MCP client. Self-host or use the free hosted endpoint.

inference quantization gpu-benchmark huggingface llm local-llm model-context-protocol fastmcp poor-pauls-benchmark

Updated May 27, 2026
Python

paulplee / poor-pauls-benchmark

Benchmark your GPU against any GGUF model and contribute to the public leaderboard. Measures throughput, TTFT, ITL, and VRAM limits across quantizations and context sizes.

quantization vram gpu-benchmark huggingface home-lab inference-benchmark llm llama-cpp local-llm poor-pauls-benchmark

Updated May 27, 2026
Python

gpu-benchmark

Here are 24 public repositories matching this topic...
