p-eagle

Here is 1 public repository matching this topic...

carlosfundora / sglang-1-bit-turbo

AMD ROCm (gfx1030) inference fork with RotorQuant/TurboQuant KV compression, PHANTOM-X zero-copy draft speculation, EAGLE3 speculative decoding, 12 RDNA2 crash fixes, and PrismML Bonsai Q1_0_G128 1-bit GGUF support.

triton hip bonsai rocm amd-gpu gguf speculative-decoding sglang rdna2 eagle3 turboquant prismml gfx1030 p-eagle radix-cache

Updated Apr 13, 2026
Python
