Appgithub-com-atomicbot-ai-atomic-llama-cpp-turboquant

Install Pinokio

Log in Register

Log in Register

atomic-llama-cpp-turboquant

https://github.com/atomicbot-ai/atomic-llama-cpp-turboquantupdated 5/13/2026, 5:26:59 PMindexed 6/6/2026, 7:05:25 PM

llama.cpp fork with TurboQuant WHT-rotated KV cache & weight compression + Gemma 4 MTP and Qwen 3.6 NextN speculative decoding (+30-50% throughput).

Pinokio Apps Using This Repo

No Pinokio apps using this repo yet.

Community tagsLoading...

Community

Sort by

Post about atomic-llama-cpp-turboquant...Post

Loading...

Claim ownership

Own atomicbot-ai/atomic-llama-cpp-turboquant?

Check-ins (0)

Platforms (0)

No reports yet.

Arch (0)

No reports yet.

GPU (0)

No reports yet.

RAM (0)

No reports yet.

VRAM (0)

No reports yet.

Recent commits

Merge pull request #14 from AtomicBot-ai/b1-mtp-qwen-rebase

Ooze2 months ago0a635dc

Enhance multimodal support and speculative decoding in atomic-llama-cpp-turboquant

Biogenic Ooze2 months agoead60fb

Merge pull request #13 from AtomicBot-ai/b1-mtp-qwen-rebase

Ooze2 months ago8893692

Update documentation and scripts for AtomicChat UDT quantization and Qwen 3.6 NextN enhancements

Biogenic Ooze2 months agoc7e6138

Enhance UDT benchmarking scripts and add chat calibration sample

Biogenic Ooze2 months ago33e9b6d

Enhance documentation and scripts for AtomicChat UDT quantization and Qwen 3.6 NextN

Biogenic Ooze2 months ago5c1717f

Merge pull request #11 from AtomicBot-ai/b1-mtp-qwen-rebase

Ooze2 months ago514e600

Update benchmark results and documentation for Qwen 3.6 NextN and Gemma 4 MTP

Biogenic Ooze2 months ago877c27b

Merge origin/feature/turboquant-kv-cache into b1-mtp-qwen-rebase

Biogenic Ooze2 months ago00e8d49

Merge pull request #10 from sujitvasanth/fix/turbo-rope-shift-gemma4

Ooze2 months agob1a7d71

Pinokio