
oMLX 0.3.9rc1 Releases With Mac Stability Fixes And Chunked Prefill

Original post

oMLX 0.3.9rc1 released. Highlights:

- Low-memory Macs stay stable instead of getting killed by the OS
- DFlash bumped to v0.1.7 (thanks to @bstnxbt's dflash-mlx). Qwen thinking/GDN fix, etc.
- Chunked prefill. A long prompt no longer blocks decode for everyone else
- Multi-tasking in the admin chat. Run multiple chats in parallel
- Real-time memory bar in the admin dashboard
- Hermes Agent quick launch, "omlx launch hermes"

Plus a lot of bug fixes and new contributors in this cycle. Thanks everyone!

https://github.com/jundot/omlx/releases/tag/v0.3.9rc1

2:23 AM · May 19, 2026
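
For readers unfamiliar with the chunked prefill highlight in the post, here is a minimal Python sketch of the general idea. It is not oMLX's scheduler. `Request`, `schedule`, `max_step_tokens`, and the policy of one prefill chunk per step are hypothetical, chosen only for illustration. The sketch assumes the cost of a scheduler step grows with the number of tokens it processes, so capping the prefill chunk caps how long requests that are already decoding wait for their next token.

```python
from dataclasses import dataclass


@dataclass
class Request:
    # Hypothetical request model, not an oMLX type.
    name: str
    prompt_tokens: int   # total prompt tokens that must be prefilled
    decode_tokens: int   # tokens still to be generated
    prefilled: int = 0   # prompt tokens processed so far


def schedule(requests, chunk_size):
    """Simulate a toy scheduler and return a log of what each step processed.

    In every step, each request whose prompt is fully prefilled decodes one
    token, and at most one request still in prefill advances by up to
    chunk_size prompt tokens.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    log = []
    while any(r.prefilled < r.prompt_tokens or r.decode_tokens > 0 for r in requests):
        step = []
        prefill_used = False
        for r in requests:
            remaining = r.prompt_tokens - r.prefilled
            if remaining > 0:
                # Only one prefill chunk is admitted per step.
                if not prefill_used:
                    n = min(chunk_size, remaining)
                    r.prefilled += n
                    step.append((r.name, "prefill", n))
                    prefill_used = True
            elif r.decode_tokens > 0:
                r.decode_tokens -= 1
                step.append((r.name, "decode", 1))
        log.append(step)
    return log


def max_step_tokens(log):
    """Largest number of tokens processed in any single step (a latency proxy)."""
    return max(sum(n for _, _, n in step) for step in log)


if __name__ == "__main__":
    # A short chat that is already decoding, plus a newly arrived long prompt.
    chunked = schedule([Request("chat", 0, 3), Request("long", 10, 1)], chunk_size=4)
    unchunked = schedule([Request("chat", 0, 3), Request("long", 10, 1)], chunk_size=10)
    print("chunked worst step:", max_step_tokens(chunked))
    print("unchunked worst step:", max_step_tokens(unchunked))
```

In this toy model, the chunked run spreads the long prompt over several steps, so the worst step handles 5 tokens instead of 11 and the chat request keeps producing a token every step. The trade-off is that the long prompt needs more steps before its first token appears.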