§ — Blog
Notes from the team.
Technical deep dives, open-source releases, and developer guides for on-device AI.
Jun 22, 2026 · jetson
How to build an offline voice assistant on NVIDIA Jetson Orin
A builder's guide to running a fully offline voice agent on Jetson Orin: local STT, an SLM, and TTS, with real latency numbers and a look at where the DIY Whisper + llama.cpp + Piper stack breaks down.
Jun 20, 2026 · on-device-ai
Why we ripped cloud voice out of our robots: from Deepgram + OpenAI + ElevenLabs to on-device
We started building robot voice the easy way — Deepgram, OpenAI, and ElevenLabs glued together in Python. It demoed great and fell apart in the real world. Here's everything that went wrong, and why we're rebuilding the whole stack on-device.
Jun 10, 2026 · open-source
turboquant.cpp: near-optimal vector quantization in 400 lines of C++, no training required
We open-sourced turboquant.cpp, a C++23 implementation of TurboQuant: compress embeddings to 1-4 bits per coordinate with provable distortion bounds — no training, no codebooks.
FIG. — Engineering notes, benchmarks, and field reports.