A developer on Reddit shared a Dockerized implementation of Nemotron 3.5 ASR, migrating from Parakeet. The system supports over 40 languages and features a native streaming architecture that avoids full-file buffering. Using the onnxruntime-genai backend, it achieves 4.5x real-time speed on CPU, with CUDA support planned but untested.
Prominent AI scholar and commentator Nathan Lambert, in his latest edition of Latest Open Artifacts (#20), has compiled the major recent developments in the…