Thinking Machines’ TML-Interaction-Small: Setting New Standards in Real-Time Speech Processing
Thinking Machines presents TML-Interaction-Small, a 276B-parameter model for natural real-time speech interaction. The encoder-free model uses 200 ms microturns and demonstrates outstanding cache efficiency. Skepticism grows around TurboQuant, while open-source models continue to improve rapidly and outpace Moore’s









