Thinking Machines wants to build an AI that actually listens while it talks

https://techcrunch.com/wp-content/uploads/2026/05/GettyImages-2271655163.jpg?resize=1200,879

In Brief

9:52 PM PDT · May 11, 2026

Thinking Machines Lab, the AI startup founded last year by former OpenAI CTO Mira Murati, on Monday announced something called interaction models, which, at its essence, sounds like AI that can interrupt you.

Right now, every AI model you’ve ever used works the same way. You talk, it listens. It responds, you listen. Thinking Machines is trying to change that by building a model that processes your input and generates a response at the same time, so it’s more like a phone call than a text chain.

The technical term for this is “full duplex,” and the company claims its model, TML-Interaction-Small, responds in 0.40 seconds, which is roughly the speed of natural human conversation and significantly faster than comparable models from OpenAI and Google.

Still, this is a research preview, not a product. The company isn’t releasing it to...

Copyright of this story solely belongs to techcrunch.com. To see the full text click HERE

Read more