Interaction Models - Hacker News

Nimitz14 8 hours ago|

Really really cool. If they can serve this efficiently it would disrupt a lot of things.

zuzululu 7 hours ago||

Am I the only person not impressed by this? It still feels awkward with the pauses, and doesn't OpenAI already offer voice cadence?

gyre007 6 hours ago|

Same here. I don't see anything there that others can't eventually catch up on. I must be missing something here. It's all cute, but mmm

mchusma 5 hours ago||

What I will say is that this is probably the first model after Gemini Live to do some of these things. It feels similar to Gemini Live, which I don't think is what they were going for exactly, but IMO it is still impressive, as I don't think anyone else has matched full-duplex video/audio/tool calling.

The next Gemini releases are coming next week though, so we will see how that matches up!

modeless 8 hours ago|

This deserves to be at the top of HN, shame it seems like it's not going to make it. Some of the demos are hilarious. Clearly having the model appropriately choose when to speak is a major thing that has been missing from voice models to date. It seems like the latency is still a touch too high to be truly human-like though.