Posted by mooreds 13 hours ago
Would love to see benchmarks on Mac Studio with its 7.4 GB/s SSD bandwidth — feels like the sweet spot for this technique.
But a speech-to-text and text-to-speech system that I know is "understanding" me would be great compared to hold music. The shop could even sell it as "As a small shop, most of our employees are busy fixing cars, so we are using AI to help with calls" (although people who are anxious about AI stealing jobs might then hang up). The robot can ask me what I need, and then say "So for [this service], the price would be..." (to confirm to the caller what it has understood).
If the AI can even look at gaps in the shop's schedule and set an appointment time, the customer might even be happy that they just spent a minute on the phone instead of 10+...
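For what it's worth, the "look at gaps in the shop's schedule" part is the easy bit and needs no model at all. A minimal sketch, assuming bookings are stored as (start, end) datetime pairs and the shop has fixed opening hours; the function name and data layout are made up for illustration, not taken from any real scheduling product:

```python
from datetime import datetime, timedelta

def first_free_slot(booked, open_time, close_time, duration):
    """Return the start of the earliest gap of at least `duration`, or None.

    `booked` is a list of (start, end) datetime pairs (hypothetical layout).
    """
    cursor = open_time  # earliest moment not yet known to be taken
    for start, end in sorted(booked):
        # Is the gap between the cursor and the next booking long enough?
        if start - cursor >= duration:
            return cursor
        # Otherwise skip past this booking (max handles overlapping bookings).
        cursor = max(cursor, end)
    # Check the tail of the day after the last booking.
    if close_time - cursor >= duration:
        return cursor
    return None

day = datetime(2024, 1, 8)
booked = [
    (day.replace(hour=9), day.replace(hour=10, minute=30)),
    (day.replace(hour=11), day.replace(hour=12)),
]
slot = first_free_slot(
    booked,
    open_time=day.replace(hour=9),
    close_time=day.replace(hour=17),
    duration=timedelta(minutes=60),
)
print(slot)
```

The speech layer would only need to turn "brake job, sometime Monday" into a duration and a day; the slot itself comes from ordinary code, so it can't be hallucinated.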
A friend of mine worked for a call center that did car rentals. Old people would call them and ask to rent a car.
Maybe the AI system should have "Press 1 to talk to AI, press 2 to leave a message" so experts like you can press 2.
Even if the new model that came out last week totally fixed all the problems this time for real, most people's experience with chatbots is that they are prone to misunderstanding or making false statements ("hallucinations").
I have yet to experience any degree of confidence in any output from an LLM, so I'd rather leave the message. I don't know how common this point of view is.
I went through hell on a home remodel project 6 months ago around this stuff. I got a quote from a reputable plumber and went to schedule the rough-in session. An AI receptionist answered, got confused during the scheduling flow and could not understand my address, asking me to repeat it over and over. And it couldn't forward me to a human.
If I'm paying you tens of thousands of dollars for remodeling work, I damn well better be able to get in touch with you. I found a different contractor and never looked back.
Claude will hallucinate anyway, sometimes.
I don't think there's any way around this other than a CLI or MCP tool that says "press the 'play prerecorded .WAV file' button that plays the brake repair service info and prices."
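A rough sketch of what that restriction could look like, assuming the model is only allowed to emit an intent label and everything the caller actually hears comes from a human-recorded file. The clip paths, intent names, and `select_clip` function are all hypothetical, and this is not tied to any particular MCP server or telephony API:

```python
# Hypothetical table of human-recorded clips. Prices live in the recordings,
# never in model output.
CLIPS = {
    "brake_repair": "clips/brake_repair_prices.wav",
    "oil_change": "clips/oil_change_prices.wav",
}

# Played whenever the intent is not in the table (hypothetical path).
FALLBACK = "clips/leave_a_message.wav"

def select_clip(intent: str) -> str:
    """Map a model-produced intent label to a prerecorded clip path.

    Anything outside the fixed table falls through to the fallback clip,
    so free-form model text never reaches the caller.
    """
    key = intent.strip().lower().replace(" ", "_")
    return CLIPS.get(key, FALLBACK)

print(select_clip("Brake Repair"))
print(select_clip("engine swap, should be about $50"))
```

The model can still pick the wrong clip, but the worst case is an irrelevant recording or the leave-a-message prompt, not an invented price.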