Posted by voisin 3 days ago
> The Jeopardy! match with Watson was essentially rigged due to the computer’s superhuman reaction time.
From many anecdotes I've heard from previous Jeopardy contestants, buzzer technique is more than half the battle. That's part of the reason Ken Jennings was able to go on winning as long as he did - he continuously refined and developed better intuition for when to buzz.
Many contestants (a large portion of whom likely participated in college Trivia Bowls and were seasoned trivia buffs) could answer the vast majority of Jeopardy clues with relative ease.
The reason for this is that Jeopardy, in order to have household appeal, cannot legitimately engage in an increasingly difficult arms race for its questions - it has to have a capped ceiling in terms of clue difficulty.
1. We've advanced the state of the art in terms of being able to reason and parse complex language to a level the general population wasn't aware of.
2. Beating humans on the buzzer, of all things, feels intrinsically unfair and there's no graceful way to account for that.
Watson's ability to interpret, correctly, what most of the Jeopardy! "answers" were looking for was a sea change. It really was as big as Deep Blue, the crossing of a frontier we collectively thought would likely be much further in the future, and it was here now: complex clues, difficult to reason out context, and the ability to search its corpus of knowledge and assign a confidence value and then produce accurate results... it was mind blowing.
It's funny because, in a pinch, ChatGPT will answer essentially every Jeopardy! clue correctly every single time now, owing to the specific nature of generative transformers and the advent of attention, specifically. But Watson was pre "attention is all you need!" It was blood, sweat, and tears, and markov chains, and all kinds of craziness. It was a huge swing, and the fact that it actually worked was remarkable.
But back to the buzzer: most people sitting at home probably don't realize that the buzzer is locked out from buzzing in until the host (then Alex, now Ken) completes the read of the clue. At that point, an off-stage producer presses a switch to unlock the buzzers. So prospective players spend their weeks before their tape date spamming a retractable pen in front of the TV trying to nail the timing from the cadence of the host's voice, and hoping they can get it right. Because getting it wrong penalizes you: if you attempt to buzz in before the lock is lifted, your buzzer is individually locked for another 250ms or so. Another producer has a screen that monitors the buzzer activity so they can see if players are habitually buzzing in too early and getting locked, or to monitor for malfunctions, and it's wild to watch it light up like a Christmas tree in the 100 or so last milliseconds of the clue.
This is why you see players spam the buzzer: if they're locked out, the best thing they can do is keep attempting to get back in until the lock lifts. This is also why on rare occasions, you'll see all three players spamming and no one actually getting buzzed in: everyone locked.
So how do you translate this to a computer? The answer was basically to create a makeshift "finger" that would activate the buzzer, but would consistently buzz in at exactly 28ms after the lock cleared... every single time (so long as its confidence was high enough).
I think it's fair to say that there's nothing, at _all_, remarkable about a computer beating a human on a buzzer. It's a signal loop: wait for lock to clear, receive signal lock has cleared, activate buzzer 28ms later. Every device with a stopwatch or timer or clock is more accurate than a human attempting the same timing exercise. It's a solved problem, and not an interesting one. And it disproportionately took the wind out of the sails from the human competitors, who are both the most world-class, best-on-the-planet Jeopardy! players, who were losing by milliseconds to that signal loop.
It's still remarkable Watson was able to successfully answer so many clues correctly and end up winning, but the buzzer aspect was an (intractable) bummer.
As anyone who has experienced early video game consoles can attest --- even the <2MHz CPU of a NES is far more than fast enough to beat a human at reaction time, which is what makes it all the more impressive when humans do things like https://news.ycombinator.com/item?id=38841080
“Answering Jeopardy questions” is one of those tasks where being close to correct is not good enough. A system that hallucinates plausible-sounding answers would be a kind of awful approach to creating software that’s meant to 100% a game like Jeopardy
Go try it. Hit up J-Archive, prompt it that you'll feed it Jeopardy answers and want a simple response in the form of a question. I've yet to stump it, even with stuff like Rhyme Time: "a racket-&-net sport peril" (tennis menace) or other more complex clues. Every once in awhile, it'll switch to the thinking model, but most were immediate and correct.
ChatGPT is more than fast enough to get an answer under the same constraints.
Easy example:
"...dismissed as “disappointing,” “unfair,” and—my favorite—“a gimmick”..."
Is it really important for me to know Clare McNear's favourite? A couple of years ago I wouldn't have thought anything of it but now I'm so sickened constantly by LLMs adding so much needless cruft to everything they write I'm reading this thinking "just tell me the story, for the love of humanity!"
Doesn’t that mean it is true to form? Can you not imagine that some people like reading stories written by others?
Yes, that's exactly what it means. Like I say, a few years ago I wouldn't have thought anything of it, but now it's behind painfully clear how formulaic that kind of writing is.
> Can you not imagine that some people like reading stories written by others?
This is quite uncharitable and a straw man. I love reading stories written by others. I'm complaining very specifically about the over reliance on tropes of the form.
I also wasn't leading with the complaint, I was leading with the curiosity I feel about how writing will develop as a result of what I'm seeing here, as I'm sure I'm not the only one who suddenly feels different about human made writing. That's not too say I don't lament it, either, but it is what it is.