Top
Best
New

Posted by ahamez 11/2/2025

Why do AI models use so many em-dashes?(www.seangoedecke.com)
98 points | 96 commentspage 3
neuroelectron 11/2/2025|
I wonder what happens to all that 18 century books scanning data. I imagine it stays proprietary and I've heard a lot of the books they scan are destroyed afterwards.
iddan 11/2/2025||
I’m now reading Pride and Prejudice (first edition released in 1813) and indeed there are many em dashes. It also includes language patterns the models didn’t pick up (vocabulary, to morrow instead of tomorrow)
moffkalast 11/2/2025|
I'm gonna start calling it yes terday.
keiferski 11/2/2025|||
Yester-day feels plausible and kind of elegant.
hdgvhicv 11/2/2025||||
Yesterday’s yes terday is today’s yes today.
hshdhdhehd 11/2/2025||||
Yes. Turd day.
DonHopkins 11/2/2025|||
All my trou bles were so far away.
AbstractH24 11/3/2025||
My question is given their satirical association with AI, why haven’t the models been manually optimized not to use them?
qubex 11/5/2025||
I’m amongst those who used to use em-dashes and now seeks to actively avoid them.
danielodievich 11/3/2025||
In Russian written languages, the quotes for the people speaking are prefixed with em-dash, instead of double-quoted like it would be in typical English book:

Instead of

"The time has come," the Walrus said,

"To talk of many things:"

... it would be spelled as

— The time has come, — the Walrus said,

— To talk of many things:

I wonder how much of russian language content was in training model.

kristopolous 11/2/2025||
Are people surprised that training biases a distinct style? I'd think it's kind of expected
byyoung3 11/2/2025||
Because Sam Altman said so
DonHopkins 11/2/2025|
Then I prefer Sam Altman's pesky em-dashes to Elon Musk's relentless white supremacist propaganda.
byyoung3 11/5/2025||
Cool story bro
throwaway81523 11/2/2025||
I always figured it was because of training on Wikipedia. I used to hate the style zealots (MOStafarians in humorous wiki-jargon) who obsessively enforced typographic conventions like that. Well I still hate them, but I'm sort of thankful that they inadvertently created an AI-detection marker. I've been expecting the AI slop generators to catch on and revert to hyphens though.
kentbrew 11/3/2025||
Robert A. Heinlein used a lot of em-dashes and much of the Internet was created by Heinlein fanboys?
IshKebab 11/2/2025|
The conclusion is really a guess unfortunately.
More comments...