Posted by FromTheArchives 9/12/2025
Of course people use the em-dash, and of course LLMs use them at least 10x-100x more than your average human writer. Also, they add nothing to writing, 99.8% people just use an en-dash when typing where an em-dash would be used in print, and absolutely nothing is lost. Some dickheads (like myself) have used a compose key (or similar) to use actual em-dashes in order to seem sophisticated online.
The only people who need the em-dash, as far as I know, are Spanish-language writers. As for LLM-shaming, isn't it more shameful when you publish an article that could easily be entirely written by LLM, but definitely wasn't, like this one?
edit: articles like this make me want to misuse flagging.
Or like me, because they grew up in a different location, era, or career path where proper typography, spelling, grammar, and punctuation matter more than it does for most (print, web dev, advertising, etc) and now the use of that compose-key is just pure muscle-memory like high-speed "touch typing" is.
In certain contexts, em dashes are perfectly natural and human. That being said, everyone has encountered articles and posts that read so obviously like AI, and in those contexts the presence of numerous em dashes is certainly an additional data point.
The reason em dashes are a giveaway for AI generated text is simply because there is no em dash key on the keyboard - only an en dash key. The dash I used in that last sentence was an en dash, not an em dash.
Some publishing applications (including Microsoft Word) will automatically convert en dashes to em dashes where appropriate. But most email apps, chat apps, online posts/comments, and practically any application not designed for writing actual printed publications will not do that conversion for you. And without a dedicated key, it is far too cumbersome for most people to bother. They will just leave it as an en dash.
So yes, the em dash is still a reliable indicator of AI-generated content in many contexts.
But I agree that because LLMs are trained on public documents, and most of those are written in Microsoft Word which has auto-format enabled by default, that is probably the source of so many LLMs using them.
Almost nobody, relatively speaking, even knows they exist, let alone goes out of their way to figure out the ALT code combination to use them. Most people can’t get their, they’re, and there right.
You are right. Thanks for catching that.
And yet here we ware.
But I think worst of all it just gives me the fucking creeps, some uncanny-valley bullshit. I see hyphens a million times a day then out of nowhere comes this creepy slender-man looking motherfucker that's just a little bit too long than you'd expect or like, and is always touching all the letters around it when it shouldn't need to. It stands out looking like a weird print error... on my screen! Hopefully it keeps building a worse and worse reputation.