Always bet on text (2014)

Posted by jesseduffield 12/26/2025

Always bet on text (2014)(graydon2.dreamwidth.org)

347 points | 181 commentspage 2

begueradj 12/27/2025|

>Text is the oldest and most stable communication technology

That's completely false: Images were used for storytelling thousands of years before text (compare for instance the Lascaux paintings which are more than 17 000 years old, the Göbeklitepe sculptures and stone drawings (more than 12 000 years old), or the the more than 15 000 paintings of the City of Sefar (Algeria) which some estimate to date back as far as 20 000 years ago to the earliest text known in human history, Kish Tablet, Mesopotamia, around 3500 years old.

Dwedit 12/27/2025||

Saying that a 20x20 image of a Twitter logo is 4000 bytes is just so wrong.

The image is of a monochrome logo with anti-aliased edges. Due to being a simple filled geometric shape, it could compress well with RLE, ZIP compression, or even predictors. It could even be represented as vector drawing commands (LineTo, CurveTo, etc...).

In a 1-bit-per-pixel format, a 20x20 image ends up as 400 bits (50 bytes).

0xCE0 12/27/2025||

https://futuretextpublishing.com/ --> books vol 1-5

And what comes to original article, there is no "text [systems]" (or there is, like there are "number [systems]", just made up). "Text" like this very thing you are reading is 2D drawing. There are no character glyphs of any kind (latin, logograms etc.) defined by universe*, they are human invented and stored/interpreted at human collective level. Computers don't know anything about text, only "numbers" of some bit width, and with those numbers a system must be created that can map some number representation to some drawing in some method (e.g. with bitmap). Also there is a lot of difference between formal/executable and natural human languages. Anyways, it's not a about some text format/encoding, it's the human/computer defined/interpreted non-linguistical meaning behind it (Wittgenstein).

* DNA/RNA can be one such "universal character glyph/string", as the "textual" information is physically constructed and interpreted.

jamesgill 12/26/2025||

Related: https://sive.rs/plaintext

sweetsocks21 12/27/2025||

For a computer, text is a binary format like anything else. We have decades of tooling built on handling linear streams of text where we sometimes encode higher dimensional structures in it.

But I can't help feel that we try to jam everything into that format because that's what's already ubiquitous. Reminds me of how every hobby OS is a copy of some Unix/Posix system.

If we had a more general structured format would we say the opposite?

textnotalwabest 12/27/2025||

Text is not the best medium for the following situations:

- I want to learn how to climb rock walls

- I want to learn how to throw a baseball

- I want to learn how to do public speaking

- I want to learn how to play piano

- I want to make a fire in the woods

- I want to understand the emotional impact of war

- I want to be involved in my child's life

malloryerik 12/27/2025||

I agree with all of these except the emotional impact of war where though slower a novel or memoir might work best. Think "All Quiet on the Western Front." At the same time we do want images of the war and time for grounding.

derriz 12/27/2025|||

I don’t see the relevance to the topic. I could preface your list with something like “The monkey wrench is not the best tool for the following situations:”. It’s kinda vacuously true in a meaningless way but without expansion adds nothing to a discussion about the relative merits of monkey wrenches versus other similar tools like pliers or vice grips.

awesome_dude 12/27/2025|||

Why did you create an account just to post that?

In text format no less

marginalia_nu 12/27/2025|||

Honestly text is pretty good for conveying all of those things, though you'd also need to supplement it with practice in all but the emotional impact of war bit.

cindyllm 12/27/2025||

[dead]

seveibar 12/27/2025||

This is sort of the premise of all of us electronics-as-code startups. We think that a text-based medium for the representation of circuits is a necessity for AI to be able to create electronics. You can't skip this step and generate schematic images or something. You have to have a human-readable (which also means AI-compatible) text medium. Another confusion: KiCad files are represented in text, so shouldn't AI be able to generate them? No- AI has similar levels of spatial understanding to a human reading these text files. You can't have a ton of XY coordinates or other non-human-friendly components of the text files. Everything will be text-based and human-readable, at least at the first layer of AI-generation for serious applications

zephen 12/27/2025||

I agree 99%.

The 1% where something else is better?

Youtube videos that show you how to access hidden fasteners on things you want to take apart.

Not that I can't get absolutely anything open, but sometimes it's nice to be able to do so with minimal damage.

ilaksh 12/27/2025|

I wonder if some day there will be a video codec that is essentially a standard distribution of a very precise and extremely fast text-to-video model (like SmartTurboDiffusion-2027 or something). Because surely there are limits to text, but even the example you gave does not seem to me to be beyond the reach of a text description, given a certain level of precision and capability in the model. And we now have faster than realtime text to video.

zephen 12/28/2025|||

Maybe?

To the extent that that could work, I would imagine that I, personally, would be happy reading the textual description instead of watching the video, and for me, we'd now be even closer to text wins 100% of the time.

In other words, it's not that you _can't_ give excellent descriptions that would obviate the need for video, it's just that people _don't_, even, or perhaps even especially, when they think they do.

If someone writes text that creates a video that shows exactly how to get something apart, then _presumably_ they also watch the video to make sure it works.

So the video becomes a debugging tool for their instructions. Perhaps not as good as watching 100 people do it, but maybe even better in some ways.

So the video codec you describe could be a useful tool to help create more programmers.

https://www.commitstrip.com/en/2016/08/25/a-very-comprehensi...

tsimionescu 12/29/2025||

I think it's quite obvious that any textual description that had any hope of being converted to video in this way would be entirely useless for a human mind. It wouldn't say something like "the fastener is on the under side of the chair about 3/5s of the way", it would say somerhing like "there is a square-shaped object in view 5cm from the top of the view and 120cm from the right; the object is 2cm x 2.2cm, color 0x7F325A".

zephen 12/29/2025||

> entirely useless for a human mind.

You may be right, although, of course, current LLMs often do the right thing with "about 3/5ths of the way."

OTOH, as someone who has done CAD and schematic drawings by programming, I am not 100% convinced about the inevitability of unreadability.

In any case, though, the bar is not really whether any human can interpret the text, but whether the average human will interpret the text or video faster, and here, to your point, yes, the video probably still wins handily.

The closest analogy I can think of is animated math gifs like these:

https://en.wikipedia.org/wiki/User:LucasVB/Gallery

Which can be a huge aid in learning.

But this leads to another conundrum. Where do animated GIFs end and video begin? Because I could see a simple line-drawing style animated GIF being sufficient for most purposes.

egypturnash 12/27/2025|||

This sounds incredibly precarious and prone to breaking when you update to a new model.

ilaksh 12/27/2025||

It would be impossible to change the model. It would be like a codec, like H.264 but with 1-2GB of fixed data attached to that code name. Changing the model is like going to H.265. Different codec.

jesseduffield 12/26/2025||

Post from the creator of Rust, 11 years ago. Highly relevant to today.

stevenjgarner 12/27/2025|

From an information theory perspective, "Always bet on text" is a plea for symbolic efficiency. It argues that while binary or visual formats might have higher bandwidth, they often have lower meaning-per-bit for the complex, abstract logic that runs civilization. Text is the most entropy-resistant, highly-compressible, and universally-decodable format we have ever invented.

jcgl 12/27/2025|

This doesn’t track for me. How can text have lower bandwidth but higher meaning-per-bit? How does that jibe with entropy resistance (in an information theoretic sense)?

Text seems worse to me. First of all, binary encodings are a superset of text encodings. But less abstractly, binary enables content-transparent compression and error correction.

Like other commenters have pointed out, the downside of binary is needing sufficient tooling. Depending on the domain, that can indeed be a downside. But if that critique isn’t relevant for a given context, it’s extremely unlikely that plaintext (ASCII?) is superior.

Text seems more like the answer to a plea for lowest common denominator of tooling.

stevenjgarner 12/27/2025|||

Human-readability is the ultimate error correction for the most expensive link in the system: the human-in-the-loop.

The information-theoretic justification is that binary's efficiency assumes a perfectly known codec, but the entropy of time destroys codecs (bit rot/obsolescence). Text sacrifices transmission efficiency for semantic recovery - it remains decodable even when the specific tooling is lost, making it the most robust encoding for long-term information survival.

jcgl 12/27/2025|||

Human-readability isn't a feature of ASCII though. It's a feature of any encoding for which the user has sufficient tooling. Sure, that's an easier bar to clear for ASCII than for binary formats in general. But as I said, as long as you have the tooling, binary is no less readable. (Also, many binary formats will store strings as ASCII or UTF-8, so you can use the strings utility or whatever you want against them.)

> the entropy of time destroys codecs (bit rot/obsolescence)

Okay, so you don't mean "entropy" in an information theoretic sense. You're just talking about the decay of time. That's a much more specific claim than your original one, and I grant than that may be true for some use-cases. But you don't need semantic recovery if you don't need to do recovery at all, i.e. if your data format and/or storage medium transparently provide redundancy and/or versioning.

tsimionescu 12/29/2025|||

> it remains decodable even when the specific tooling is lost, making it the most robust encoding for long-term information survival.

This may be true if you mean text written on a physical medium (especially if it's engraved in stone or clay), but it's not true at all if you mean text stored in a computer medium. Text is just binary with a dedicated codec. Good luck interpreting Chinese plain text files after humanity has forgotten about Unicode and UTF-8.

While text-based representations may be easier to decipher than random binary data even without knowing the encoding (as in an archeological setting), it's hardly going to be the easiest. Bitmaps, for example, have a much more limited set of symbols than Unicode, so I'd bet it would be much easier to display a long lost .bmp file than a random .txt file even a few hundred years from now. Same goes for raw audio, too. Now, JPEG and MP3 might be much more difficult, because the encoding is doing much more work.

More comments...