The Codex app illustrates the shift left of IDEs and coding GUIs

Posted by straydusk 2 days ago

The Codex app illustrates the shift left of IDEs and coding GUIs(www.benshoemaker.us)

79 points | 199 commentspage 3

Xx_crazy420_xX 2 days ago|

Why create IDE without IDE features? Whats the benefit of this over using IDE with Codex plugin? I don't believe that you can review the code without code traversal by references, so looks like its directed towards toy projects/ noobs. And the agents are not yet near the autonomy that will omit the code review in complex systems.

bigwheels 2 days ago||

Why do the illustrations bear such a strong resemblance to those in the Gas Town article?

https://steve-yegge.medium.com/welcome-to-gas-town-4f25ee16d...

Is it a nano banana tendency or was it probably intentional?

straydusk 2 days ago|

It's nano banana - I actually noticed the same thing. I didn't prompt it as such.

Here's the prompt I used, actually:

Create a vibrant, visually dynamic horizontal infographic showing the spectrum of AI developer tools, titled "The Shift Left"

Layout: 5 distinct zones flowing RIGHT TO LEFT as a journey/progression. Use creative visual metaphors — perhaps a road, river, pipeline, or abstract flowing shapes connecting the stages. Each zone should feel like its own world but connected to the others.

Zones (LEFT to RIGHT):

1. "Specs" (leftmost) - Kiro logo, VibeScaffold logo, GitHub Spec Kit logo

   Label: "Requirements → Design → Tasks"

2. "Multi-Agent Orchestration" - Claude Code logo, Codex CLI logo, Codex App logo, Conductor logo

   Label: "Parallel agents, fire & forget"

3. "Agentic IDE" - Cursor logo, Windsurf logo

   Label: "Autonomous multi-file edits"

4. "Code + AI" - GitHub Copilot logo

   Label: "Inline suggestions"

5. "Code" (rightmost) - VS Code logo

   Label: "Read & write files"

Visual style: Fun, energetic, modern. Think illustrated tech landscape or isometric world. NOT a boring corporate chart. Use warm off-white background (#faf8f5) with amber/orange (#b45309) as the primary accent color throughout. Add visual flair — icons, small illustrations, depth, texture, but don't make it visually overloaded.

Aspect ratio: 16:9 landscape

ottah 2 days ago||

> Where IDEs are headed and why specs matter more than code.

We are very far away from this being a settled or agreed upon statement and I really struggle to understand how one vendor making a tool is indicative of an industry practice.

frank00001 2 days ago||

Clearly written by someone who has no systems of importance in production. If my code fail people loose money, planes halts, cars break down. Read. The. Code.

kwindla 2 days ago||

Yes, but also ... the analogy to assembly is pretty good. We're moving pretty quickly towards a world where we will almost never read the code.

You may read all the assembly that your compiler produces. (Which, awesome! Sounds like you have a fun job.) But I don't. I know how to read assembly and occasionally do it. But I do it rarely enough that I have to re-learn a bunch of stuff to solve the hairy bug or learn the interesting system-level thing that I'm trying to track down if I'm reading the output of the compiler. And mostly even when I have a bug down at the level where reading assembly might help, I'm using other tools at one or two removes to understand the code at that level.

I think it's pretty clear that "reading the code" is going to go the way of reading compiler output. And quite quickly. Even for critical production systems. LLMs are getting better at writing code very fast, and there's no obvious reason we'll hit a ceiling on that progress any time soon.

In a world where the LLMs are not just pretty good at writing some kinds of code, but very good at writing almost all kinds of code, it will be the same kind of waste of time to read source code as it is, today, to read assembly code.

gombosg 2 days ago|||

I think this analogy to assembly is flawed.

Compilers predictably transform one kind of programming language code to CPU (or VM) instructions. Transpilers predictably transform one kind of programming language to another.

We introduced various instruction architectures, compiler flags, reproducible builds, checksums exactly to make sure that whatever build artifact that's produced is super predictable and dependable.

That reproducibility is how we can trust our software and that's why we don't need to care about assembly (or JVM etc.) specifics 99% of the time. (Heck, I'm not familiar with most of it.)

Same goes for libraries and frameworks. We can trust their abstractions because someone put years or decades into developing, testing and maintaining them and the community has audited them if they are open-source.

It takes a whole lot of hand-waving to traverse from this point to LLMs - which are stochastic by nature - transforming natural language instructions (even if you call it "specs", it's fundamentally still a text prompt!) to dependable code "that you don't need to read" i.e. a black box.

wtetzner 2 days ago||||

The analogy to assembly is wrong. Even in a high level language, you can read the code and reason about what it does.

What's the equivalent for an LLM? The string of prompts that non-deterministically generates code?

Also, if LLM output is analogous to assembly, then why is that what we're checking in to our source control?

LLMs don't seem to solve any of the problems I had before LLMs existed. I never worried about being able to generate a bunch of code quickly. The problem that needs to be solved is how to better write code that can be understood, and easily modified, with a high degree of confidence that it's correct, performs well, etc. Using LLMs for programming seems to do the opposite.

sho_hn 2 days ago|||

I think it's the performative aspects that are grating, though. You're right that even many systems programmers only look at the generated assembly occasionally, but at least most of them have the good sense to respect the deeper knowledge of mechanism that is to be found there, and many strive to know more eventually. Totally orthogonal to whether writing assembly at scale is sensible practice or not.

But with the AI tools we're not yet at the wave of "sometimes it's good to read the code" virtue signaling blog posts that will make front page next year or so, and still at the "I'm the new hot shit because I don't read code" moment, which is all a bit hard to take.

sjsisjsh 2 days ago|||

I think a lot of AI bros are sleeping on quality. Prior startup wisdom was “move fast and break things”. Speed is ubiquitous now. Relatively anyone can vibe code a buggy solution that works for their happy path. If that’s the bar, why would I pay for your jank solution when I can make my own tailored to my exact needs? Going fast is a race to the bottom in the long run.

What’s worth paying for is something that is trustworthy.

Claude code is a perfect example: They blocked tools like opencode because they know quality is the only moat, and they don’t currently have it.

straydusk 2 days ago||

I mean, fair enough. Obviously there are different levels of criticality in any production environment. I'm building consumer products and internal tools, not safety-critical systems.

Even in those environments, I'd argue that AI coding can offer a lot in terms of verification & automated testing. However, I'd probably agree, in high-stakes safety environments, it's more of a 'yes and' than an either/or.

hollowturtle 2 days ago||

Hell I see the big banner picture hallucinated by a prompt and all I see is an unproductive mess. Won't comment on the takes the article makes they're just miserable

saadn92 2 days ago||

reason why I also ended up creating something like this: https://github.com/saadnvd1/aTerm

Sunscratch 2 days ago||

>Here’s the thing: I don’t read code anymore. I used to write code and read code. Now when something isn’t working, I don’t go look at the code. I don’t question the code. I either ask one of my coding agents, or - more often - I ask myself: what happened with my system? What can I improve about the inputs that led to that code being generated?

Good luck debugging any non trivial problem in such codebase

bonesss 1 day ago|

Not to mention data retention and upgrade management.

When an update script jacks up the guaranteed-to-be-robust vibed data setup in this first of a kind, one of a kind, singular installation… what then?

The pros have separate dev, test, QA, and prod environments. Immutable servers, NixOs, containers, git, and rollback options in orchestration frameworks. Why? Because uh-oh, oh-shit, say-what, no-you’re-kidding, oh-fuck, and oops are omnipresent.

MS Access was a great product with some scalability ceilings that took engineering to work past. MS Access solutions growing too big then imploding was a real concern that bit many departments. MS access was not dumping 15,000 LoC onto the laps of these non-developers and telling them they are hybrid spirit code warriors with next level hacking skills.

Ruby on Rails, Wordpress, SharePoint… there are legitimately better options out there for tiny-assed self-serving CRUD apps and cheap developer ecosystems. They’re not quite as fun, tho, and they don’t gas people up as well.

franze 2 days ago||

has someone figured out on how to set the codex app to yolo mode yet?

the constant asking drives me crazy

manmal 2 days ago||

There‘s a button looking like a shield, next to the voice dictation button.

CryptoBanker 2 days ago||

It's called the "Yeet" skill in the app

jwpapi 2 days ago||

I really wonder why nobody is talking about how it is more important to be able to test the code.

9/10 my ai generated code is bad before my verification layers 9/10 its good after.

Claude fights through your rules. And if you code in another language you could use other agents to verify code.

This is the challenge now, effectively verify the code. Whenever I end up with a bad response I ask myself what layers could i set to stop AI as early as possible.

Also things like namings, comments, tree traversal, context engineering, even data-structures, multi-agenting. I know it sounds like buzzword, but these are the topics a software-engineer really should think about. Everything else is frankly cope.

straydusk 2 days ago|

I think people attacking "don't read the code" are not considering the status quo - they're comparing to some perfect world where staff engineers are reading every line of code. That's not even close to happening. Test-driven development is something most engineers just won't put up with... AI's will do it, no problem. If I can automate ten different checks for every commit, is my code really getting looked at less?

sjsisjsh 2 days ago||

[dead]

timhh 2 days ago|

Not really what "shift left" means...