The Codex App - Hacker News

Posted by meetpateltech 13 hours ago

600 points | 415 commentspage 7

poolnoodle 12 hours ago||

These paid offerings geared toward software development must be a hell of a lot "smarter" than the regular chatbots. The amount of nonsense and bad or outright wrong code Gemini and ChatGPT throw at me lately is off the charts. I feel like they are getting dumber.

ghosty141 10 hours ago||

Yes they are, the fact that the agents have full access to your local project files makes a gigantic difference.

They do *very* well at things like: "Explain what this class does" or "Find the biggest pain points of the project architecture".

No comparison to regular ChatGPT when it comes to software development. I suggest trying it out, and not by saying "implement game" but rather try it by giving it clear scoped tasks where the AI doesn't have to think or abstract/generalize. So as some kind of code-monkey.

zitterbewegung 11 hours ago|||

I don’t understand why we are getting these software products that want to have vendor lock in when the underlying system isn’t being improved. I prefer Claude code right now because it’s a better product . Gemini just has a weird context window that poisons the rest of the code generated (when online) ChatGPT Codex vs Claude I feel that Claude is a better product and I don’t use enough tokens to for Claude Pro at $100 and just have a regular ChatGPT subscription for productivity tasks .

nkohari 11 hours ago||

> I don’t understand why we are getting these software products that want to have vendor lock in when the underlying system isn’t being improved.

I think it's clear now that the pace of model improvements is asymptotic (or at least it's reached a local maxima) and the model itself provides no moat. (Every few weeks last year, the perception of "the best model" changed, based on basically nothing other than random vibes and hearsay.)

As a result, the labs are starting to focus on vertical integration (that is, building up the product stack) to deepen their moat.

anematode 11 hours ago||

> I think it's clear now that the pace of model improvements is asymptotic

As much as I wish it were, I don't think this is clear at all... it's only been a couple months since Opus 4.5, after all, which many developers state was a major change compared to previous models.

nkohari 11 hours ago||

Like I said, lots of vibes and hearsay! :)

The models are definitely continuing to improve; it's more of a question of whether we're reaching diminishing returns. It might make sense to spend $X billion to train a new model that's 100% better, but it makes much less sense to spend $X0 billion to train a new model that's 10% better. (Numbers all made up, obviously.)

mceachen 11 hours ago||

It’s the inconsistency that gets me. Very similar tasks, similar complexity, same code base, same prompting:

Session A knocks it out of the park. Chef’s kiss.

Session B just does some random vandalism.

karmasimida 10 hours ago||

Not to rain on the parade, but this app feels to me ... unpolished. Some of the options in the demo feels less thought out and just put together.

I will try it out, but is this just me, or product/UX side of recent OpenAI products are sort of ... skipped over? It is good that agents help ship software quickly, but please no half-baked stuff like Altas 2.0 again ...

isodev 10 hours ago||

I don’t get why they announce it as a “Mac app” when the UI looks and feels nothing like a Mac app. Also electron… again.

Why not flex some of those codex skills for a proper native app…

tartoran 9 hours ago||

What else do you expect from vibecoding? Even the announcement for this app is LLM generated.

karmasimida 6 hours ago||

This is true. The font and animation feels basic to me, even as a programmer focused app

_rwo 11 hours ago||

seems like I need to update my toolset for the 3rd time this week

SunshineTheCat 13 hours ago||

This does look like it would simplify some aspects of using Codex on Mac, however, when I first saw the headline I thought this was going to be a phone app. And that started running a whole list of ideas through my brain... :(

But overall, looks very nice and I'm looking forward to giving it a try.

tomashubelbauer 13 hours ago|

I don't know why any frontier model lab can't ship a mobile app that doesn't use a cloud VM but is able to connect to your laptop/server and work against local files on there when on the same network (e.g.: on TailScale). Or even better act as a remote control for a harness running on that remote device, so you couldn't seamlessly switch between phone and laptop/server.

kzahel 10 hours ago||

I'm also so baffled by this. I had to write my own app to be able to do seamless handoff between my laptop/desktop/phone and it works for me (https://github.com/kzahel/yepanywhere - nice web interface for claude using their SDK, MIT, E2E relay included, no tailscale required) but I'm so baffled why this isn't first priority. Why all these desktop apps?

indigodaddy 5 hours ago||

This looks awesome! And incredibly polished. Exactly the approach I take to vibebin-- I may have to integrate yep anywhere into it (if that's ok) as an additional webui!

https://github.com/jgbrwn/vibebin

Although I would need it to listen on 0.0.0.0 instead of localhost because I use LXC containers so caddy on the host proxies to the container 10.x address. Hopefully yep has a startup flag for that. I saw that you can specify the port but didn't see listening address mentioned.

kzahel 1 hour ago||

Cool! Your project sounds really interesting. I would love to try it out, especially if you integrated yep! Yes it has yepanywhere --host 0.0.0.0 or you can use HOST env var.

ngrilly 12 hours ago||

Currently using opencode with Codex 5.2 and wondering why I should switch.

kblissett 12 hours ago||

Does this support users who access Codex via Azure OpenAI API keys?

asdev 13 hours ago||

Built an open source lightweight version of this that works with any cli agent: https://github.com/built-by-as/FleetCode

wpm 11 hours ago|

Maybe I'm just not getting it, but I just don't give a flying fuck about any of this crap.

Like, seriously, this is the grand new vision of using a computer, this is the interface to these LLMs we're settling on? This is the best we could come up with? Having an army of chatbots chatting to each other running basic build commands in a terminal while we what? Supervise them? Yell at them? When am I getting manager pay bumps then?

Sorry. I'll stick with occasionally chatting with one of these things in a sandboxed web browser on a single difficult problem I'm having. I just don't see literally any value in using them this way. More power to the rest of you.

wendgeabos 11 hours ago|

OK.

More comments...