Posted by strongpigeon 4 hours ago
And that includes usage of the API with any agent without risking being banned. OpenAI is also very supportive of open source software.
I've been using GPT-5.4 with Swival (https://swival.dev) for a while, alongside local models, and it's absolutely fantastic.
The population on Hacker News is heavily skewed towards tech workers, so I wouldn't draw a conclusion from that.
I wouldn't mistake this for any kind of capability plateau. There is a massive push towards making transformers the engine of humanoid (and other kinds of) robotics; we just haven't reached the hype moment for those yet.
Problem is that the fuel to get this train going relies on investors' money. Investors aren't going to be happy with the quote I took from your message.
And that's the real bet: can the industry turn the spark into fire before the investor money runs out?
And plenty of very wealthy folks see the writing on the wall wrt robotics.
And everyone serious uses the API rate billing anyway.
This myth about the inferiority of ChatGPT and Codex is becoming a meme.
I have active subscriptions to both. I am throwing all kinds of data engineering, web development and machine learning problems at Codex, and have been working on non-tech tasks in the "Karpathy Obsidian Wiki" [1] style since before he posted about it.
Not only does Codex crush Claude on cost, it's also significantly better at adherence and overall quality. Claude is there on my Mac, gathering dust, to the point I am thinking of not renewing the sub.
There are plenty of fellow HNers here who feel the same from what I read in the flamewars. I suspect none of us really has a horse in this race and many are half-competent (in other threads, they mention they do things like embedded programming, distributed DL systems, etc.)
I'm starting to suspect a vast majority of people pushing the narrative that Claude is vastly better haven't even tried the 5.3 / 5.4 models and are doing it out of sheer tribalism.
[1] https://gist.github.com/karpathy/442a6bf555914893e9891c11519...
Codex is closer to my taste, as it is at least a native app and not TypeScript slop. But the model is just not up to snuff.
Grok makes sense if you want something less censored that is not biased towards woke ideology.
I don't see how this matters for coding though. I only use it to give me a summary of recent news (so I don't have to actually read the bs newspapers and X posts myself).