Posted by mfiguiere 3 days ago
i can't believe it
and i can't believe nobody else is complaining
my simulation is definitely on very hard mode
Anecdotal experience: asked it to change instances of a C++ class Foo to a compatible one Bar, it did that but failed to add the required include where it made the change.
Yes, I'm sure that with enough prompting/hand-holding it could do this fine. Is it too much to expect basics like this out of the box, though? If so, then I, for one, still can't relate to the current level of enthusiasm.
There are other CLI coding agents like Aider (https://aider.chat/) that will let you point at any model. The problem is that local models are dramatically worse than these big hosted models — I have not seen anyone claim they are good enough for these kind of tools. See https://aider.chat/docs/leaderboards/
I mean I kind of get it, but it does seem like they are almost penalizing people who could code in the browser with the canvas feature but prefer to use a terminal.
Do I have that right?
I've been working on something related—Plandex[1], an open source AI coding agent that is particularly focused on large projects and complex tasks.
I launched the v2 a few weeks ago and it is now running well. In terms of how to place it in the landscape, it’s more agentic than aider, more configurable and tightly controlled than Devin, and more provider-agnostic/multi-provider/open source than Claude Code or this new competitor from OpenAI.
I’m still working on getting the very latest models integrated. Gemini Pro 2.5 and these new OpenAI models will be integrated into the defaults by the end of the week I hope. Current default model pack is a mix of Sonnet 3.7, o3-mini with various levels of reasoning effort, and Gemini 1.5 Pro for large context planning. Currently by default, it supports 2M tokens of context directly and can index and work with massive projects of 20M tokens and beyond.
Very interested to hear HN’s thoughts and feedback if anyone wants to try it. I'd also welcome honest comparisons to alternatives, including Codex CLI. I’m planning a Show HN within the next few days.
^^ could I put in my free Gemini key so as to use Gemini Pro 2.5 ? I'm a bit beginner with everything around BYOB. Thanks..
I really appreciate your willingness to try Plandex and look forward to hearing your feedback! You can ping me in the Plandex Discord if you want—would love to get more HNers in the channel: https://discord.com/invite/plandex-ai
Or email me: dane@plandex.ai
> and didn't mean to to try to distract from their launch.
you should, that is the biggest reason I upvoted.
edit: And I can't say I disagree.
If the community considers that astroturfing, we have completely lost the plot what building is.
This is perhaps too cynical for this specific instance, but it's not overly cynical more broadly. Considering users of the site have to evaluate many of these offerings frequently, I don't blame them for having a negative gut reaction.