DeepSeek reasonix, DeepSeek native coding agent with high caching and low cost

Posted by Alifatisk 8 hours ago

DeepSeek reasonix, DeepSeek native coding agent with high caching and low cost(esengine.github.io)

Related ongoing thread:

DeepSeek makes the V4 Pro price discount permanent - https://news.ycombinator.com/item?id=48237663 - May 2026 (384 comments)

327 points | 162 commentspage 4

hmokiguess 5 hours ago|

Click on the download page, it's hilarious. It has a lot of information about the "smart probe" on the download and it's a realtime probe you can rerun.

That's the pinnacle of AI slop over engineered garbage in my opinion. All of that information is noise.

pkulak 5 hours ago||

Doesn't Pi Agent do exactly this? Assuming "append only" means they do some kind of compaction as well.

quotemstr 6 hours ago||

> no reordering, no marker-based compaction

Is this really the behavior you want? Yes, doing tool-result clearing and such will blow your cache, but if you do it only occasionally, it's still likely a win. Yes, cache hits are good, but not so good that it's okay to be profligate with context to preserve those precious, precious KVs.

Hfuffzehn 5 hours ago||

This is really tickling the conspiracy theorist part of my brain.

"Independent open-source project · not affiliated with DeepSeek" "Reasonix only targets DeepSeek because..." "Why DeepSeek only? Can I swap to Claude / GPT? It's a design choice, not a limitation"

The lady doth protest too much, methinks?

Nicely timed shortly after the making the rebate permanent anouncement.

Could just be Chinese devs trying to help western devs with some software and a western facing marketing campaign to raise awareness. Could be DeepSeek astroturfing. Could be "someone" in China trying to get more access to western data.

Who knows?

andai 5 hours ago||

But Claude made the website?

Alifatisk 4 hours ago|

What conclusion are you drawing from that?

andai 2 hours ago||

If Deepseek can't even make a static site, why would I want to use it for anything else? (Not saying it can't, just that it's a weird choice to present your Deepseek-oriented product.)

Alifatisk 42 minutes ago||

I see your point, but as we know, devs from Google and OpenAI regularly use Claude Code because of its edge in frontend. I think using another model to build your own thing is a pragmatic engineering decision, not a sign of failure.

am17an 5 hours ago||

This Claude front end skill is now soon to be slop.

auggierose 4 hours ago||

Oh, I was wondering why all new websites look shitty in the same way.

ricardobeat 5 hours ago||

Already is. Every new website looks exactly the same.

sergiotapia 7 hours ago||

What AI model did you use for the website design? This is the second one I see with the exact same font and color scheme. Just curious because Claude models lean towards purples for example. Thank you!

pcwelder 6 hours ago||

Opus 4.7 selects such palette and motifs by default. Might even be first iteration of claude design.

franga2000 7 hours ago|||

This design still screams Claude to me, but a newer version than what you're thinking of. At some point they added a markdown file that tells it to use obviously AI designs like lots of blue/purple and gradients. Since then, this is its new style.

sheepscreek 6 hours ago|||

DeepSeek v4 perhaps?

FergusArgyll 6 hours ago||

Frontend design skill by Anthropic specifically says not to use purple. I'd be surprised if it still uses purple. Have you seen that recently?

canadiantim 7 hours ago||

So what's best low cost coding agent these days? Kimi 2.6? Qwen's latest closed model? Composer 2.5? DeepSeek?

bwfan123 7 hours ago||

In my experience, it is claude-code paired with deepseek-v4. For penny-pinchers like me, I can have long coding sessions with it with no anxiety about the cost. Also, prompting it to what you want and verifying the outputs is more important than the quality of the model. So, I am better off with a cheaper model and taking the responsibility for prompting it and verifying the results.

esperent 6 hours ago|||

It's obviously much cheaper paying by the token but how does it compare to a codex subscription on cost?

epolanski 7 hours ago|||

Can you quantify the actual costs in a week and the use you make?

wongarsu 6 hours ago||

Not GP, but for my use I'd estimate $0.10-0.30 per hour of use per agent with DeepSeek v4 Pro

passive 7 hours ago|||

I've gone through ~600m tokens in Xiaomi Mimo though Claude, and it's been the most effective use of an agent I've had yet. It's very capable, but generally not ambitious, picking simple but effective solutions to most problems I give it. Going to write something longer about the experience when I get to a billion tokens.

Alifatisk 6 hours ago|||

I do have my eyes on the coding plan, which is quite generous.

https://mimo.mi.com

gandreani 7 hours ago|||

Are you using Mimo 2.5 pro?

passive 6 hours ago||

Yes. I tried a couple of weeks with non-Pro, and it was pretty good, but I had too many spare tokens, so I switched back to Pro. :)

throw10920 3 hours ago|||

Cursor with Composer 2.5 seems to be competitive with frontier models (Opus and GPT-5.5) for a significant price discount. Benchmarks are gamed, as always, but $0.55/task vs $11.02 a task definitely indicates that there's some cost advantage.

https://cursor.com/evals

ac29 7 hours ago|||

Kimi 2.6 is great. Qwen3.7-max benchmarks similarly but I havent used it yet

skeledrew 7 hours ago|||

Seems to be DeepSeek.

https://news.ycombinator.com/item?id=48237663

abalashov 5 hours ago|||

Although I have little interest in agentic coding, when I do use it, I have found Kimi K2.6 to give Opus-quality output, and have switched entirely to it for pretty much everything.

throw10920 5 hours ago||

I've used Opus extensively and tried K2.6 on a few projects, and the gap is huge. K2.6 is nowhere near the performance of Opus. That's fine because it's also far cheaper, but public benchmarks line up with my own personal experience that they aren't comparable in terms of intelligence.

(that is, different places on the Pareto efficiency graph)

abalashov 2 hours ago||

No two uses are alike, I suppose. For me, whatever difference is a wash. However, I probably tend to shy away from throwing high-complexity/long-horizon tasks at the model.

stavros 7 hours ago|||

For me, it's by far Deepseek. It's many times cheaper than competitors, and about as good as Sonnet 4.6.

fouric 5 hours ago||

I'd generally agree about Deepseek being as good as Sonnet - but I have extreme trouble with prompt compliance with V4 Pro in a way that I've never had with Sonnet. I'll tell it "find the bug, but don't fix it" or "please use this tool I just developed" and it'll ignore me a high fraction of the time.

It's bad enough that I'm working on guardrails at the harness level because prompting appears to be useless.

Do you have the same issue?

stavros 5 hours ago||

I have Opus make a fairly detailed plan, then Deepseek implements, and GPT reviews. With that setup, I have zero issues, probably because what you mention is handled (the plan keeps it on track and the reviewer catches any issues).

Now that you mention it, though, I have seen it do a few things that weren't in the plan. The reviewer caught them, though, so they didn't cause a problem, and it's so cheap that overall it's a massive improvement.

lostmsu 7 hours ago||

Just use codex with 5.5 on low reasoning levels

ankitwarbhe 4 hours ago||

you created it yourself ?

Alifatisk 4 hours ago|

No.

More comments...