Agents need control flow, not more prompts

Posted by bsuh 1 day ago

Agents need control flow, not more prompts(bsuh.bearblog.dev)

557 points | 270 commentspage 8

graphememes 19 hours ago|

> If you’ve ever resorted to MANDATORY or DO NOT SKIP, you’ve hit the ceiling of prompting.

using this is going to do the opposite of what you want

jarboot 22 hours ago||

I think this is a good usecase for temporal + pydantic-ai

morpheos137 4 hours ago||

It speaks to how dumbed down the human userbase has become that these kind of articles are even presented as insightful. "Agents" are not intelligent. they are pattern extrapolators. If you want a reliable deterministic output you need a deterministic harness. Think of agents as a montecarlo sampling tool. The harness defines the result over noise. it is hilarious to me the industry is going head long into more "intelligent" agents while ignoring that intelligence is an adaptation to constraints not some magical abstract general thing that just appears and can do useful work. AGI is a lie. Stocastic parrots + harness is a useful tool.

cesarvarela 22 hours ago||

This will remain a persistent problem without a definitive answer until models move from generative tools to actual AI.

aykutseker 1 day ago||

all caps in a prompt is a code smell. when you're typing MANDATORY, you should be writing a wrapper, not refining the prose.

Nizoss 21 hours ago|

Exactly! I have said this a couple of times but it was taken literally as in no capital letters or strong language. Glad to see someone else who shares this perspective.

2001zhaozhao 21 hours ago||

If we need control flows, then designing these control flows ought to be the future of agent engineering

ModernMech 1 day ago||

Slowly and surely we are replacing AI with programming languages.

mhotchen 21 hours ago||

HUMANS need control flow. It's a very effective strategy that has worked wonders in healthcare

stonewizard 21 hours ago|

[dead]

dnautics 23 hours ago||

Yes. Humans are also unreliable and nondeterministic (though certainly more reliable). Accordingly we have built software dev practices around this. I imagine it would be super useful for example to have a "TDD enforcer":

Phase 1: only test files may be altered, exactly one new test failure must appear.

Phase 2: only code files may be altered. The phase is cleared when the test now succeeds and no other tests fail.

If you get stuck, bail and ask for guidance

ManWith2Plans 23 hours ago|

I've been busy building and dogfooding open-artisan for my own development purposes. I've diverged quite a bit from main and am hoping to merge some of those changes back soon. It's basically an OpenCode plugin that forces open-code token-hungry state machine that tries to map the engineering process I follow, exposing only valid tools and states at every step of development. If you're interested, in following along or trying it out, it's available here:

https://github.com/yehudacohen/open-artisan/

Hopefully, I'll merge in my large structural changes in the next couple of weeks. These structural changes will enhance the state machine meaningfully, as well as adding support for hermes agenet.

sidcool 10 hours ago|

How does one achieve this?

philipp-gayret 10 hours ago|

Native integrations with agents, i.e. Claude Code's system of Hooks.

Harnesses, which kick off agents with what to do.

Tools, which show an agent where in a process it is, and what the next step should be.

In my experience I find Hooks to be extremely powerful cross-project. CLI Tools are easy to make also, and work really well for guiding agents.

More comments...