I need a version of this which swears loudly when an assumption it made turns out to be wrong, with the volume/passion/verbosity correlated with how many tokens it's burned on the incorrect approach.
shivaniShimpi_ 51 minutes ago|
i didnt realize i needed the volume scaling with tokens burned as much as i do now xD
imagine the screaming when it confidently refactors something for 40k tokens and then finds out the thing it deleted was load bearing
ben30 1 minute ago||
I have in my agents file “Chesterton’s fence” as pointer to think carefully before you remove something
AndreVitorio 1 hour ago||
This desperately needs a demo video in the repo.
shivaniShimpi_ 50 minutes ago|
hear hear!!!
8-prime 1 hour ago||
Does this actually relate to the code quality being observed by the agent? The readme isn't very clear on that IMO. I have some projects I'd love to try this out on, but only if I am to get an accurate representation of the LLMs suffering.