Posted by mfiguiere 14 hours ago
Resuming it cost 5% of the current session and 1% of the weekly session on a max subscription.
But right now it seems like, in the case of (3), these systems are really sensitive and unpredictable. I'd characterize that as an alignment problem, too.
At the same time, personally I find prioritizing quality over quantity of output to be a better personal strategy. Ten partially buggy features really aren't as good as three quality ones.
You have too many and the wrong benchmarks
LLMs over edit and it's a known problem.