Posted by david927 3 days ago
Ask HN: What Are You Working On? (March 2026)
It also comes with nice features and benchmarking abilities. For running evals, it has a companion called Calibra https://calibra.swival.dev
Published a demo/experiment under MalleableTodo [1] - and so far I've seen some pretty strange use cases...
Essentially, it just lets each user use an LLM to rewrite their own UI to add features/customisation.
[0]: https://aibenchy.com
I got it all done in probably an hour or two, but in 10-15 minute blocks spread over many days.
Very WIP and no docs, but I hope it will be helpful someday.
Under the hood it uses a cool legal reasoning agent primarily designed for understanding litigation claims and objectives.
Sometimes I do wish I had a Slack channel of like 30 attorneys so I could ask them questions and get feedback.
Now at 350k lines. Native and wasm binaries (you can try the limited wasm version online). Currently adding a full CPython test suite benchmark.
Just for fun, not trying to replace CPython here. Mainly to test the limits of current coding agents.