Posted by Anon84 5 hours ago
> We evaluate on a 116-question representative subset of the LongMemEval benchmark (Wu et al., 2025), which tests an agent’s ability to answer questions over long conversations spanning multiple sessions.
grep’s design is surprisingly winning, exceeding expectations to this day.
pretty fast and neat project to search code interactively with a lot of optimizations on finding the right thing
I wrote about it[1] and came away with a different view on both Palantir and the future of agentic workflows personally.
[1] sorry, LinkedIn: https://www.linkedin.com/pulse/fund-managements-killer-app-d...
- regex (grep) - hybrid search (bm25+vector)
this X vs Y is uninteresting when the answer can be both.
What do you mean by this? Do you mean not automatically build the index?
https://github.com/gitsense/gsc-cli
`gsc grep` is just an alias for `gsc rg`, mostly because agents are much more likely to reach for “grep” than “rg”.
It works pretty well, but it is not a perfect drop-in replacement. `grep` and `ripgrep` differ in a few details, especially around glob/wildcard behaviour and flags. What I found works is to not use `grep` in search examples, and have the CLI spit out an error message for the AI saying this is `ripgrep`, so it needs to use `ripgrep` syntax.
https://github.com/Genivia/ugrep#aliases
Claude Code may ship with ugrep already.
It depends on if it is using Grep the harness tool or Grep from the bash tool
If you'd told me a decade ago I'd finally learn some sed in 26 because I'd want to understand what the AI was doing I'd have told you you were crazy . . .
I'm currently working on a markdown kb / search tool for my agents, in part built on TS