Ask HN: How are you doing RAG locally?

Posted by tmaly 1/14/2026

I am curious how people are doing RAG locally with minimal dependencies for internal code or complex documents?

Are you using a vector database, some type of semantic search, a knowledge graph, a hypergraph?

413 points | 157 commentspage 5

motakuk 1/14/2026|

LightRAG, Archestra as a UI with LightRAG mcp

codebolt 1/15/2026||

Giving the LLM tools with an OData query interface has worked well for me. In C# it's pretty trivial to set up an MCP server with OData querying for an arbitrary data model. At work we have an Excel sheet with 40k rows which the LLM was able to quickly and reliably analyse using this method.

Bombthecat 1/15/2026||

AnythingLLM for documents, amazing tool!

robotswantdata 1/15/2026||

You don’t need a vector database or graph, it really depends on your existing infrastructure , file types and needs.

The newer “agent” search approach can just query a file system or api. It’s slightly slower but easier to setup and maintain as no extra infrastructure.

turnsout 1/15/2026||

The Claude Code model highlights the power of simple search (grep) and selective reads (only reading in excerpts). The only time I vectorize is when I explicitly want to similarity-based searching, but that's actually pretty rare.

softwaredoug 1/15/2026||

I built a Pandas extension SearchArray, I just use that (plus in memory embeddings) for any toy thing

https://github.com/softwaredoug/searcharray

lsb 1/15/2026||

I'm using Sonnet with 1M Context Window at work, just stuffing everything in a window (it works fine for now), and I'm hoping to investigate Recursive Language Models with DSPy when I'm using local models with Ollama

dvorka 1/15/2026||

Any suggestion what to use as embeddings model runtime and semantic search in C++?

SamLeBarbare 1/15/2026||

sqlite + FTS + sqlite-vec + local LLM for reranking results (reasoning model)

yandrypozo 1/15/2026|

this's pretty cool, which LLM are you using currently?

claylyons 1/15/2026|

Has anyone tried this? https://aws.amazon.com/s3/features/vectors/

More comments...