Top
Best
New

Posted by tmaly 1 day ago

Ask HN: How are you doing RAG locally?

I am curious how people are doing RAG locally with minimal dependencies for internal code or complex documents?

Are you using a vector database, some type of semantic search, a knowledge graph, a hypergraph?

367 points | 146 commentspage 7
mooball 21 hours ago|
i thought rag/embeddings were dead with the large context windows. thats what i get for listening to chatgpt.
jacekm 1 day ago||
I am curious what are you using local RAG for?
ramesh31 1 day ago||
SQLite with FTS5
sinandrei 1 day ago||
Anyone use these approaches with academic pdfs?
urschrei 1 day ago||
Another approach is to teach Claude Code how to use your Zotero library's full-text search: https://github.com/urschrei/zotero_search_skill.
alansaber 22 hours ago|||
I've not seen any impressive products. But products do exist ie https://scibite.com/solutions/semantic-search/
amelius 1 day ago||
Anyone using them for electronics datasheets?
bradfa 23 hours ago||
I would like to. I haven't yet found a solution that works well.

The problems with datasheets is tables which span multiple pages, embedded images for diagrams and plots, they're generally PDFs, and only sometimes are they 2-column layout.

Converting from PDF to markdown while retaining tables correctly seems to work well for me with Mistral's latest OCR model, but this isn't an open model. Using docling with different models has produced much worse results.

sosojustdo 23 hours ago||
I've been working on a tool specifically to handle these messy PDF-to-Markdown conversions because I ran into the same issues with tables and multi-column layouts.

I’ve optimized https://markdownconverter.pro/pdf-to-markdown to handle complex PDFs, including those tricky tables that span multiple pages and 2-column formats that usually trip up tools like Docling. It also extracts embedded diagrams/images and links them properly in the output.

Full disclosure: I'm the developer behind it. I’d love to see if it handles your specific datasheets better than the models you've tried. Feel free to give it a spin!

bradfa 22 hours ago||
Cool! But given that often electronics documentation is covered by NDAs, my preferred solution is local-first if at all possible.
baalimago 1 day ago||
I thought that context building via tooling was shown to be more effective than rag in practically every way?

Question being: WHY would I be doing RAG locally?

petesergeant 1 day ago|
For code, maybe? For documents, no, text embeddings are magical alien technology.
Strift 1 day ago||
I just use a web server and a search engine.

TL;DR: - chunk files, index chunks - vector/hybrid search over the index - node app to handle requests (was the quickest to implement, LLMs understand OpenAPI well)

I wrote about it here: https://laurentcazanove.com/blog/obsidian-rag-api

jeanloolz 1 day ago||
Sqlite-vec
petesergeant 1 day ago|
I’ve got it deployed in production for a dataset that changes infrequently and it works really well
electroglyph 1 day ago||
simple lil setup with qdrant
turnsout 21 hours ago||
The Claude Code model highlights the power of simple search (grep) and selective reads (only reading in excerpts). The only time I vectorize is when I explicitly want to similarity-based searching, but that's actually pretty rare.
__mharrison__ 21 hours ago|
Grep (rg)
More comments...