Top
Best
New

Posted by azath92 3 days ago

Show HN: HomeBrew HN – Generate personal context for content ranking(www.hackernews.coffee)
TLDR: Build a quick HN profile to see how little context LLMs need to personalise your feed. Rate 30 posts once, get a permanent ranked homepage you can return to.

Our goal was to build a tool that allowed us to test a range of "personal contexts" on a very focused everyday use case for us, reading HN!

We are exploring use of personal context with LLMs, specifically what kind of data, how much, and with how much additional effort on the user’s part was needed to get decent results. The test tool was a bit of fun on its own so we re-skinned it and decided to post it here.

First time posting anything on HN but folks at work encouraged me to drop a link. Keen on feedback or other interesting projects thinking about bootstrapping personal context for LLM workflows!

122 points | 48 commentspage 2
bstsb 3 days ago|
i found the "personal profile" that it created almost more interesting than the actual feed itself. from quite a small sample of posts it had mapped and summarised my interests really well.

i think the bit that needs the most work is classifying each post on the home page; quite a lot of posts that i would mark as "Dive" given its own classification of me ended up as "Skim".

azath92 3 days ago||
Yes us to! I spend heaps of time playing with the link between preferences and the profile.

We aren't really sure yet how best to surface _why_ the model predicts what it does. You can hover over the skim label and there is a bit of reasoning text, which might shed some light on why for now. We will think more about how to make these relationships more clear in the process of tightening them up and generally improving them.

Once the relationships are a bit more clear theres probably the 80/20 rule of work to tighten up those predictions.

cropcirclbureau 3 days ago||
Are you referring to what's in the textarea on the edit profile text dialog? Seems to be a simple concatenation of the titles according to how I tagged them. If it's indeed this what you're referring to, what did you find interesting about this?
Gracana 2 days ago||
Mine has this analysis at the top, which I found interesting:

## Analysis of user's tech interest: The user shows a strong interest in foundational computing concepts, historical perspectives on technology, and cutting-edge advancements in AI/ML, particularly those related to model architecture and efficiency. They are also drawn to low-level programming, system design, and hardware. Conversely, they seem less interested in business/startup narratives, general data manipulation tools, and consumer-oriented tech news unless it has a deep technical underpinning.

mdrzn 3 days ago||
Very interesting, but like others suggested I'd like for it to use my upvoted submissions and comments to build a profile about me.
peterm4 2 days ago||
I love this. I’ve always considered doing something similar with a traditional recsys model.

The only feature I’d love to see, is there are many posts where I’m more interested in the HN comments, rather than the articles themselves. It would be great to see this incorporated somehow.

Awesome work though. Will bookmark!

azath92 2 days ago|
thats grand, thank you! we really want to look at both using comments as a source for preference, and applying your preferences to comments to perhaps flag/sort/filter once you are on a page. Will have to think about the user flow, and the way in which preferences differ for comments vs posts
flexagoon 2 days ago||
It tried to figure out my interests based off my answers. Little does it know that I'm actually just interested in anything that has a catchy/funny title.
azath92 2 days ago|
hahaha thats a great test case, gona remember that for evals. Did it notice anything? or did it just get confused. we found if you have random preferences it often just says to skim half and skim half, which i suspect it might do in your case.

It _in theory_ should try and pick up a content style (funny stuff??) even if the tech is seemingly random, but i wouldn't be surprised if it just failed.

foruhar 2 days ago||
Nifty! Is there a github or source for this?
azath92 2 days ago|
Sadly not! It lives in our FE monorepo at work, it just kind of jumped out at us as something to split out and demo once we started using it to test some of our ideas. Its something wed consider for sure, but for now the lack of an open repo is kind of a bit of tech debt in a way. Easier to dev in our monorepo to get something out fast.
mebazaa 2 days ago||
Did you consider using more “traditional” recommendation systems? (and maybe using LLMs to create synthetic preferences…)
azath92 2 days ago|
we came to this by looking at how a "user profile" in plain english could be both used and generated by LLMs, but once we were looking at this we did discuss traditional recsys. Two things against it for this usecase: bootstrapping preferences with a low number of data points, and no "unified" storage of all users preferences or pre-existing dataset is difficult with trad ML or statistical methods. Also having your preferences or "model" if you will in plain english gives a sense of agency, transparency and individuality to your recomendataions that are at least difficult, if not impossible to communicate with other types of models.

Id love to have those assumptions challenged though, if there are examples you could point me towards.

simongray 2 days ago||
Having to rate the 30 examples made me realise just how much HN is dominated by LLM content these days. Kinda sad.
azath92 2 days ago|
I genuinely find that interesting to hear. what about the 30 examples felt different to say the frontpage? (assuming it did)

On a meta level i was suuuuper conscious of writing every word of this post/comments myself, as my prior is that HN's community is very intollerant of and highly sensitive to low effort content, whether via AI or not. This is despite using AI tools for lots of other parts of work (drafting, coding, summarising, brainstorming etc).

Do you think HN has become more accepting of AI slop, the slop is becoming harder to detect, or isnt as discerning as i assume?

simongray 2 days ago||
I'm not talking about the content but about the topics.
pvg 3 days ago|
Somewhat tangential but a oddly under-explored side of the LLMs+HN data projects - HN search that does a good job finding HN submission/comment results for the type of things often asked in Ask HN that can be answered by HN searches. 'How do I learn about [some nice thing or another]', etc. People asking this sort of thing often don't know the right keywords so pure keyword search often doesn't work well but more latent-spacey things, in theory, could. Another related one is "can you LLMgenerate something akin to dang's 'Related' posts".
azath92 3 days ago||
hackernews is such a rich source of "non trivial" content, and we are really seeing this when using it for a project compared with just consuming it as an individual.

We are focusing right now on how comments could be used to build up a better user context, and your comment has made me think about how we can feed comments in (instead of just titles and urls) for your selected preferences to make a better profile, without needing to scrape anything (expensive and slow).

But i think that would only work because of their quality and relevance generally, which would for sure make it an interesting knowledge source for pure LLM search. Feels like something someone should build or maybe already has! keen to see if anyone knows of projects like that.

pvg 3 days ago||
Feels like something someone should build or maybe already has!

Yep, it very much feels like that but it doesn't seems to have happened yet. Even a not-entirely-quite-working yet attempt there could be an interesting thing/discussion.

azath92 3 days ago||
I was able to find this post, which is a little old in LLM land, but appears to be pretty functional! https://news.ycombinator.com/item?id=40238509

The post and comments include a bunch of other tools that feel similar, and the tool itself works.

Ill have to take some time to use it, but also see what i can learn about how they've consumed and used HN comments more generally.

pvg 3 days ago||
If only there was some way to find all the similar related previous submissions (there have been a few :)