Posted by turtlesoup 7 days ago

Show HN: Are You in the Weights? (www.intheweights.com)
With more traffic moving off-web and into LLMs, I got curious about what traces we leave "in the weights". My design partner and I built a site in the past few weeks that checks recognition across frontier and small models. It queries many of them in parallel, clusters the responses, and tells you how strongly they recognize you. Happy to answer any questions here!
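
A minimal sketch of that kind of pipeline (query in parallel, cluster, score), not the site's actual code. Everything here is an assumption made for illustration: the model names, the canned replies, the `ask` stub standing in for real API calls, the Jaccard word-overlap clustering, and the definition of "strength" as the share of models in the largest cluster.

```python
import asyncio
import re

# Hypothetical stand-ins for real model APIs; names and replies are made up.
CANNED = {
    "model-a": "Software engineer and open-source contributor",
    "model-b": "Software engineer known for open-source developer tools",
    "model-c": "I don't recognize this name",
}

async def ask(model: str, name: str) -> str:
    await asyncio.sleep(0)  # placeholder for a network call
    return CANNED[model]

def tokens(text: str) -> set[str]:
    # Lowercased alphabetic words, used as a crude similarity signal.
    return set(re.findall(r"[a-z]+", text.lower()))

def jaccard(a: set[str], b: set[str]) -> float:
    return len(a & b) / len(a | b) if a | b else 0.0

def cluster(replies: list[str], threshold: float = 0.3) -> list[list[str]]:
    """Greedy clustering: a reply joins the first cluster whose first member is similar enough."""
    clusters: list[list[str]] = []
    for reply in replies:
        for group in clusters:
            if jaccard(tokens(reply), tokens(group[0])) >= threshold:
                group.append(reply)
                break
        else:
            clusters.append([reply])
    return clusters

async def recognition(name: str) -> float:
    # Query every model concurrently, then score agreement (assumed metric).
    replies = await asyncio.gather(*(ask(m, name) for m in CANNED))
    largest = max(cluster(list(replies)), key=len)
    return len(largest) / len(replies)
```

Calling `asyncio.run(recognition("some name"))` returns a value between 0 and 1. A real scorer would also need to treat "I don't recognize this" replies as non-recognition rather than as a cluster of their own.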
470 points | 247 comments | page 13
zimpenfish 6 days ago|
Not terrible although I am somewhat insulted that QWEN3 8B hallucinated me as the chimp from Jimmy Neutron and no, MISTRAL 3.2 24B, I don't stream on Twitch.

Oh and KIMI K2 0905 completely hallucinated a real name for me (I don't work on Pygame!)

VarunMenon 7 days ago||
super cool!! I love the idea and the UI
hnarayanan 7 days ago||
I love this!
sph 7 days ago||
I get why they couldn’t slop pixel art Hitler, but why not Mandela?!
locusofself 7 days ago||
Yet another reminder that my wife is far more well known than I am
jubilanti 7 days ago||
PRIVACY WARNING: Every name/text entered into this site is publicly listed on the "latest" leaderboard which seems to paginate endlessly.
turtlesoup 7 days ago||
Just deployed a fix for this; removed latest and capped pagination.
bdieterm 7 days ago|||
It is still possible to download all entries via the API and sort them by timestamp. Removing the cursor data would be one way to mitigate this.

Currently there are a bit more than 43000 entries. As far as I have seen, only the results are stored. When I entered a random name, only a similar name was found, and that similar name result was stored, but not the original input.

bdieterm 6 days ago|||
Update to my previous comment:

All the data is still public. There are more than 104000 entries now.

The original name that was searched is also stored in the data (in another field; somehow I missed that before).

@turtlesoup: Why don't you restrict the access and why don't you put a warning on your page?

bluefirebrand 7 days ago|||
This was the first thing I thought too.

Even if this thing wasn't publicly displaying the names, I would assume they would be collecting them for something.

Can't trust anything like this online.

ronbenton 7 days ago||
Can’t trust anything online
dofm 7 days ago|||
And will thus potentially end up in the effing weights.
rdtsc 7 days ago|||
That's the first thing I looked at, to see HN-ers' real names :-) and thought "hey, that's a pretty clever way to get everyone's names".

Apparently it's fixed now. Surely you'll trust a random website...

Crowberry 7 days ago|||
That sucks… shame on me I guess
1over137 7 days ago|||
Wouldn't thinking so be the default for the HN crowd? I'd have thought any hacker would assume any text you type in a random website would be used however the website administrator wanted. (Not that the general public would think so.)
cocoa19 7 days ago||
Ugh too fucking late. What a privacy nightmare.
dvt 7 days ago||
I have a unique last name (maybe that's why), but it pretty much nailed it:

    David Titarenco
    Software engineer and open-source contributor

    340 strength · Top 20%

    GPT-5.5 says
    Software engineer and writer known for work
    on developer tools, systems, and programming-
    related articles.

    Claude Opus 4.8 says
    Software engineer and entrepreneur known for
    web/JavaScript development work and contributions
    to open-source projects and tech startup communities.
planb 7 days ago||
[dead]
defytonofficial 7 days ago||
[flagged]
victorbjorklund 6 days ago|
[dead]