Rio de Janeiro's "homegrown" LLM appears to be a merge of an existing model

Posted by unrvl22 4 hours ago

Rio de Janeiro's "homegrown" LLM appears to be a merge of an existing model (github.com)

124 points | 72 comments | page 2

AnotherGoodName 3 hours ago|

It's fascinating that it worked, though. Can we just merge all the open-weight models and get something better?

wds 3 hours ago||

I imagine it'd work the same as merging all the good-tasting foods to get an even tastier one

nylonstrung 1 hour ago|||

If you go to Civitai, this is pretty much how it works in that corner of the image generation world

Everything uses Stable Diffusion as the underlying model, and most of what people actually use is merges of checkpoints

avereveard 2 hours ago|||

most merges improve a small subset of "feeling" benchmarks (too small, too specific, or out of distribution) and tend to show degradation on actual benchmarks, with especially punishing results on long-chain benchmarks.

also, merges only work on matching architectures (i.e. finetunes/LoRAs of the same model)
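
To make the architecture constraint concrete, here is a minimal sketch of the simplest kind of merge, a linear interpolation of two checkpoints. It assumes each checkpoint is a plain dict mapping tensor names to flat lists of floats; `merge_linear` and the toy checkpoints are hypothetical names for illustration, not any merging tool's API.

```python
def merge_linear(ckpt_a, ckpt_b, alpha=0.5):
    """Return alpha * ckpt_a + (1 - alpha) * ckpt_b, tensor by tensor."""
    # Different tensor names mean different architectures: nothing to average.
    if ckpt_a.keys() != ckpt_b.keys():
        raise ValueError("tensor names differ: not the same architecture")
    merged = {}
    for name in ckpt_a:
        a, b = ckpt_a[name], ckpt_b[name]
        # Same names but different sizes is also a mismatch.
        if len(a) != len(b):
            raise ValueError(f"shape mismatch in {name!r}")
        merged[name] = [alpha * x + (1 - alpha) * y for x, y in zip(a, b)]
    return merged


# Toy stand-ins for a base model and a fine-tune of that same model.
base = {"layer0.weight": [0.0, 1.0, 2.0], "layer0.bias": [0.5]}
tune = {"layer0.weight": [1.0, 1.0, 4.0], "layer0.bias": [1.5]}
merged = merge_linear(base, tune, alpha=0.5)
```

The element-wise average is only defined when every tensor lines up one to one, which is why this kind of merge is limited to finetunes of the same model.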

dindunuf 2 hours ago|||

that kinda worked in the Llama 1/2 era, not between different models but between finetunes of the same model. the briefly legendary MythoMax was IIRC a merge of 5+ tunes, some of which were merges themselves.

_3u10 3 hours ago||

No, they need the same arch, but you can distill them into a single model. And yes, if you use the API directly Claude will often say it’s an open weight model (likely the ones it was distilled from)

yieldcrv 2 hours ago||

Didn’t the last thread about this have someone from the lab or an enthusiast in Rio saying exactly that?

It's a fine-tune of Qwen

Not a conspiracy

daemonologist 2 hours ago|

The allegation here is that it's not actually a fine-tune of Qwen, but instead an undisclosed mashup (merge) of someone else's fine-tune of Qwen and the original model. Rio subsequently said that the model was in fact a merge, that they did additional fine-tuning after the merge, and that they accidentally uploaded the base merge instead of the version with additional fine-tuning. But this seems like quite an oversight...
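
The thread does not say how the merge was identified, so the following is only a hypothetical check, not the method actually used. If a suspect model were a pure linear merge, its weights would sit on the line between the base and the fine-tune, and a least-squares fit would recover the mixing ratio with almost no residual. The function `fit_merge_ratio` and the toy weight vectors are assumptions for illustration.

```python
import math


def fit_merge_ratio(base, tune, suspect):
    """Least-squares alpha for suspect ~= base + alpha * (tune - base)."""
    delta = [t - b for t, b in zip(tune, base)]       # direction of the fine-tune
    shift = [s - b for s, b in zip(suspect, base)]    # how far the suspect moved
    denom = sum(d * d for d in delta)
    if denom == 0.0:
        raise ValueError("tune equals base; ratio is undefined")
    alpha = sum(s * d for s, d in zip(shift, delta)) / denom
    # Whatever the line cannot explain is left over as residual.
    residual = math.sqrt(sum((s - alpha * d) ** 2 for s, d in zip(shift, delta)))
    return alpha, residual


# Toy flattened weights; the suspect is built as 25% of the way to the tune.
base = [0.0, 1.0, 2.0, -1.0]
tune = [1.0, 1.0, 4.0, -3.0]
suspect = [0.25, 1.0, 2.5, -1.5]
alpha, residual = fit_merge_ratio(base, tune, suspect)
```

A near-zero residual is consistent with a plain merge but does not prove one. Additional fine-tuning after the merge would be expected to push the weights off that line and leave a larger residual.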

yieldcrv 59 minutes ago||

> But this seems like quite an oversight...

Not to me. What would people like to happen? Who are those people? And why do they care?

diego_moita 1 hour ago||

WHAT!? There are thieves in Rio de Janeiro?

Oh, I am so SHOCKED, so SHOCKED! /s

Explaining the joke: in Brazil, Rio de Janeiro is known as "Terra de bandido" (Gangster's Land).

Kinda like Chicago in the 1920s or Naples and Palermo in the 1990s.

elzbardico 3 hours ago||

[flagged]

guiraldelli 3 hours ago||

Without evidence, your comment is just bad mouthing.

I have been involved in academia, including in Brazil, and I don't find academia there any more copycat than any other institution, including top tier ones.

boca_honey 2 hours ago||

This is very easy to prove [1][2]. Brazil has that reputation in the broader academic world, and it's for a reason.

[1] https://www.sciencedirect.com/science/article/abs/pii/S17511...

[2] https://www.scielo.br/j/aac/a/xNytDrrrHdyK4XPcHBRJZmd/?lang=...

avdelazeri 2 hours ago||

One study about faculty hiring people they know, and the other about high school students cheating on assignments... What was the original claim again?

dghlsakjg 2 hours ago|||

This was a municipality working with a government-associated IT company.

What does it have to do with Brazilian academia?

_3u10 3 hours ago||

No, typically Brazilians go to Paraguay for their education, most of their technology comes from Paraguay too.

knuppar 2 hours ago|||

that's just a lie lol, stop spreading misinformation

cassiogo 2 hours ago|||

What? Never heard of this

stymaar 2 hours ago||

That sounds like nonsense; they don't even speak the same language in Brazil and Paraguay…

alfiedotwtf 3 hours ago|

Wasn’t it already obvious given the awfully familiar parameter numbers?

intoXbox 1 hour ago|

That only tells you which base architecture they used. Fine-tuning does not increase the number of weights; it just adapts the existing weights to perform better on a fine-tuning dataset, which is something they claimed they had done.
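
A toy sketch of that point, assuming full fine-tuning with no added adapter layers: a gradient step changes the values of the weights but not how many there are. The dict-of-lists checkpoint, `count_params`, `sgd_step`, and the made-up gradients are all hypothetical, chosen only to keep the example self-contained.

```python
def count_params(ckpt):
    """Total number of scalar weights across all tensors."""
    return sum(len(values) for values in ckpt.values())


def sgd_step(ckpt, grads, lr=0.1):
    """One plain gradient-descent update: w <- w - lr * g."""
    return {
        name: [w - lr * g for w, g in zip(ckpt[name], grads[name])]
        for name in ckpt
    }


# Toy base checkpoint and made-up gradients from a fine-tuning batch.
base = {"layer0.weight": [0.0, 1.0, 2.0], "layer0.bias": [0.5]}
grads = {"layer0.weight": [1.0, -1.0, 0.0], "layer0.bias": [2.0]}
tuned = sgd_step(base, grads)

same_size = count_params(base) == count_params(tuned)  # values moved, count did not
```

So a matching parameter count is what you would expect from a fine-tune, a merge, or an untouched copy alike; on its own it cannot distinguish between them.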