Top
Best
New

Posted by chabons 6 hours ago

Muse Spark: Scaling towards personal superintelligence(ai.meta.com)
https://meta.ai/
188 points | 243 commentspage 3
toddmorey 6 hours ago|
Question: since they've rebooted their approach to AI... have they given up on open models? There's no mention of open source or open weights or access to the models beyond their hosted services.
thegeomaster 6 hours ago||
Alexandr Wang on Twitter [0] mentioned open source plans:

"this is step one. bigger models are already in development with infrastructure scaling to match. private api preview open to select partners today, with plans to open-source future versions. incredibly proud of the MSL team. excited for what’s to come!"

https://x.com/alexandr_wang/status/2041909388852748717

prodigycorp 5 hours ago||
So the answer is: no. lol. Remember Llama 4 Behemoth, and how we were supposed to get more great models from it?
wmf 5 hours ago||
This may be too large to run locally anyway. Maybe they will distill down some smaller open versions later.
khurdula 4 hours ago||
"we hope to open-source future versions of the model."

Love to see it. Cheers!

binaryturtle 4 hours ago||
Looks like it needs a meta account? As soon you hit enter it wants to log-in. I guess I won't try this any time soon. :)
spprashant 2 hours ago||
Sounds like a good effort. They are choosing to focus on multi-modality - perhaps they are taking a different route here to Anthropic.

I don't like that I need to login to my FB/Instagram account to access this.

btown 2 hours ago||
Benchmarks are meaningless until the pelican benchmark comes out: https://simonwillison.net/
tekacs 4 hours ago||
https://meta.ai/share/pe4HxOfv2Bp

Finding a little bit tricky to evaluate because the harness is unfortunately very, very bad (e.g. search is awful). Can't wait to try this in some real external services where we can see how it performs for real.

Definitely getting ordinary high-quality results, overall. But hard to test agentic behavior and hard to test prose quality, even, when just working off of the default chat interface.

One thing that stands out is that _for_ the quality it feels very, very fast. Perhaps it's just only very lightly loaded right now, but irrespective it's lovely to feel.

I'm quite impressed with the tone overall. It definitely feels much more like Opus than it does, like, GPT or Grok in the sense that the style is conversational, natural and enjoyable.

redlewel 2 hours ago||
I am already somewhat concerned with companies like Anthropic and especially OpenAI having personal data via chats. Typing that sort of information into a Meta AI product feels completely irresponsible. You could make some very sophisticated ads/psyop attacks with data from daily ai chats.

I doubt its better than Opus and even if it was its not worth the privacy concerns.

sidcool 6 hours ago||
Will experiment with the model. But I am scared of sharing any information with the Zuck ecosystem.
eranation 4 hours ago||
Sarcasm aside, tried it (with instant mode), it's an impressive model.

It nailed all the ChatGPT meme gotchas (walk to the carwash, Alice 50 brothers, upside down cup, R's in strawberry, which number is bigger, 9.11 or 9.9?)

I guess all that money poaching OpenAI / Anthropic talent went somewhere...

Now, would I use "Meta Muse Code" or "Muse CoWork" if I have to have a facebook account to all of my developers? Maybe not.

Would I use it via an API key? I might, depends on the pricing!

turtlesdown11 4 hours ago|
so since they hard programmed all of the meme gotchas, they built a good model?
nh23423fefe 3 hours ago||
lazy snark < playing around with it
eranation 5 hours ago|
So this is why Anthropic rushed the weirdest "pre-responsible-disclosure-totally-not-for-marketing" announcement yesterday? To make sure Spark doesn't steal their thunder? (Spark beats Opus 4.6 on some benchmarks...). Or did I become a bitter cynical old man.
levocardia 3 hours ago||
Anthropic had their mythos post (and model) basically ready a few weeks ago, as evidenced by the blog content leaks. Also I highly doubt they just threw together a 250-page PDF model card in a "rush."
hnav 5 hours ago|||
It's giving "OpenAI says its new model GPT-2 is too dangerous to release (2019)"
reducesuffering 4 hours ago||
[because it would start an arms race]. The very arms race we're in... They were right
dbgrman 2 hours ago|||
Last i checked with friends at meta they are pretty deeply invested in using claude for coding etc. anthropic has nothing to be scared of at MSL.

If spark beats opus 4.6, why is meta wasting money on opus internally?

signatoremo 4 hours ago||
13 days ago.

https://news.ycombinator.com/item?id=47538795

spindump8930 4 hours ago||
Yes, it's far more certain that meta released this, which is less convincing on evals, as a result of the mythos previews.
More comments...