Sakana Fugu - Hacker News

Posted by Finbarr 1 day ago

232 points | 121 commentspage 3

hmokiguess 1 day ago|

How do you configure it to run with pi or claude code? I'm curious to try it (via subscription ideally)

EDIT: Found something here https://dev.classmethod.jp/en/articles/sakana-fugu-ga-first-...

chvid 1 day ago||

This would have been much more interesting and impactful if it had relied on open source models rather than commercial models that are only availble via an API.

The reasoning chains could have been used, and the resulting combined model could easily and effectively have been distilled.

claw-el 1 day ago||

Will Le Chat try to eat Sakana? There is Le Chaton Fat and then there is Sakana Fugu too..

olmo23 1 day ago|

> Le Chaton Fat

For others looking around: LCF is a meme model, it's not real. It's a joke.

adamnemecek 1 day ago||

Seems kinda underwhelming considering they raised like $400M.

ffsm8 1 day ago||

400m is the new 400k! Just look at the other company evaluations and how much they raised vs what they delivered

itemize123 1 day ago||

it's just one of their products right

nickandbro 1 day ago||

Very interesting. I wonder if its kinda functions similarly to how OpenRouter's fusion API does. Hopefully isn't too long to respond.

ljlolel 1 day ago||

Yea similar, possibly even more steps / slower. I put together an all open source fusion at 1/3 of price of Fable: https://trustedrouter.com/blog/open-fusion-beats-fable-5

We open sourced it all

and will be releasing a similar orchestrator next week on TrustedRouter

stygiansonic 1 day ago||

From a brief reading of what Fusion does: https://openrouter.ai/docs/guides/features/plugins/fusion

Looks like Fusion calls a bunch of models and then uses an LLM to synthesize the results, and pass to another model for final output.

Fugu looks like it's doing something different? Using an LLM earlier on in the flow as an orchestrator to decide which other LLMs to call. More coordinator than simply synthesizing results, and more "agentic".

It's interesting because it's all exposed behind a single OpenAI compatible endpoint (Responses API?) and so then presumably someone could use this for one of their single agents. Now you have agent-of-agents, nested in some sense. The token usage increases accordingly!

soma8088 1 day ago||

$40/month on Kiro, I hardly ever hit my credit limit

andai 1 day ago||

See also: OpenRouter Fusion, similar idea, although it seems limited to internet research tasks? (Unclear, maybe someone who has used it can elaborate.)

What's nice is that OpenRouter included a pareto graph showing the cost as well as the performance. (But not time, unfortunately -- model fusion adds a large factor to round trip time.) Benchmarks are a lot less helpful without that.

OpenRouter: Surpassing frontier performance with fusion (blog post with benchmarks)

https://news.ycombinator.com/item?id=48525392

OpenRouter Fusion API

https://news.ycombinator.com/item?id=48537641

See also: Sibling comment with an open source implementation

https://news.ycombinator.com/item?id=48624782#48629598

I did my own last weekend in a few lines of Python, though I haven't tested it much yet. (Looking for some very hard, very cheap benchmarks, if such a thing exists!)

agalamli 1 day ago||

i've seen many AI models, tried some. i'm genuinely interested in trying this kind of model/architecture. however i'm a little confused about the pricing.

bprasanna 1 day ago||

Isn't this what perplexity is?

JumpCrisscross 1 day ago|

Is Perplexity still a daily driver for a lot of folks?

teknologist 10 hours ago|

Is this just OpenRouter Fusion but from Japan?

More comments...