DeepSeek v4 - Hacker News

Posted by impact_sy 2 days ago

DeepSeek v4(api-docs.deepseek.com)

https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main...

970 points | 626 comments

jari_mustonen 2 days ago|

Open Source as it gets in this space, top notch developer documentation, and prices insanely low, while delivering frontier model capabilities. So basically, this is from hackers to hackers. Loving it!

Also, note that there's zero CUDA dependency. It runs entirely on Huawei chips. In other words, Chinese ecosystem has delivered a complete AI stack. Like it or not, that's a big news. But what's there not to like when monopolies break down?

chvid 2 days ago||

The incredible arrogance and hybris of the American initiated tech war - it is just a beautiful thing to see it slowly fall apart.

The US-China contest aside - it is in the application layer llms will show their value. There the field, with llm commoditization and no clear monopolies, is wide open.

There was a point in time where it looked like llms would the domain of a single well guarded monopoly - that would have been a very dark world. Luckily we are not there now and there is plenty of grounds for optimism.

sigmoid10 2 days ago|||

Still not sure how I feel about China of all places to control the only alternative AI stack, but I guess it's better than leaving everything to the US alone. If China ever feels emboldened enough to go for Taiwan and the US descends into complete chaos, the rest of the world running on AI will be at the mercy of authoritarian regimes. At the very least you can be sure noone is in this for the good of the people anymore. This is about who will dominate the world of tomorrow. And China has officially thrown their hat in the ring.

Ladioss 2 days ago|||

I always find it an illuminating experience about the power of mass propaganda every time I see an American believe they somewhat have the moral high ground over China, despite starting a new war somewhere around the globe either for petrol or on behalf of Israel every six months.

kiba 2 days ago|||

Just because America is doing bad things doesn't mean China is good, or vice versa.

flossly 1 day ago|||

> Just because America is doing bad things doesn't mean China is good, or vice versa.

When someone points out hypocrisy, this is "the answer", it seems. But it is just a statement, not a rebuttal of the hypocrisy that was pointed out.

Hypocrisy is still hypocrisy.

And bad things are bad things. Yet no amount of propaganda (red scare, "eew dictatorship", Uyger-genocide, Taiwan threat) can convince me that the China is as evil (or more evil) than the US-Israel alliance of the the last 50 years.

LinXitoW 1 day ago||||

Of course not, but that's never how Americans act. The commenter didn't say "I don't like that the only two serious competitors are from the USA and China", they ONLY called out China.

It's a small difference, but important. Especially because that person is far more likely to be responsible (voting) for and profiting from USAs bad stuff.

razodactyl 1 day ago||||

I think a lot of us are blinded by our own propaganda. I would expect many Chinese geeks to have the same values as us for the greater good of humanity.

Lapel2742 2 days ago||||

> Just because America is doing bad things doesn't mean China is good, or vice versa.

Of course not. When it comes to SOTA LLMs you have the choice between two bad options. For many, choosing the Chinese option is just choosing the lesser of two evils (and it's much cheaper).

eloisant 1 day ago||

Why people always dismiss the European option?

Mistral is right here, their models are in-between the cheap to run Chinese models and top of the line performances of US frontier models.

Lapel2742 1 day ago|||

> Why people always dismiss the European option?

Mistral is good for many tasks where you do not need SOTA or near SOTA performance. They cannot compete if you do.

john_minsk 1 day ago||||

For a lot of people in the world Europe = USA

benterix 1 day ago||

But this makes zero sense. Two different continents, values systems, law systems. Not to mention the current USA administration is openly hostile to Europe. So why would anyone confuse the two.

3s 1 day ago|||

It’s not top of the line and mostly not open source

cpursley 1 day ago|||

Yeah, idk this looks pretty good and they ain't bombing anyone nor trying to spread global communism USSRs style:

https://www.youtube.com/watch?v=P7W20hdgWXY

I think I'll take the open AI models, innovative high quality EVs and cheap solar panels, please.

vsgherzi 1 day ago||||

Not about moral high ground. Ones a democracy one isn’t.

makingstuffs 1 day ago|||

How can there be democracy in an environment where freedom of thought is all but nullified due to social manipulation through mainstream media. Calling something ‘free’ doesn’t make it so.

The reality is that the term democracy in western society has essentially become meaningless due to the swathes of algorithmic manipulation which occurs every second of everyday through every possible digital medium.

spaceman_2020 1 day ago||||

Your democracy has consistently voted senile 75 year olds for 3-elections now

The current president - who Americans voted for twice - is heavily accused of being a pedophile and has reneged on every one of his poll promise

Really not the best advertisement for democracy

wiseowise 1 day ago||

The difference is that there was (at least an illusion of) choice. Nobody said that it is a perfect system. And Trump will be gone in 3 years, while Putin and Xi will stay in power until their death.

dspillett 1 day ago||||

I'm going off democracy, at least how it is currently implemented. It is proving far too easy to pervert.

It turns out that the people will vote for some terrible things in order to get that one petty little thing a given candidate promises and they want, or because they don't like something specific about the other candidate(s). And of course they'll later say “well, I didn't vote for that” when they quite demonstrably did.

benterix 1 day ago||

Well, the politicians learned how to game the system well. Now people need to learn how to game the politicians. A formal verification process of pre-election promises would be a good start.

lmz 1 day ago||||

So that means the people are complicit in whatever wars the US started. Not sure if better or worse.

sscaryterry 1 day ago||||

"Democracy is the worst form of government, except for all the others that have been tried". Winston Churchill

sucrosesucrose 1 day ago||||

And why should anyone prefer a democracy over any other form of government? Doesn't it depend on the philosophy of each People?

wiseowise 1 day ago||

> created: 18 minutes ago

Right.

Jackpillar 1 day ago||||

Democracy is a stretch

monadgonad 1 day ago||||

"Not about moral high ground. One's an ideology my morals agree with, one isn't."

wiseowise 1 day ago||

Is believing people should have a choice a moral high ground now?

drcongo 1 day ago||||

Can you clarify which is which?

sigmoid10 1 day ago||

Chinese propaganda seems to hit very hard these days. If you really don't know, you seriously need to check what media you are consuming. Yes, the US has huge problems, many old and some new, but on a serious technical level the answer is (at least for now) 100% clear.

chvid 1 day ago|||

Germany was a democracy when it fought the Soviets.

wiseowise 1 day ago||||

What makes you think they’re American?

mrkramer 1 day ago||||

All empires are to some degree evil because their agenda is to dominate weaker peoples and nations. They almost all committed crimes against humanity and genocides if you look retrospectively from the todays point of view. Even our beloved Roman Empire that the Western civilization is built upon was genocidal empire.

benterix 1 day ago||

Not sure if we can call it "beloved". For sure respected for what it did to build the base of modern civilization, but we are aware of its dark sides. And probably Nero would be an excellent example of what can happen to the empire and its people when a crazy person becomes its ruler.

melagonster 2 days ago||||

[dead]

OCASMv2 2 days ago||||

[flagged]

willsmith72 2 days ago||||

[flagged]

latexr 1 day ago|||

> That doesn't mean it's positive in human rights

Neither is the US, land of slaves, segregation, and the KKK. They did seem to get better there for a few of decades, but sure are working hard to return to their roots.

birdsongs 2 days ago||||

> That doesn't mean it's positive in human rights

Isn't the US building mass detention camps right now for all the brown people there? And arresting / detaining / demanding papers from any and everyone? With federal agents killing civilians?

Don't get me wrong, China is also horrible here, they have their own camps.

But pretending the US is positive wrt human rights is a wild take in 2026.

Levitz 1 day ago|||

>Isn't the US building mass detention camps right now for all the brown people there?

No, it is not, but the freedom of speech protections the US has (that China doesn't) allow for such commentary.

birdsongs 1 day ago||

Good thing I'm in Europe and not governed by those.

And yes, they are-

https://en.wikipedia.org/wiki/List_of_immigrant_detention_si...

sam_goody 1 day ago||||

> sn't the US building mass detention camps right now for all the brown people there?

Why would you think that?

> And arresting / detaining / demanding papers from any and everyone?

I have lots of friends from outside the U.S. that come regularly and don't find it onerous. Maybe it depends where you are coming from?

> With federal agents killing civilians?

OK, I agree that there are issues, and even very serious ones. Obviously, not on the level of China, but still serious issues. Nonetheless, what you see on left leaning media is not representative of what is happening on the ground throughout the U.S. Not even close.

IMO, the US is definitely positive wrt human rights. There are issues, but you can go to a No Kings protest, and live your life happily without issues, and it is hard to find another country that is nearly as forgiving. And it at least has people trying to spread concepts of individual liberty, vs most countries in Europe, almost all countries in Asia, and ALL Muslim countries, that are leaning to removing individual rights.

mapcars 1 day ago|||

>Isn't the US building mass detention camps right now for all the brown people there?

No? Its for illegal people, regardless of color. Just so happened that most illegals come from specific places

drcongo 1 day ago||

kein Mensch ist illegal

me551ah 2 days ago||||

With the number of wars that the US have waged over the years including in Vietnam, Iran and supporting Israel. I don’t think even the US has done a stellar job in defending human rights.

If you meant American citizen human rights, then you’re correct.

latexr 1 day ago||

> If you meant American citizen human rights, then you’re correct.

Not even that. ICE has already killed US citizens, they no longer prohibit segregation, trans people were banned from the military, and many more. All of those affect American citizens.

tw1984 1 day ago||||

> That doesn't mean it's positive in human rights

How about your pack up your arrogance and stop defining human rights for me and other 1.4 billion Chinese?

OtomotO 1 day ago||||

How positive for the human rights of the people of invaded countries was the US?

Ask around in Vietnam, Iraq, Syria and countless more countries around the world.

epolanski 1 day ago|||

[dead]

rhubarbtree 2 days ago||||

[flagged]

fastasucan 2 days ago|||

Not very democratic to invade other countries on the whim of a president.

JumpCrisscross 2 days ago||||

> they said democratic

They didn't even say that. They only said China playing is "better than leaving everything to the US alone."

Cthulhu_ 2 days ago||||

For now indeed, the people that want to get rid of it are currently in power.

The US was one of the first democracies in the world, and many countries followed suit. But the US hasn't kept up, and now the powers that be have exploited the weaknesses in the system. With arguably the biggest one being giving the president too much power (appointing supreme court justices, executive orders, etc).

jack_pp 2 days ago||||

Democracy in most of the countries is just theater. Trump promised no more wars iirc.

Don't get me wrong, I'd rather live in a country without a million cameras that automatically fine me for crossing the street illegally but I don't actually deceive myself in thinking my vote counts for much.

culi 1 day ago|||

> I'd rather live in a country without a million cameras

Are you talking about the US or China? https://deflock.org/

China at least banned the use of facial recognition in public spaces by their supreme court in 2021 (and then further strengthened the ban in 2024 and also got the PIPL).

If you're thinking of the "social credit" system please know that that's just an online meme. China's credit score system is not even nationalized and not nearly as invasive as the US's credit score system, which can sometimes determine whether or not someone is allowed to buy a house.

Besides their own credit score system, the other thing that sometimes gets labelled the "social credit system" was an attempt they had to track the behavior of business leaders and elected politicians. Basically anyone who holds social power but not the common person. This also never really took off and was not ever nationalized/centralized.

palata 2 days ago||||

> I'd rather live in a country without a million cameras that automatically fine me for crossing the street illegally

Agreed, but there again, the democracies have surveillance capitalism, it's not exactly like we're not being tracked.

phatfish 2 days ago|||

You let Trump and all the tech-bro shitheads win with that attitude unfortunately. Democracy is an ongoing battle.

cumshitpiss 2 days ago|||

[dead]

FrojoS 1 day ago|||

[flagged]

subdude 1 day ago|||

I agree, that's why Iran is correct to arm and defend themselves against Israel and the US.

samrus 1 day ago|||

Yeah, those 8 year old girls had been 2 weeks away from developing a nuke. Had been since 1997 im told

chmod775 1 day ago||||

> Still not sure how I feel about China of all places to control the only alternative AI stack, but I guess it's better than leaving everything to the US alone.

Fully agree. From a US perspective, that sucks. For everyone else it's pretty great.

At this point the world's opinions of China are better than those of the US in some polls. One country invests and helps build infrastructure on a massive scale globally, the other alienates allies, causes countless conflicts, and openly threatens to end civilizations.

Indeed, even if one isn't partial to China, there's reasons to be glad that an increasingly hostile US has powerful competition.

> This is about who will dominate the world of tomorrow.

For this you'd need a technological moat. So far the forerunners have burned a lot of money with no moat in sight. Right now Europe is happy just contributing on research and doing the bare-minimum to maintain the know-how. Building a frontier model would be lobbing money into the incinerator for something that will be outdated tomorrow. European investors are too careful for that - and in this case seem to be right.

benterix 1 day ago||

Yeah it's confusing. I mean China has work camps for Uighurs and is very brutal on Tibetans etc. OTOH, their leader is not setting the world on fire every second week and compared to Trump seems like the paragon of reason on the surface. Of course we know it's a facade but man what crazy times to live in.

Cthulhu_ 2 days ago||||

Moral stances aside, I'd argue it's healthy that the US gets competition from abroad. I appreciate the boost that the world is getting from China - infrastructure and construction projects are a huge benefit to economies. Their focus on green energy has caused a huge influx of affordable solar panels, home batteries, EVs, etcetera, helping reduce the dependency on fossil fuels - while the US and especially the other big money spenders in the middle east would rather the world remain fully dependent on them. But for the past years Europe and now Asia are feeling the pain from being overly reliant on that.

China's policies and government aren't morally defensible and I do fear that they will become more aggressive in spreading their influence and policies onto other countries, but from an economic standpoint what they're doing is super effective. While the previous world power (the US) is stuck in infighting and going through cycles of fixing/undoing the previous administration's damages, instead of planning ahead.

mft_ 1 day ago||||

You’re right… but that’s on the rest of the world not getting their shit together.

It’s this sort of example (and not properly supporting Ukraine, and not agreeing how to collectively deal with migrants, and not agreeing how to coordinate defence, and myriad other examples) that highlights what a pointless mess the EU is. It’s not a unified block - it’s 27 self-interested entities squabbling and playing petty power games, while totally failing to plan for the future with vision.

The EU could/should have ensured that a European equivalent to OpenAI or Anthropic could thrive, and had competitive frontier models already; instead, they’re years and countless billions behind.

simgt 1 day ago||

The EU pouring even more billions in this would just have meant pouring billions on US tech. China is winning on all fronts at this game because of the embargo, they end up even more vertically integrated as a result of it.

benterix 1 day ago||

> The EU pouring even more billions in this would just have meant pouring billions on US tech.

Which is crazy given that ASML is European.

chvid 1 day ago||||

The important thing is that LLMs are well-dispersed and the technology is relative open, much more open than it could have been. Alternative worthwhile LLMs will emerge from Europe and other non-US western countries once the economic incentives are there.

cde-v 1 day ago||||

China doesn't even care about Taiwan anymore, their saber-rattling about it is a convenient distraction while they quietly make it completely irrelevant in the next few years.

SgtBastard 1 day ago||||

Mistral (a French company) shouldn’t be discounted.

amunozo 2 days ago||||

Competition with the Soviet Union gave all the workers in the world better conditions, also advances in science and technology... (And risk of mutual destruction ;)), even if the USSR wasn't good.

Danox 2 days ago|||

Isn’t Mistral close in the ballpark?

techsystems 1 day ago|||

There are no European models that come close. It's Korean models, then a UAE model K2, then Mistral.

Lapel2742 2 days ago||||

AFAIK: Current Mistral models are not competitive with SOTA-models that come out of the USA or China. They are "good enough" for enterprise usage when you don't need SOTA performance.

Their main selling point is: They are neither US-American nor Chinese. That's a real moat in today's world. I think at the moment they feel quite comfortable.

eunos 2 days ago|||

They arent. Benchmark wise they are quite apart.

spaceman_2020 1 day ago||||

I've been baffled watching America double down on the same strategy even when it failed to produce results

They sanctioned the hell out of Huawei and now Huawei is bigger than ever

America is just not able to digest the idea that another country can be as good, if not better, at innovation

lanthissa 1 day ago|||

not really, china has gone domestic for everything as soon as it could.

its naive to think they would have stayed on a 'western' stack.

Most of the time 'losing' isn't making a bad choice its being put in a situation where you have no good choices.

ifwinterco 2 days ago|||

As a Brit I'm here for it to be honest, I'm tired of America with everything that's going on.

China is not perfect but a bit of competition is healthy and needed

jurgenburgen 2 days ago|||

I don’t know if we’re ahead of the curve but that tired feeling has started turning into hate here in the EU. I guess being threatened with invasion does that to you.

The next decade is going to look very different with America Alone.

koe123 2 days ago|||

I grew up in the states when I was younger, always feeling some closeness to Americans even after I moved back to Europe.

With all that goes on it has changed. Recently I sat on a plane near some Americans discussing their holidays here, and I noticed I felt contempt. Sitting their with insane privilege as their government torches the world.

Individuals remain individuals, and one really ought not to be prejudice. However the lack of resistance I see in in the “land of the free” as their “democratic” institutions collapse just makes me believe they never cared at all. In France cars are torched if the pension age is raised. In America the rise facism apparently doesnt matter to them.

mettamage 2 days ago|||

From my small bubble it's not that. I'm Dutch, married to an American who now knows enough Dutch such that we can treat it as a secret language when we're in the US.

My family in law seems to swing slightly republican. As a Dutchie, I could get some answers because I'm too naive not to talk about politics. So I got to probe a bit. What I simply found was that they'd say "I can't trust the news, none of it. Not CNN, not Fox News, nothing". Then I'd say "well in the Netherlands, I'd argue that while news outlets have their bias, you can trust them on basic factual reporting". She looked at me with a stare that I could only describe as "oh but honey, you're too young and naive to understand". To which I thought "you don't know the Netherlands. We're not perfect but we're nowhere near as deranged as what I'm seeing here".

I think that explains a lot of it for some people. The trust in the media, all media, is completely broken. Trump has how many fellonies now? Can't trust it. Kamala is doing what now? All talk. DOGE is fixing the government? I fucking hope so! But can't trust the damn news. Whether they do or don't, they are always burning money, god damn bureaucrats.

I feel that's the mindset that my family in law has.

zmmmmm 1 day ago|||

> I can't trust the news, none of it. Not CNN, not Fox News, nothing

This view gets echoed here on HN a lot. I find it very strange to be honest, because I tune in to CNN and I see lots of bias in the commentary and editorial, but when it comes to factual reporting they are pretty straightforward and down to earth. It seems to me that the real issue is people don't seem to distinguish between reporting and editorial content / commentary. Stop watching that garbage and actually consume the factual content and analysis. Yeah it's dry and boring but if that isn't enough for you then it just shows you never cared about facts in the first place.

fastasucan 2 days ago||||

I think this is spot on. "Every fault of america is just how it is in any society.". Nice way to just accept it.

JumpCrisscross 2 days ago||||

Out of curisoity, what is your wife's take?

My running hypothesis has been the trust breakdown arises from social-media overexposure driving lazy nihilism, which in turn gave free reign to a uniquely-corrupt class of politicians. But I'm not sure how to neutrally evaluate that.

virgildotcodes 1 day ago||

I think the collapse of public trust was very intentional, and the result of a much longer term effort than social media.

The most famous examples are likely the tobacco industry spreading misinformation through self-funded studies and experts, and the fossil fuel industry doing the same to seed doubt about climate change. But of course we can think of countless examples of entire industries and individual large corporations pushing out misleading bullshit, threatening or outright killing journalists and activists to cover up their catastrophic fuckups and their chronic conscious excretion of negative externalities.

This has all of course been going on since the dawn of time, but to focus on the last century in the US, we've seen all sorts of corporations and coalitions of rich and powerful people push misinformation into nearly every sector of our society - universities, science, journalism, politics, etc. in order to undermine confidence in shared facts, corrupt people's ability to discern whether or not something is fundamentally true, and sow confusion so that they can continue to operate in perpetuity in this chaotic maelstrom of doubt.

Lots of capture of government towards these ends as well, we can look at the concomitant constant cuts to education in order to weaken people's understanding of the world and ability to think critically. The revocation of the Fairness Doctrine was probably a step change, and Trump represents the sharpest recent escalation of all this.

From day one, he's done everything he can to shred any collective notion of shared objective truth. Anything he doesn't like is fake news, and the idea that the media is lying, scientists are lying, experts are lying, and institutions are lying, he has spread so fucking successfully through society, to the point where Americans no longer have anything like a shared sense of reality.

It seems like we're being reduced to tribes who are organized primarily around faith in various charismatic individuals.

I think this is fundamentally the worst thing he's done, because it lays the foundation for virtually every other conceivable and inconceivable abuse. If people can't even agree on what is happening, we're fucked. People and institutions in power can do anything they want to whoever they want, because the public has lost their ability to even recognize the danger posed to them collectively and thus mount any resistance based on a shared sense of reality.

Social media has definitely famously accelerated aspects of this like the fragmentation and the spread/magnification of fringe worldviews through echo chambers, but I think it's just one (and maybe this is controversial, but I'd be willing to be generous enough to think the 20something year old creators were too stupid to conceive of these long term consequences at first, but who knows, maybe not) element in a much longer and more intentional, malicious war against the many for the benefit of the few.

dkga 1 day ago||

Not only that, but in tandem the collapse of social capital in the US has been the result of a very intentional process (on top of the multidecade undercurrent of declining social capital). This according to Robert Putnam himself (sorry, don’t have time to find the source now but will add it later).

roer 1 day ago|||

This is quite interesting. I'm not sure what can to be done to reverse this? When you've reached a level of untrust where you deem trust itself naive, how can you recover?

TomGarden 2 days ago||||

I a European who spent the last decade in America and I'm not sure I'd call Americans privileged compared to Europe. With money being the one means you have to be treated well in society, comparing it to Europe, America feels like the hunger games. Want healthcare (ie surviving)? Healthy food? To own your house? Welcome to the games

wiseowise 1 day ago||

Europe doesn’t wage war right now. Their point is that Americans are talking about vacation while their troops invade and destroy Iran.

sterlind 2 days ago||||

not all of us are just "sitting here with insane privilege." it's quite dangerous for some of us right now.

I'm trans. this Administration does not like us. after Charlie Kirk's murder, things got legitimately scary. Musk was retweeting people who called us "deranged bioweapons" who needed to be "forcibly institutionalized." NSPM-7 is surveilling and infiltrating trans organizations. the Heritage Foundation proposed labeling us as "ideological extremists," in the same category as neo-Nazis. if I'm arrested, I'll go to a men's prison where I'll likely be given to a violent inmate as his cellmate to "pacify" him (V-coding.)

so yeah, I keep my head down. a lot of Jews kept their heads down in Germany in the '30s, you know? and just like then, it doesn't seem like other countries are too keen on taking us in as refugees. I hope that changes if things get bleak.

JonChesterfield 2 days ago||

Get out seems an important priority. Good luck

n8cpdx 2 days ago||||

It’s not that it doesn’t matter to Americans. It is worse; half the population (or at least, half the voting population), is thrilled with the development of fascism. The other half has been ringing the alarm bells for well over a decade; it seems to make no difference.

And you’re right, most Americans do not understand the privileges they have or give one single shit about democracy; it is just not a salient political issue. But eggs… don’t get me started on eggs.

KronisLV 1 day ago|||

> The other half has been ringing the alarm bells for well over a decade; it seems to make no difference.

I feel like the issue there is that alarm bells in of themselves solve nothing. I won't extend that argument to one of its obvious conclusions, but instead I will say that efforts to attack education and critical thinking skills all contribute to people being susceptible to their democracy being corrupted and robbed blind - so having an educated populace with a sense of integrity and respect of human rights would help!

Cthulhu_ 2 days ago|||

It's probably a bit more nuanced than "half this, half that"; when you look at the facts, most voters aren't that extremist. A lot of votes vote one way or the other because they would simply never vote for the other.

This is why the swing voters / swing states are so important in the US, because only a few million are flexible enough to switch sides.

Of course the core issue is that there's a two party system; while I'm sure that in a healthy democracy the current republican and democrat parties would be the bigger ones, they wouldn't have a majority.

drcongo 1 day ago||

> A lot of votes vote one way or the other because they would simply never vote for the other.

This, for me, is the crux. Politics is treated like a team sport in the US, you pick your side and cheer them on no matter what. And team sports in America are even more bananas - you grow up supporting the Brooklyn Dodgers and a few years later they're 2.5k miles away with a new name. This seems a perfect example of what's happened / happening to the Republican Party - it's not the same party any more, but everyone who tied their entire personality to cheering for the red team is still cheering for it as it burns the country to the ground. I predict that inside ten years it will have also had the name change and probably be run out of Florida or somewhere.

enaaem 1 day ago||

I think it's more like a cult than a team sport. Sports fans figure out really fast if their manager is shit.

jorvi 2 days ago|||

> In France cars are torched if the pension age is raised.

This is not something to be proud of. You guys are giving yourself loaned freebies, retiring 5+ (!) years earlier than countries like BeNeLux and Germany, and are pretty much expecting the EU to eventually pick up the pieces which will drag us all down.

Edit: always lovely when HN downvotes truths :)

the_gipsy 2 days ago||

That's bullshit. Pensions are not a zero-sum game, and other countries don't have to pay for them.

It just doesn't make sense to delay retirement while youth unemployment is such a big problem. We ALL should be fighting like France, in many aspects.

jorvi 2 days ago|||

Other countries don't directly pay for the pensions, but France is staring into a giant fiscal abyss because of their low retirement age (and other generous social benefits). Any attempt to change those results in the country being taken hostage by rioters, thus nothing changes.

At some point France will be in too deep shit and will look to the EU to cover for them. We will all pay for that. And it is deeply unfair because other countries their citizens have accepted later retirement and more frugal benefits to keep their countries fiscally healthy.

France could cover the fiscal hole in other ways, but taxing corporations and wealth at a higher rate also consistently ends up being blocked. And each year the hole gets deeper.

snakeboy 1 day ago||

> Any attempt to change those results in the country being taken hostage by rioters, thus nothing changes.

Your theory doesn't actually match with reality, given that Macron's retirement reform was passed into law despite protests. As currently enacted, the age of retirement in France will progressively increase from 62 until reaching 64 in 2030.

jorvi 1 day ago||

It does match reality.

Reform wasn't passed, it was forced via a technicality after riots made it politically unpalatable, and it has put France in a governing crisis ever since.

Also, retirement in North, West and Central EU is 67+, not 64. Greece is at 67 too, although begrudgingly.

Again, I'd be equally happy if France covers the fiscal hole some other way, but I am not going to cover for a country that is willingly becoming the sick man of Europe because they want to live comfortably on borrowed time. Which, by the way, is a literal repeat of Greece its crisis. Time is a flat circle indeed.

FrojoS 1 day ago|||

It’s not bs. France is lobbying for “Eurobonds”, debt they can take at German interest rates and with Germans etc holding the bag, for about two decades now.

https://youtu.be/tMd7EfFsPIc (Video claims France is against them, but if they ever were they are not anymore)

miroljub 2 days ago|||

[flagged]

hsiudh 2 days ago||||

"not perfect" is a _very_ big simplification of what China is though

rglullis 2 days ago|||

Isn't that the same to every major superpower?

FuckButtons 2 days ago|||

No. There is no moral equivalence with totalitarianism.

ywvcbk 2 days ago|||

Modern China isn't exactly totalitarian though and US is rapidly converging with China in that regard anyway.

bwv848 2 days ago||

How totalitarian is exactly totalitarian? I asked chatgpt and it gave few points

- Control goes beyond politics

- A single, all-encompassing ideology

- No meaningful private sphere

- Mass mobilization and propaganda

- Extensive surveillance and repression

Seems like China is ticking all the boxes.

drcongo 1 day ago||

Honestly thought you were listing traits that the US has now till the last line.

bwv848 1 day ago||

Really laugh my ass off, so much whataboutism and American centrism when the debate is whether China is trustworthy on AI. Given your ignorance you should go and do your research, but I will help you a bit here.

- Control goes beyond politics

state corporation monopoly, 党支部 in private sector, crackdowns on NGOs and charities.

- A single, all-encompassing ideology

Party led, mandarin speaking Han Chinese nationalism, blended with Little Pink's unquestionable support for Xi and the party.

- No meaningful private sphere

社区网格员

- Mass mobilization and propaganda

We saw mobilization Chinese social media, attacking celebrities who don't openly say anything the party wants them to say. Mobilization in real life is rare though, cos it had shown it can backfire.

- Extensive surveillance and repression

Do I really need to explain this?

Hamuko 2 days ago||||

Which are the current nontotalitarian superpowers?

NicoJuicy 2 days ago||||

That's also the current US administration.

Luckily laws still stand somewhat.

( And Trump ain't smart enough)

fragmede 1 day ago||

Trump's smarter than he lets on. He plays the buffoon in public, but he's smart enough to have gotten elected twice. Which is two times more than I've managed to.

DiogenesKynikos 2 days ago|||

China is not totalitarian. Many people believe that China is still like 1950s-60s-era Maoist China, but it's just not.

FuckButtons 2 days ago||

tiananmen square was in 1989. Hong Kong was snuffed out like a light. Covid saw people caged and sealed in their houses. You do not need to look back at the cultural revolution to see the prc for what it is.

e4325f 1 day ago|||

and the Kent State shootings were in 1970.

Being self-righteous and a yank doesn't make sense, country of war mongers, something that cant be said of China.

FuckButtons 1 day ago||

Clearly I’ve hit a nerve if you’re stooping to whataboutism. Perhaps you should reflect on why that is.

DiogenesKynikos 2 days ago|||

Is your contention that Hong Kong is also a totalitarian society? Have you been to Hong Kong in the last 5 years? I feel like people saying these sorts of things are just completely divorced from reality.

> Covid saw people caged and sealed in their houses.

No. There were a few incidents very early on, when everyone was (quite understandably) panicking about a new, deadly virus that nobody had ever seen before, when some local city officials barred the doors of people who had just come from Wuhan. That was a scandal inside China, and it was immediately reversed.

What China did do quite extensively was border quarantine, and during localized outbreaks (caused by cases that slipped through quarantine at the border), mass testing and quarantine measures. This was during a once-in-a-generation pandemic that killed millions of people. In China, these measures saved several million lives. The estimates are that China's overall death rate was about 25% that of the US, and these measures are the reason. By the way, Taiwan and Australia took nearly identical measures, and I very much doubt that you would call them totalitarian societies.

scrollop 2 days ago|||

Whatbaoutism at it's finest.

Have a peek at the fredom indx and the press freedom index for China. Guess where they stand?

You know about the chinese internet firewall.

You can't trust any data from the CCP.

And please don't equate the aberration that is the Trump administration with "regular" US administrations (and this is coming from a non US person).

rglullis 2 days ago|||

People in China live under totalitarian rule, that much is true.

But how free is the average North American, where getting sick can bring you and your family financial ruin? Where the "free press" is controlled by corporations who are also the main source of campaign funding for politicians? Where their urban spaces are designed to require you to have a car and promote complete atomized individuals?

amunozo 2 days ago||||

Regular US administrations that commited war crimes in half the world for decades. But apparently it only matters what they do in the US.

Revanche1367 1 day ago||||

Indexes made by Europeans and Americans to congratulate themselves are not reliable.

culi 1 day ago||

Exactly. Even if you don't buy into western biases, it's heavily reliant on subjective perception surveys. Hardly proof of anything

rhubarbtree 2 days ago||||

You’re right, for now, but I think trump will try to turn America into a dictatorship.

_blk 2 days ago|||

..you forgot to mention that any technology in China, foreign or domestic, can and will be used for and to the benefit of the -military- party.. But like someone posted: "not perfect" fits the bill.

Check out the Sean Ryan Show with Palmer Luckey on China and military tech.

Thlom 1 day ago||

Same goes for every country on earth?

IsTom 2 days ago||||

You can say the same about the US

hunter67 2 days ago|||

they compare it to fascist USA though

allan_s 2 days ago||

Ask a gay, a black or a Japanese how it feels living in China.

cromka 1 day ago||

Outside of gay people, the rest is your projection: they are homogenous society, racial problems are nonexistent. US is heavily heterogenous and despite that you segregated like a third of society at the time.

Ridiculous take.

falkenstein 2 days ago||||

america is a continent. let’s take back our vocabulary (fellow european here). the little orange man shows very well what i mean when he started giving names to the gulf of mexico.

barrenko 2 days ago||||

This is such a tired argument, and morally repugnant. Where is the UK in the race, where is the EU? Lets get of our asses and stop moralizing.

(China wiped out the entire EU industry through a "quiet" trade war since like the last 15 years, and we're not really talking about that aren't we...)

Cthulhu_ 2 days ago|||

Not so much a trade war as basic economic forces, and it's been going on for much longer than that. When infrastructure improves, companies and customers can look further to get their stuff done. If it's cheaper to do your industrial or manufacturing work abroad and have it transported to your country, that just happens.

The powers that be try to slow this down by banning imports outright (you can't for example import American chicken into Europe because of food safety laws), or high import taxes (Chinese EVs have a 50% import tax in Europe and the US to protect the local car manufacturers. Which is fair because the Chinese EV manufacturers are state-sponsored so their prices are unfair. Then again, western companies get billions in investor money to push the prices down).

HatchedLake721 2 days ago||||

UK has the people but not the electric grid/infrastructure to compete.

EU/France has Mistral.

whywhywhywhy 1 day ago||

France has mistral and the energy infrastructure to compete, the EU has nothing.

calgoo 2 days ago|||

You mean the west handed their industry to china over the last 15 years? Its not like the US is any better off in this. The EU is not a country, so you can't talk about it as if it was. Each country has their own companies and industries. There is AI in Europe, and its growing, however we might not be as "energetic" about destroying our countries to build giant data centers to serve our billionaire overlords. That does not mean that there is no investment, there is, including a bunch of American corporations like Amazon. But there is also a lot of corruption and bribing (lobbying - lets call it what it really is, no more whitewashing) going on around that too.

So again, stop referring to EU as a country, we are not, and it just annoys any Europeans as it comes of as "Americans who don't understand the world outside of the USA".

timmmk 2 days ago||||

Fellow countryman here. I came here to say the same thing

lifeisstillgood 2 days ago|||

As a different Brit I do not accept such moral relativism.

China’s governments actions are on a completely different level - for example:

“””

Since 2014, the government of the People's Republic of China has committed a series of ongoing human rights abuses against Uyghurs and other Turkic Muslim minorities in Xinjiang which has often been characterized as persecution or as genocide.

“”” https://en.wikipedia.org/wiki/Persecution_of_Uyghurs_in_Chin...

https://www.amnesty.org/en/location/asia-and-the-pacific/eas...

Yes Trump is clearly trying Totalitarianism in America, but it is orders of magnitude different from what is happening in China.

amunozo 2 days ago|||

Why do we ignore all the human right abuses the US perform abroad? Iraq, Afghanistan, now Iran, Gaza and Lebanon through Israel, support to Saudi Arabia (which would not exist without the US), El Salvador... And inside it's also horrible with its treatment to immigrant.

That should be at least comparable (if not worse) than what China is doing.

OCASMv2 2 days ago||

Yes, El Salvador is so evil for imprisoning dangerous criminals and protecting innocent lives.

samrus 1 day ago|||

This is how china tried to justify its genocide against uighers. Was theboutrage against that just politically motivated? Or do americans only care about ethnic cleansing when theyre not the ones doing it

amunozo 1 day ago|||

Not for imprisoning, but for imprisoning them in draconian conditions, without proper judgements, etc. Have you seen those prisons for fuck sake?

cedws 1 day ago||||

There’s little to no evidence of such “genocide”, but I can go on YouTube to watch videos of the US bombing civilians in the Middle East.

tw1984 1 day ago||||

It is just shocking to hear such stuff from someone in the UK.

phatfish 1 day ago||||

The US supports the genocide in Gaza, it supports the bombing of Lebanon. The US itself has now started (another) war and bombed Iran.

China is repressing the Uyghur and threatening Taiwan. I don't agree with these actions but is really "orders of magnitude" worse than the destruction the US facilitates in the Middle East?

With Trump they are now openly hostile to European democracies, and ICE and doing their best at repression within the US.

chronc6393 1 day ago||

> The US supports the genocide in Gaza, it supports the bombing of Lebanon. The US itself has now started (another) war and bombed Iran.

> With Trump they are now openly hostile to European democracies, and ICE and doing their best at repression within the US.

And what is Europe going to do about it?

Boycott ChatGPT and Claude? Ha.

Markoff 2 days ago||||

genocide right, that's why Uighurs were allowed to have more children than majority of Chinese Han population /facepalm

by your logic gentrification of neighborhoods with different people moving in is genocide as well

Btw. remind me when last tiem China bombed school and killed 150+ school girls as your friend US?

Or as Brit I hope you are proud about all the killing your country participated in in illegal invasion to Iraq based on fake news about WMD.

junnan 2 days ago|||

It's 2026 and people still believe this Uyghur genocide propaganda? In the meantime, Israel and the US have been killing people in the middle east for years, but china is "on a completely different level"?

dzonga 1 day ago|||

Jensen Huang said this in his recent interview - that China has the best/most engineers, it has the chip making ability, it's a good thing they wanna build on a Nvidia stack - but if you push them they will build on an all Chinese stack - but the interviewer was being a numb head who kept parroting the propaganda of Western tech supremacy

d3Xt3r 1 day ago|||

> Also, note that there's zero CUDA dependency.

So does this mean I can run this on AMD? And on a consumer 9000 series card?

randomgermanguy 1 day ago||

If you found a rare 9000 card with 200+ GB of VRAM, sure

TrackerFF 2 days ago|||

Let's see how long it takes before the big US AI companies start lobbying to outright ban use of Chinese AI, even the open source / local models. For "national security" reasons, of course.

chronc6393 1 day ago|||

> Let's see how long it takes before the big US AI companies start lobbying to outright ban use of Chinese AI, even the open source / local models. For "national security" reasons, of course.

Already do on EVs.

barnabee 2 days ago|||

Hopefully the US’ self imposed isolation will mean that when they do, they aren’t able to force the rest of the world to follow suit.

ibic 2 days ago|||

"Open Source" is the ultimate romance understood by software engineers.

frankdenbow 1 day ago|||

Jensen was saying this in that interview last week and the interviewer dismissed it.

khalic 1 day ago|||

Open weight and open source are not the same

SquareWheel 1 day ago||

This is a pretty banal comment at this point. Open source is the term used in the LLM community. It's common and understood. Nobody is going to release petabytes of copyrighted training data, so the distinction between open source vs weights is a rather pointless one.

stefan_ 1 day ago||

First you steal all the code, then you want to redefine the term? Is it never enough with you AI guys? Where's the humility, where's the good?

kitd 2 days ago|||

I can't find any info on what exactly is open sourced.

And in any case what does open source actually mean for an llm? It's not like you can look inside it to see what it's doing.

gommm 2 days ago||

For me open source means that the entire training data is open sourced as well as the code used for training it otherwise it's open weight. You can run it where you like but it's a black box. Nomic's models are good example of opensource.

adammarples 1 day ago||

Yes the weights are basically compiled code, compiled from the source data and the training code.

laurentiurad 1 day ago|||

not a full AI stack. Training still runs on NVIDIA chips.

sudo_cowsay 2 days ago|||

I sometimes wonder if there are any security risks with using Chinese LLMs. Is there?

dalemhurley 2 days ago|||

Theoretically yes. It is entirely possible to poison the training data for a supply chain attack against vibe coders. The trick would be to make it extremely specific for a high value target so it is not picked up by a wide range of people. You could also target a specific open source project that is used by another widely used product.

However there is so many factors involved beyond your control that it would not be a viable option compared to other possible security attacks.

mazurnification 2 days ago|||

But propaganda or non ethical marketing - why not? (That is bias toward pointing to certain provider(s)).

_blk 1 day ago|||

Would be interesting to hook up a much simpler LLM as fact checker to see when errors are introduced.

If I had to place a hidden target it'd probably be around RNGs or publicly exposed services..

oliwarner 2 days ago||||

If there is, couldn't they exist in any model?

I don't mean that flippantly. These things are dumped in the wild, used on common (largely) open source execution chains. If you find a software exploit, it's going to affect your population too.

Wet exploits are a bit harder to track. I'd assume there are plenty of biases based on training material but who knows if these models have a MKUltra training programme integrated into them?

rhubarbtree 2 days ago||||

Backdooring software at scale.

Spearphishing.

Building reliance and exploiting it, through state subsidies, dumping, and market manipulation.

Handicapping provision to the west for competitive advantage.

cassianoleal 2 days ago||||

What about LLMs from other origins? What makes them less risky?

eucyclos 2 days ago||||

From my experience, kinda the opposite? It's like Chinese software is... Harder to weaponize or hurt yourself on. Deepseek is definitely censored, but I've never caught it being dishonest in a sneaky way.

Hamuko 2 days ago||||

There must be. The executives at my company wouldn't have banned them all for no reason after all.

c0nstantien 2 days ago||||

[dead]

baal80spam 2 days ago|||

Is this a serious comment? It honestly reads like the last famous words.

Of course there are risks.

slekker 2 days ago|||

But remember to not ask about Taiwan!

tigrezno 2 days ago|||

you talk like there isn't censorship in american AIs, like Israel topics.

swingboy 1 day ago|||

Ask a Chinese model about Taiwan, get denied. Ask an American model about Israel, get citizenship revoked and deported.

unclejuan 2 days ago|||

To be fair I prefer the Chinese models censorship (yes, seriously) because if you ask certain topics they just don't answer instead of giving skewed answers.

spaceman_2020 1 day ago||||

I can't wait for Taiwan to peacefully reunify with the mainland so the west with its constant war waging won't even have this talking point

eunos 2 days ago||||

> China asks other country not to meddle with internal separatism > They also dont support separatism in my country

Understandable.

Markoff 1 day ago||||

pretty sure you can ask whatever you want and it will tell you official stance agreed by almost all countries in the world that Taiwan is part of China as it's recognized by your own country (I don't even know where are you from, but there is like 98% chance I'm right)

Lionga 2 days ago||||

Quit a bit better then made to bomb little girl schools in Iran.

spiderfarmer 2 days ago||||

Just ask it for a summary of the USA’s role in Iran, Gaza, Lebanon and its recent threats against Panama, Cuba and Greenland! It might be able to keep track.

teiferer 2 days ago|||

Does all this insane behavior from the US justify the Chinese censorship?

samrus 1 day ago||

Of course not. But its disengenuous to only mention one like the US is clearly th lesser of 2 evils

libertine 2 days ago|||

Are you implying that western models were manipulated to hide and distort those events, like they do with the Tiananmen Square event, and Taiwan?

spiderfarmer 2 days ago|||

Let's say I'm more outraged by the actual events.

vkou 2 days ago||||

Ask Gemini today if the United States is trying to destroy the nation of Iran, and it will feed you the (white-washed) party line, straight from the White House, with a bit of 'some people disagree' thrown in. No mention of America's threats of "Complete annihilation", "Killing a civlization", and all the rest.

> Summary: The U.S. is currently engaged in an active war aimed at dismantling the Iranian government and its military capabilities, but it distinguishes this from destroying the country or its people. However, the humanitarian impact—including civilian casualties from airstrikes and the domestic crackdown by Iranian security forces—has led many international observers to warn that the campaign risks long-term instability and "state collapse" rather than a simple transition of power.

It does do quite a bit better if you ask it about the genocide in Gaza, summarizing the case for it, and citing only token justifications from the guilty party.

As of April 2026, Gemini is... For very obvious reasons, highly biased towards cultural consensus. If your cultural consensus is strong on some really messed up things, that's the outcome that it's going to give you.

teiferer 2 days ago||

Isn't there a difference between the models output reflecting the mean of public discourse and the active adjustment of information by the government?

Irrespective of how close the outcomes are to the actual facts, those two things have a different quality, don't they?

vkou 2 days ago||

> Isn't there a difference between the models output reflecting the mean of public discourse and the active adjustment of information by the government?

Not as much a difference as you would wish, as mean of public discourse is very actively managed, to our collective detriment, by a very small group of powerful people, which often includes the government. It's the nature of mass media, and the incestuous relationship between power and reach.

They Thought They Were Free, and all that. By the time the 'mean of public discourse' centers on something incredibly stupid or awful, nobody can be arsed to figure out who planted that idea in our heads.

jfjdhdjdbdb 2 days ago|||

History is by definition his story.

Bayart 2 days ago||

It's not. It's an English pun on a Greek word, which roughly means "investigation".

man4 2 days ago|||

[dead]

trvz 2 days ago||

[flagged]

kkzz99 2 days ago||

This is so unbelievable racist and deranged.

trvz 1 day ago||

Rule of thumb is: half the statements out of capitalist states are false, all statements out of communist(-ish) ones are false. No racism, I’m perfectly willing to believe half of what comes out of Taiwan.

hodgehog11 2 days ago||

There are quite a few comments here about benchmark and coding performance. I would like to offer some opinions regarding its capacity for mathematics problems in an active research setting.

I have a collection of novel probability and statistics problems at the masters and PhD level with varying degrees of feasibility. My test suite involves running these problems through first (often with about 2-6 papers for context) and then requesting a rigorous proof as followup. Since the problems are pretty tough, there is no quantitative measure of performance here, I'm just judging based on how useful the output is toward outlining a solution that would hopefully become publishable.

Just prior to this model, Gemini led the pack, with GPT-5 as a close second. No other model came anywhere near these two (no, not even Claude). Gemini would sometimes have incredible insight for some of the harder problems (insightful guesses on relevant procedures are often most useful in research), but both of them tend to struggle with outlining a concrete proof in a single followup prompt. This DeepSeek V4 Pro with max thinking does remarkably well here. I'm not seeing the same level of insights in the first response as Gemini (closer to GPT-5), but it often gets much better in the followup, and the proofs can be _very_ impressive; nearly complete in several cases.

Given that both Gemini and DeepSeek also seem to lead on token performance, I'm guessing that might play a role in their capacity for these types of problems. It's probably more a matter of just how far they can get in a sensible computational budget.

Despite what the benchmarks seem to show, this feels like a huge step up for open-weight models. Bravo to the DeepSeek team!

ozgune 1 day ago||

I reviewed how DeepSeek V4-Pro, Kimi 2.6, Opus 4.6, and Opus 4.7 across the same AI benchmarks. All results are for Max editions, except for Kimi.

Summary: Opus 4.6 forms the baseline all three are trying to beat. DeepSeek V4-Pro roughly matches it across the board, Kimi K2.6 edges it on agentic/coding benchmarks, and Opus 4.7 surpasses it on nearly everything except web search.

DeepSeek V4-Pro Max shines in competitive coding benchmarks. However, it trails both Opus models on software engineering. Kimi K2.6 is remarkably competitive as an open-weight model. Its main weakness is in pure reasoning (GPQA, HMMT) where it trails Opus.

Speculation: The DeepSeek team wanted to come out with a model that surpassed proprietary ones. However, OpenAI dropped 5.4 and 5.5 and Anthropic released Opus 4.6 and 4.7. So they chose to just release V4 and iterate on it.

Basis for speculation? (i) The original reported timeline for the model was February. (ii) Their Hugging Face model card starts with "We present a preview version of DeepSeek-V4 series". (iii) V4 isn't multimodal yet (unlike the others) and their technical report states "We are also working on incorporating multimodal capabilities to our models."

lifty 2 days ago|||

Wondering how gpt 5.5 is doing in your test. Happy to hear that DeepSeek has good performance in your test, because my experience seems to correlate with yours, for the coding problems I am working on. Claude doesn't seem to be so good if you stray away from writing http handlers (the modern web app stack in its various incarnations).

hodgehog11 1 day ago||

Very cool to hear there is agreement with (probably quite challenging?) coding problems as well.

Just ran a couple of them through GPT 5.5, but this is a single attempt, so take any of this with a grain of salt. I'm on the Plus tier with memory off so each chat should have no memory of any other attempt (same goes for other models too).

It seems to be getting more of the impressive insights that Gemini got and doing so much faster, but I'm having a really hard time getting it to spit out a proper lengthy proof in a single prompt, as it loves its "summaries". For the random matrix theory problems, it also doesn't seem to adhere to the notation used in the documents I give it, which is a bit weird. My general impression at the moment is that it is probably on par with Gemini for the important stuff, and both are a bit better than DeepSeek.

I can't stress how much better these three models are than everything else though (at least in my type of math problems). Claude can't get anything nontrivial on any of the problems within ten (!!) minutes of thinking, so I have to shut it off before I run into usage limits. I have colleagues who love using Claude for tiny lemmas and things, so your mileage may vary, but it seems pretty bad at the hard stuff. Kimi and GLM are so vague as to be useless.

lifty 1 day ago||

My work is on a p2p database with quite weird constraints and complex and emergent interactions between peers. So it's more a system design problem than coding. Chatgpt 5.x has been helping me close the loop slowly while opus did help me initially a lot but later was missing many of the important details, leading to going in circles to some degree. Still remains to be seen if this whole endeavour will be successful with the current class of models.

nibbleyou 2 days ago||

Curious to know what kind of problems you are talking about here

hodgehog11 2 days ago||

I don't want to give away too much due to anonymity reasons, but the problems are generally in the following areas (in order from hardest to easiest):

- One problem on using quantum mechanics and C*-algebra techniques for non-Markovian stochastic processes. The interchange between the physics and probability languages often trips the models up, so pretty much everything tends to fail here.

- Three problems in random matrix theory and free probability; these require strong combinatorial skills and a good understanding of novel definitions, requiring multiple papers for context.

- One problem in saddle-point approximation; I've just recently put together a manuscript for this one with a masters student, so it isn't trivial either, but does not require as much insight.

- One problem pertaining to bounds on integral probability metrics for time-series modelling.

pm2r 2 days ago||

It would be wonderful to have a deeper insight, but I understand that you can disclose your identity (I understand that you work in applied research field, right ? )

hodgehog11 1 day ago||

Yes, I do mostly applied work, but I come from a background in pure probability so I sometimes dabble in the fundamental stuff when the mood strikes.

Happy to try to answer more specific questions if anyone has any, but yes, these are among my active research projects so there's only so much I can say.

throwa356262 2 days ago||

Seriously, why can't huge companies like OpenAI and Google produce documentation that is half this good??

https://api-docs.deepseek.com/guides/thinking_mode

No BS, just a concise description of exactly what I need to write my own agent.

u_sama 2 days ago||

I am very partial to Mistral's API docs https://docs.mistral.ai/api

vitorgrs 2 days ago|||

Meanwhile, they don't actually say which model you are running on Deepseek Chat website.

lykr0n 2 days ago|||

It's because they're optimizing for a different problem.

Western Models are optimizing to be used as an interchangeable product. Chinese models are being optimizing to be built upon.

Barbing 2 days ago|||

>Western Models are optimizing to be used as an interchangeable product.

But so much investment in their platforms, not just their APIs?

raincole 2 days ago|||

> Western Models are optimizing to be used as an interchangeable product

Why? It sounds like the stupidest idea ever. Interchangeability = no lock-in = no moot.

setr 2 days ago|||

First you clone the API of the winner, because you want to siphon users from its install-base and offer de-risked switch over cost.

Now that you’re winning, others start cloning your API to siphon your users.

Now that you’re losing, you start cloning the current winner, who is probably a clone of your clone.

Highly competitive markets tend to normalize, because lock-in is a cost you can’t charge and remain competitive. The customer holds power here, not the supplier.

Thats also why everyone is trying to build into the less competitive spaces, where they could potentially moat. Tooling, certs, specialized training data, etc

hunter67 2 days ago||||

Our (western) economic model forces competing individual companies to be profitable quickly. China can ignore DeepSeek losing money, because they know developing DeepSeek will help China. Not every institution needs to be profitable.

FuckButtons 2 days ago||||

yes, they want to win the same way they won more or less every other economic competition in the last 30 years, scale out, drop prices and asphyxiate the competition.

simonjgreen 2 days ago||||

Yeah, it’s an interesting one. I think inertia and expectations at this point? I don’t think the big labs anticipated how low the model switching costs would be and how quickly their leads would be eroded (by each other and the upstarts)

They are developing their moats with the platform tooling around it right now though. Look at Anthropic with Routines and OpenAI with Agents. Drop that capability in to a business with loose controls and suddenly you have a very sticky product with high switching costs. Meanwhile if you stick with purely the ‘chat’ use cases, even Cowork and scheduled tasks, you maintain portability.

tick_tock_tick 2 days ago||||

They are all racing to AGI. They aren't designing them to be interchangeable they just happen to be.

rglullis 2 days ago|||

No, they are not. If they were "racing to AGI" they would be working together. OpenAI would still be focused on being a non-profit. Anthropic wouldn't be blocking distillation on their models.

koe123 2 days ago|||

If by AGI you mean IPO, sure. I genuinely don't believe Dario nor Sam should be trusted at this point. Elon levels of overpromising and underdelivering.

djmips 1 day ago||

If by AGI you mean IPO - I automatically read that in Fireship's voice. XD

peepee1982 2 days ago|||

If you want other people to know whether you're being genuine or sarcastic, you'll have to put a bit more effort into your comments. Your comment just adds noise.

kennyloginz 2 days ago||

What da?

Alifatisk 2 days ago|||

You might enjoy Z.ais api docs aswell

kubb 2 days ago||

Western orgs have been captured by Silicon Valley style patrimonialism, and aren’t based on merit anymore.

orbital-decay 2 days ago||

>we implement end-to-end, bitwise batch-invariant, and deterministic kernels with minimal performance overhead

Pretty cool, I think they're the first to guarantee determinism with the fixed seed or at the temperature 0. Google came close but never guaranteed it AFAIK. DeepSeek show their roots - it may not strictly be a SotA model, but there's a ton of low-level optimizations nobody else pays attention to.

chenzhekl 2 days ago||

It's interesting that they mentioned in the release notes:

"Limited by the capacity of high-end computational resources, the current throughput of the Pro model remains constrained. We expect its pricing to decrease significantly once the Ascend 950 has been deployed into production."

https://api-docs.deepseek.com/zh-cn/news/news260424#api-%E8%...

nsoonhui 1 day ago|

Sorry, but exactly where in the article that you linked contains the mention of " Ascend 950"?

chenzhekl 1 day ago||

it's in the footnote text of the first figure of the section the link points to, where "昇腾950" means "Ascend 950"

xingyi_dev 2 days ago||

Deepseek v4 is basically that quiet kid in the back of the class who never says a word but casually ruins the grading curve for everyone else on the final exam.

revolvingthrow 2 days ago||

> pricing "Pro" $3.48 / 1M output tokens vs $4.40

I’d like somebody to explain to me how the endless comments of "bleeding edge labs are subsidizing the inference at an insane rate" make sense in light of a humongous model like v4 pro being $4 per 1M. I’d bet even the subscriptions are profitable, much less the API prices.

edit: $1.74/M input $3.48/M output on OpenRouter

menzoic 2 days ago||

API prices may be profitable. Subscriptions may still be subsidized for power users. Free tiers almost certainly are. And frontier labs may be subsidizing overall business growth, training, product features, and peak capacity, even if a normal metered API call is profitable on marginal inference.

dannyw 2 days ago||

Research and training costs have to be amortized from somewhere; and labs are always training. I'm definitely keen for the financials when the two files for IPO though, it would be interesting to see; although I'm sure it won't be broken down much.

schneehertz 2 days ago|||

This price is high even because of the current shortage of inference cards available to DeepSeek; they claimed in their press release that once the Ascend 950 computing cards are launched in the second half of the year, the price of the Pro version will drop significantly

Bombthecat 2 days ago||

In six month deepseek won't be sota anymore und usage will be wayyyy down.

randomgermanguy 1 day ago|||

Only comparing on SOTA scores (ignoring price etc.) is like choosing your daily-driver by looking at who makes the fastest sports-car...

dnnddidiej 1 day ago||

Not really. SOTA vs non SOTA is "can I get my coding work actually done today" vs. "this can do customer support chat"

It is like car vs. kick scooter.

Palmik 1 day ago||||

Or there will be DSv4.1/2/3 ;)

man4 2 days ago||||

[dead]

Barbing 2 days ago|||

Well, if they distilled once…

m00x 2 days ago|||

They are profitable to opex costs, but not capex costs with the current depreciation schedules, though those are now edging higher than expected.

nl 2 days ago||

Amazingly, the current depreciation overestimates the retained value of GPUs.

In 2023, the depreciation schedule for H100s was 2 years, but they are still oversubscribed and generating signficant income.

Coreweve has upped their depreciation for GPUs to 6 years(!) now, which seems more realistic.

https://www.silicondata.com/blog/h100-rental-price-over-time

amunozo 2 days ago|||

I was thinking the same. How can it be than other providers can offer third-party open source models with roughly the similar quality like this, Kimi K2.6 or GLM 5.1 for 10 times less the price? How can it be that GPT 5.5 is suddenly twice the price as GPT 5.4 while being faster? I don't believe that it's a bigger, more expensive model to run, it's just they're starting to raise up the prices because they can and their product is good (which is honest as long as they're transparent with it). Honestly the movement about subscription costing the company 20 times more than we're paying is just a PR movement to justify the price hike.

peepee1982 2 days ago||

I'm pretty sure OpenAI and Anthropic are overpricing their token billed API usage mainly as an incentive to commit to get their subscriptions instead.

simonjgreen 2 days ago|||

Anthropic recently dropped all inclusive use from new enterprise subscriptions, your seat sub gets you a seat with no usage. All usage is then charged at API rates. It’s like a worst of both worlds!

peepee1982 2 days ago||

What's the point then? Special conditions for data retention/non-training policies?

simonjgreen 2 days ago||

SSO Tax is a large part of it, controls around plug-in marketplace, enforcement of config, observeability of spend. But it’s all pretty weak really for $20 a month.

And Microsoft are going the same route to moving Copilot Cowork over to a utilisation based billing model which is very unusual for their per seat products (I’m actually not sure I can ever remember that happening).

weird-eye-issue 2 days ago|||

The target audience for the APIs is third party apps which are not compatible with the subscriptions.

peepee1982 2 days ago||

True. I missed that.

raincole 2 days ago|||

Insert always has been meme.

But seriously, it just stems from the fact some people want AI to go away. If you set your conclusion first, you can very easily derive any premise. AI must go away -> AI must be a bad business -> AI must be losing money.

zarzavat 2 days ago||

Before the AI bubble that will burst any time now, there was the AI winter that would magically arrive before the models got good enough to rival humans.

mirzap 2 days ago|||

My thoughts exactly. I also believe that subscription services are profitable, and the talk about subsidies is just a way to extract higher profit margins from the API prices businesses pay.

Bombthecat 2 days ago||

Google stated a while back, that with tpus they are able to sell at cost / with profit.

Aka: everyone who uses Nvidia isn't selling at cost, because Nvidia is so expensive.

vitorgrs 2 days ago|||

And they actually say the prices will be "significantly" lower in second semester when Huawei 650 chips comes in.

Flavius 1 day ago|||

It's because investors in OpenAI/Anthropic want to get their money back in 10 months, not in 10 years.

jimmydoe 2 days ago|||

They’ve also announced Pro price will further drop 2H26 once they have more HUAWEI chips.

crazylogger 2 days ago|||

I haven't seen anyone claiming that API prices are subsidized.

At some point (from the very beginning till ~2025Q4) Claude Code's usage limit was so generous that you can get roughly $10~20 (API-price-equivalent) worth of usage out of a $20/mo Pro plan each day (2 * 5h window) - and for good reason, because LLM agentic coding is extremely token-heavy, people simply wouldn't return to Claude Code for the second time if provided usage wasn't generous or every prompt costs you $1. And then Codex started trying to poach Claude Code users by offering even greater limits and constantly resetting everyone's limit in recent months. The API price would have to be 30x operating cost to make this not a subsidy. That would be an extraordinary claim.

nl 2 days ago|||

The claim that APIs are subsidized is very common.

eg:

Token prices are significantly subsidized and anyone that does any serious work with AI can tell you this.

https://news.ycombinator.com/item?id=47684887

(the claims don't make any sense, but they are widely held)

vessenes 1 day ago||

I’ll note that it’s common and dangerous, in that there’s a generation of engineers who are at risk of leading each-other astray as to the economics and therefore probability distribution of outcomes for some firms that will massively impact their careers.

I think I understand the major reasons for this meme, but I find it really worrying; there were lots of incorrect ‘it’s a bubble’ conversations here in 2012-2015, but I don’t think they had the pervasive nature and “obvious” conclusion that a whole generation of engineering talent should just, you know, leave.

Meanwhile I am hearing rational economic modeling from the companies selling inference; Jensen, (a polished promoter, I grant you) says it really well — token value is increasing radically, in that new models -> better quality, and therefore revenues and utilization are increasing, and therefore contrary to the popular financial and techbro modeling of 2023, things like A100s still cost quite a lot whether hourly or to purchase. (!) Basically the economic value is so strong that it has actually radically extended the life of hardware.

I just hate to imagine like half of the world’s (or US’s) engineering talent quitting, spending ten years afraid, or wrongly convinced of some ‘inevitable’ market outcome. Feels like it will be bad for people’s personal lives, and bad for progress simultaneously.

dannyw 2 days ago|||

Yeah, subscriptions used to be extraordinarily generous. I miss those days, but the reinvigoration of open weight models is super exciting.

I'm still playing with the new Qwen3.6 35B and impressed, now DeepSeek v4 drops; with both base and instruction-tuned weights? There goes my weekend :P

masafej536 2 days ago|||

Point taken but there isnt any western providers there yet. Power is cheaper in china.

3uler 2 days ago|||

These models are open and there are tons of western providers offering it at comparable rates.

NitpickLawyer 2 days ago|||

As this is a new arch with tons of optimisations, it'll take some time for inference engines to support it properly, and we'll see more 3rd party providers offer it. Once that settles we'll have a median price for an optimised 1.6T model, and can "guesstimate" from there what the big labs can reasonably serve for the same price. But yeah, it's been said for a while that big labs are ok on API costs. The only unknown is if subscriptions were profitable or not. They've all been reducing the limits lately it seems.

ithkuil 2 days ago||

Is there evidence that frontier models at anthropic, openai or google or whatnot are not using comparable optimizations to draw down their coats and that their markup is just higher because they can?

casey2 2 days ago|||

It's the decades of performance doesn't matter SV/web culture. I'd be surprised if over 1% of OpenAI/Anthropic staff know how any non-toy computer system works.

dminik 2 days ago|||

I mean, not one "bleeding edge" lab has stated they are profitable. They don't publish financials aside from revenue. And in Anthropic's case, they fuck with pricing every week. Clearly something is wrong here.

sekai 2 days ago||

> I’d like somebody to explain to me how the endless comments of "bleeding edge labs are subsidizing the inference at an insane rate" make sense in light of a humongous model like v4 pro being $4 per 1M. I’d bet even the subscriptions are profitable, much less the API prices.

One answer - Chinese Communist Party. They are being subsidized by the state.

lbreakjai 1 day ago|||

When China does it it's communism. When companies in the west get massive tax cuts, rebates, incentives and subsidies, that's just supporting the captains of industry.

fblp 2 days ago||

There's something heartwarming about the developer docs being released before the flashy press release.

taurath 2 days ago||

Their audience is people who build stuff, techs audience is enterprise CEOs and politicians, and anyone else happy to hype up all the questionably timed releases and warnings of danger, white collar irrelevence, or promises of utopian paradise right before a funding round.

onchainintel 2 days ago|||

Insert obligatory "this is the way" Mando scene. Indeed!

necovek 2 days ago||

Where's the training data and training scripts since you are calling this open source?

Edit: it seems "open source" was edited out of the parent comment.

b65e8bee43c2ed0 2 days ago|||

doesn't it get tiring after a while? using the same (perceived) gotcha, over and over again, for three years now?

no one is ever going to release their training data because it contains every copyrighted work in existence. everyone, even the hecking-wholesome safety-first Anthropic, is using copyrighted data without permission to train their models. there you go.

necovek 2 days ago|||

There is an easy fix already in widespread use: "open weights".

It is very much a valuable thing already, no need to taint it with wrong promise.

Though I disagree about being used if it was indeed open source: I might not do it inside my home lab today, but at least Qwen and DeepSeek would use and build on what eg. Facebook was doing with Llama, and they might be pushing the open weights model frontier forward faster.

JumpCrisscross 1 day ago|||

> There is an easy fix already in widespread use: "open weights"

They're both correct given how the terms are actually used. We just have to deduce what's meant from context.

There was a moment, around when Llama was first being released, when the semantics hadn't yet set. The nutter wing of the FOSS community, to my memory, put forward a hard-line and unworkable definition of open source and seemed to reject open weights, too. So the definition got punted to the closest thing at hand, which was open weights with limited (unfortunately, not no) use restrictions. At this point, it's a personal preference that's at most polite to respect if you know your audience has one.

dannyw 2 days ago|||

Yeah, open weights is really good, especially when base models (not just the instruction tuned) weights are released like here.

Tepix 2 days ago||||

Nvidia did with Nemo.

niea_11 2 days ago||

And they got sued :

https://www.reuters.com/technology/nvidia-is-sued-by-authors...

fragmede 2 days ago|||

it's not a gotcha but people using words in ways others don't like.

a96 2 days ago||

It's not about likes, it's a flat out lie.

woctordho 2 days ago||||

They are exactly open source. The training data is the internet. Don't say it's on the internet. It IS the internet.

The training scripts are in Megatron and vLLM.

bl4ckneon 2 days ago||||

Aww yes, let me push a couple petabytes to my git repo for everyone to download...

necovek 2 days ago||

An easier thing would be to say "open weights", yes.

0-_-0 2 days ago|||

Weights are the source, training data is the compiler.

injidup 2 days ago||

You got it the wrong way round. It's more akin to.

1. Training data is the source. 2. Training is compilation/compression. 3. Weights are the compiled source akin to optimized assembly.

However it's an imperfect analogy on so many levels. Nitpick away.

mirekrusin 2 days ago||

It's dataset [0] released under some source available license or OSI license, ie. open dataset or open source dataset.

[0] https://news.ycombinator.com/item?id=47758408

sho 2 days ago||

So, this is the version that's able to serve inference from Huawei chips, although it was still trained on nVidia. So unless I'm very much mistaken this is the biggest and best model yet served on (sort of) readily-available chinese-native tech. Performance and stability will be interesting to see; openrouter currently saying about 1.12s and 30tps, which isn't wonderful but it's day one after all.

For reference, the huawei Ascend 950 that this thing runs on is supposed to be roughly comparable to nVidia's H100 from 2022. In other words, things are hotting up in the GPU war!

alpineman 2 days ago||

Can't see how NVIDA justifies its valuation/forward P/E ratio with these developments and on-device also becoming viable for 98% of people's needs when it comes to AI

aurareturn 2 days ago||

On-device is incredibly far away from being viable. A $20 ChatGPT subscription beats the hell out of the 8B model that a $1,000 computer can run.

Nvidia's forward PE ratio is only 20 for 2026. That's much lower than companies like Walmart and Costco. It's also growing nearly 100% YoY and has a $1 trillion backlog.

I think Nvidia is cheap.

littlestymaar 1 day ago|||

> On-device is incredibly far away from being viable. A $20 ChatGPT subscription beats the hell out of the 8B model that a $1,000 computer can run.

That's a very strange comment. Why would anyone run a dense model on a low-end computer? A 8B model is only going to make sense if you have a dGPU. And a Qwen3.6 or Gemma4 MoE aren't going to be “beaten the hell out” for most tasks especially if you use tools.

Finally, over the lifetime of your computer, your ChatGPT subscription is going to cost more than the cost of your reference computer! So the real question should be whether you're better off with a $1000 computer and a ChatGPT subscription or with a $2000 computer (assuming a conservative lifetime of 4 years for the computer).

midwain 1 day ago||||

This is an assessment of the moment. When rate of AI data center construction slows down, then P/E will start to grow. Or are we saying that the pace will only grow forever? There are already signs of a slowdown in construction.

dannyw 2 days ago|||

I do think Nvidia isn't that badly priced; they still have the dominance in training and the proven execution

Biggest risk I see is Nvidia having delays / bad luck with R&D / meh generations for long enough to depress their growth projections; and then everything gets revalued.

npodbielski 2 days ago||

Great! Can't wait to buy decent GPU for interference for <1k$

primaprashant 2 days ago|

While SWE-bench Verified is not a perfect benchmark for coding, AFAIK, this is the first open-weights model that has crossed the threshold of 80% score on this by scoring 80.6%.

Back in Nov 2025, Opus 4.5 (80.9%) was the first proprietary model to do so.

stared 2 days ago|

SWE-bench Verified is, at this point, contaminated https://openai.com/index/why-we-no-longer-evaluate-swe-bench...

So it os hard to tell how much of a model gain is due to skill, and how much - overfitting.

More comments...