DeepSeek-V4-Flash: https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash
DeepSeek-V4-Pro: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro
Back in Nov 2025, Opus 4.5 (80.9%) was the first proprietary model to do so.
So it is hard to tell how much of a model's gain is due to skill, and how much is due to overfitting.
And we got new base models, wonderful, truly wonderful
With `https://openrouter.ai/api/messages` and `model=deepseek/deepseek-v4-pro`, OpenRouter returns an error because their Anthropic-compat translator doesn't cover V4 yet. The Claude CLI dutifully surfaces that error as "model...does not exist".
This “no harm to me” meme about a foreign totalitarian government (with plenty of incentive to run influence ops on foreigners) hoovering up your data is just so mind-bogglingly naive.
Relatively speaking, DeepSeek is less untrustworthy than Grok.
When I try ChatGPT on current events from the White House it interprets them as strange hypotheticals rather than news, which is probably more a problem with DC than with GPT, but whatever.
Any specific examples?
That would be a great argument if the American models weren’t so heavily censored.
The Chinese model might dodge a question if I ask it about one or two specific Chinese cultural issues, but it also doesn’t moralize at me at every turn because I asked it to use a piece of security software.
Even for minor stuff like being addicted to drugs.
Looks pretty totalitarian to me.
Not quite the same.
Top link from a quick Google search:
https://en.wikipedia.org/wiki/Forced_organ_harvesting_from_F...
I think not.
Note: you can have this conversation criticizing the US on a US website. Try criticizing Xi or the CCP or calling him Pooh on a Chinese website.
You think China doesn’t imprison drug users?
China recently executed a low-level drug trafficker:
https://www.lemonde.fr/en/international/article/2026/04/05/c...
China is one of the top executioners. It executes more people than the rest of the world combined:
https://www.amnesty.org/en/latest/news/2017/04/china-must-co...
You think China is honest about political prisoners in Tibet and Xinjiang?
Criticize the US all you want but I can’t understand the whitewashing of a real totalitarian and genocidal state like mainland China.
But if we start nitpicking, the US also executes people all over the world without trial and has secret prisons worldwide where it puts people (guess what) without trial.
This is why I’ve been urging everyone I know to move away from American based services and providers. It’s slow but honest work.
yes, this is exactly what I'm saying.
China is a nation built for peace, while western nations are built for war.
The US is (mostly) protective of its citizens but (depending on administration) varyingly hostile to outsiders (immigrants, starting wars, etc.).
China is suppressive towards its own citizens, but has been largely peaceful with other countries and immigrants/visitors. (Granted, China has way fewer immigrants than the US, so this is not comparable).
But for folks on the opposite side of the world, the threats are more like "they're selling us electric cars and solar panels too cheaply" and the hypothetical "these super cheap CCTV cameras could be used for remote spying"
Feel free to go post similar on Chinese social media about their leaders.
By the way, even with the current administration, there's no question about which is the more authoritarian with their own citizens between China and the US. But if you aren't American, then the US government is much more of a threat than the Chinese.
China cannot make the life of an official in Europe miserable for investigating its atrocities against the Uighurs. Meanwhile, ICC judges are now forcibly unbanked and cannot work with American software because they investigated atrocities by a US ally in Gaza.
Sure. China and America are the same. Go try the social media experiment.
The executive branch?
Half the country would be locked up right now if they weren’t allowed to criticize Trump. Have you even paid attention to how much he’s shitted on, on a daily basis?
- Sam Altman & Worldcoin collecting everyone's eyeball scan
- Discord attempting to roll out worldwide age & ID verification
- LinkedIn collecting data on your web browser extensions
- WhatsApp collecting browser data via a local server running on device
It's sad to see how you have regulated yourselves into a position where Mistral is your only claim.
My country’s per capita income is $2500 a year. We can’t pay perpetual rent to OAI/Anthropic
This sounds a whole lot like potayto, potahto. I think the former argument is very much the correct one: China can undercut everyone and win, even at a loss. It happened with solar panels, steel, EVs, and seafood. It's a well-tested strategy, and it works really well despite the many flavors it comes in.
That being said, a job well done for the wrong reasons is still a job well done, so we should very much welcome these contributions. Maybe it's good to upset western big tech a bit so it remains competitive.
The decisions to mobilize a large rural base toward manufacturing and the central bank goals to keep the yuan cheap as a critical support of this project were absolutely national.
They were ultimately about bringing (or trying to bring) one of the most populous nations in the world out of extreme poverty; in particular the people of the country out of extreme poverty.
There are different policies in place today, and, crucially, bleeding-edge tech does not create much labor employment. BYD has some factories with roughly 2 employees per acre of robotic production, for instance. Datacenters are similar: the revenue could scale but the labor will not.
So, these are different times, different goals, different political and labor outcomes. Reasoning about what China “must do”, or has as a matter of “national policy” should start with a clear look at history and circumstance, or you’re likely to read things incorrectly.
Just this week they published a serious foundational library for LLMs https://github.com/deepseek-ai/TileKernels
Others worth mentioning:
https://github.com/deepseek-ai/DeepGEMM a competitive foundational library
https://github.com/deepseek-ai/Engram
https://github.com/deepseek-ai/DeepSeek-V3
https://github.com/deepseek-ai/DeepSeek-R1
https://github.com/deepseek-ai/DeepSeek-OCR-2
They have 33 repos and counting: https://github.com/orgs/deepseek-ai/repositories?type=all
And DeepSeek often comes up with very cool new approaches to AI that the rest then copy. Some of those who copied their tech have 10x or 100x the GPU training budget, and that's their moat to stay competitive.
The models from Chinese Big Tech and some of the small ones are open weights only (and allegedly benchmaxxed, see https://xcancel.com/N8Programs/status/2044408755790508113). Not the same.
> Open weight!
They clearly were implying it's not open source.
If we can't build the weights, then we don't have the source. I'm not entirely sure what an open-source model would even look like, but I am confident that these binary blobs that we are loading into llama.cpp and vllm aren't the equivalent of source code. We have absolutely no idea what sort of data went into them.
This is fine. It isn't slanderous. It is what we have, and it is awesome. Just because it is awesome doesn't make it open source.
So you can’t see what facts are pruned out, what biases were applied, etc. Even more importantly, you can’t make a slightly improved version.
This model is as open source as a Windows XP installation ISO.
Did you even read my comment?
they-might-take-a-bit-to-publish
And you think the US tech giants don't have any ulterior motives?!
I just want to remind you that this is happening at the same time as Anthropic A/B tests removal of Code from Pro Plan, and as OpenAI releases gpt-5.5 2x more expensive than gpt-5.4...
That’s a big if. It’s my experience that models that perform very well on benchmarks do not necessarily perform well in real life.
I’ve mostly started ignoring the benchmarks and run my own evals.
Well, yeah... Like Opus 4.5, 4.6, 4.7. Top of the benchmarks and yet it's a pile of crap at the moment and has been for months.
>Can the same be said about DeepSeek or any other open-source model provider performing distillation?
Open source models that distill from SoTA remind me of the story of Robin Hood: robbing the rich and giving to the poor. So to answer your question: yes, but it's better than the alternative where only a select few companies have SoTA models.
Oh, so people might be forced to give back the AI earnings? Should I be worried about the last year's capital gains on my portfolio?
Altman and Amodei are so mad about muhh model when they steal our data and pollute the Internet with slop.