Top
Best
New

Posted by htrp 5 hours ago

Anthropic says Alibaba illicitly extracted Claude AI model capabilities(www.reuters.com)
50 points | 80 comments
tristanj 41 minutes ago|
Here's what is happening:

Chinese resellers are selling Claude tokens at a 70-90% discount from API prices. They achieve this by reselling capacity from pooled Claude Max 5x accounts, payments fraud, and also reselling the model output & reasoning chains to various Chinese labs.

Claude and ChatGPT are both blocked in China. You need to use a VPN to access either, and you can't pay with a Chinese bank card. So most people who want access to Claude go via a reseller. It's the easiest and cheapest way to access Anthropic models in China.

Resellers have tens of thousands of bot accounts doing this. This is also why Anthropic introduced identity verification, to slow down the onslaught of bot accounts.

Here's one token reseller, they're offering Opus 4.8 at a 93% discount below official API rates: https://yunwu.ai/pricing?provider=Anthropic

This is one reason why Deepseek & GLM are priced so cheaply, they are competing with impossibly low token prices in China. They have to keep prices low, in order for people to use them.

I shared this story a few months back, but it never got any traction https://www.chinatalk.media/p/how-to-buy-cheap-claude-tokens...

xgstation 13 minutes ago||
> This is one reason why Deepseek & GLM are priced so cheaply, they are competing with impossibly low token prices in China. They have to keep prices low, in order for people to use them.

This one does not make sense to me at all.

Deepseek and GLM are openweights, even US inference provider are selling them at much cheaper price. The price is cheap because the model is more efficient.

tristanj 5 minutes ago||
DeepSeek permanently cut their V4-pro API prices by 75 percent because they were too expensive. Without the price cut, Deepseek V4-pro tokens would cost more than resold Opus 4.8 tokens.
gruez 33 minutes ago|||
>They achieve this by reselling capacity from pooled Claude Max 5x accounts, payments fraud, and also reselling the model output to various Chinese labs.

>Here's one token reseller, they're offering Opus 4.8 for a 93% discount below official API rates: https://yunwu.ai/pricing?keyword=claude

But is it cheaper than getting your own account? Otherwise this sounds like the "anthropic/openai are losing gazillions of dollars because they're selling $1k worth of tokens for $100" line that's commonly trotted out by AI bears.

tristanj 12 minutes ago|||
It's very difficult to make your own Anthropic account from China, Chinese bank cards are blocked by Anthropic, so you'd need to pay with a foreign bank card.

There's a similar Claude resale market going on in Russia.

spindump8930 26 minutes ago||||
> Claude and ChatGPT are both blocked in China

So it's presumably cheaper than attempting to spin up your own method of circumventing the blocks.

Mr_Xpes 17 minutes ago||
[dead]
weird-eye-issue 26 minutes ago|||
You can use it as an API unlike the subscription.
nonethewiser 37 minutes ago|||
Thats pretty crazy. This kind of thing jeopardizes Claude Max.
avaer 35 minutes ago||
If Anthropic is selling a dollar for less than a dollar, they are running a business that doesn't make sense. That's what jeopardizes Claude Max, not this.
walrus01 22 minutes ago|||
Plenty of things are intentionally run at a loss (for years!) to gain market share and quantity of ongoing recurring users, or with expectation of ROI later on. Multiple generations of the Xbox hardware have been sold at a loss with the expectation that customers will purchase 300, 400, 500 dollars worth of games, which are very high margin, over the lifespan they own the system.
avaer 13 minutes ago||
I get that. It works as long as nobody calls out the emperor for having no clothes.

It's similar to fractional banking, you gamble that people won't want their deposits all at once and pray for you're big enough for bailouts when they do.

It's still a business whose fundamentals don't make sense, you're just gambling you won't get found out.

gruez 31 minutes ago||||
But if it's intended to be used by one person, it seems like breaking the contract by sublicensing it out to dozens of other people. It's like buying a netflix subscription for $15, then sublicensing it on a per-hour basis to dozens of other people.
ralph84 28 minutes ago|||
Almost all consumer services have a built-in level of breakage that make them profitable. Mobile providers certainly wouldn't be able to offer unlimited calling if everyone was actually on the phone 24x7.
margalabargala 16 minutes ago||
Sure they would. Do you know how little bandwidth a phone call takes?

A voLTE call is like 40kbps. For every person on earth to be on the phone to another person would be 4 billion calls would be about 160tbps. Which is less than 10% of the Internet's capacity.

fwipsy 33 minutes ago|||
Hm! In this context, introducing ID verification may have been a significant silver lining to the order to take down Fable for Anthropic.

This also sheds a very different light on people saying that competitive open-source models are undermining frontier labs' business model.

epsteingpt 38 minutes ago||
How are they 'streaming' the responses and 'pooling' the tokens?

Do they have MacBooks in the US that run the queries and stream the outputs back to China?

paxys 36 minutes ago|||
Why do you need macbooks? Just rent servers from any hosting provider.
walrus01 34 minutes ago||
Not going to work for very long or at any scale coming from datacenter/hosting provider IPs. Google "residential proxies for sale" for the tip of an iceberg of how they snowshoe the traffic.
paxys 31 minutes ago||
As long as you stick to a single unique IP per account it isn't going to get flagged.
walrus01 29 minutes ago||
Respectfully, no, that's not how it works. You think the people running anti-fraud and anti-bot measures don't have tools that know the specific ipv4 and ipv6 CIDR ranges of every ASN that they categorize as hosting/colo providers?

And that's just as a basic first effort reject measure to prevent automation tools from using things designed for human-interactive use only.

Go try to do many of these things from Cogent IP space and see how long your project lasts.

teravor 36 minutes ago||||

    > Do they have MacBooks in the US that run the queries and stream the outputs back to China?
why would anyone do that? you do realize the laptop farm case was work computers?

the answer to your question is containers/VMs + residential proxies

globalnode 33 minutes ago||
that explains why theyre blocking me. i have privacy controls up high and they must think im a chinese residential proxy bot
bagels 37 minutes ago|||
They probably asked claude how to do it.
walrus01 48 minutes ago||
Reminds me a bit of the anecdote of Steve Jobs complaining about people ripping off the Mac GUI, in the mid to late 1980s, when he gave no public acknowledgement to the work done by Xerox on the Alto and Star operating system.

"you're trying to rip off what I've already ripped off!"

Crawl the whole Internet to build a gargantuan sized LLM and then complain you're being copied...

breput 44 minutes ago||
I think you meant a quote attributed to Bill Gates:

"Well, Steve, I think there's more than one way of looking at it. I think it's more like we both had this rich neighbor named Xerox and I broke into his house to steal the TV set and found out that you had already stolen it."

walrus01 42 minutes ago|||
Yes, I think the Gates quote was a response to repeated and aggressive complaints originating from Jobs (to anyone who would listen) that he had been ripped off.
jakebasile 28 minutes ago|||
I don't know if that's a real quote from Gates, but I do know it was in Pirates of Silicon Valley.
seanmcdirmid 41 minutes ago|||
Apple gave Xerox the right to buy $1 million of pre-IPO stock before the meeting took place.
taneq 43 minutes ago|||
“You’re trying to kidnap what I’ve rightfully stolen!”
nonethewiser 39 minutes ago||
You can’t just equivocate crawling websites with building bleeding edge LLMs what the fuck
bloppe 30 minutes ago|||
The equivocation is crawling website <-> crawling LLM responses.

Both Anthropic and Alibaba are trying to build bleeding edge LLMs. That part is the same. The way they source their data is slightly different, but they would both argue it constitutes fair use under Copyright law.

paxys 33 minutes ago||||
The websites, music, movies, books, photos, art that they stole didn't appear out of thin air. The amount of time and effort people have collectively poured into creating these works throughout history far, far surpasses Anthropic's own effort of converting them into model weights.
walrus01 38 minutes ago||||
"Your extremely efficient multi petabyte internet content suction machine is ripping off my extremely efficient multi petabyte internet content suction machine"

Sucking down petabytes of peoples' copyrighted content that they never granted a specific license to you to use seems to be an unavoidable and default part of the process of building any huge LLM.

nonethewiser 35 minutes ago||
So why was there crawling in 1998 but no LLMs?
epsteingpt 35 minutes ago|||
It's not really equivocation in this instance. This feels like a 'bad faith' comment. We can do better.

LLM's literally wouldn't work without the sum total of knowledge (in the forms of books and other copyrighted content) being used as 'training data' for these LLMs.

The 'bleeding edge' LLMs required many things, but: 1 Tech innovation ('attention') 2 Lots of compute 3 Data 4 Pre + post training

#4 doesn't happen without #3.

It's pretty obvious at this point that the major providers have stolen vast amounts of #3 - they have paid nearly 0 of the creators.

We can argue about the impact (I'd lean net good) vs. the cost. But arguing there isn't a cost is a bit silly.

nonethewiser 34 minutes ago||
All of this supports the fact that models arent essentially just web crawling
margalabargala 7 minutes ago||
Sure, but alibaba is still building an LLM. The scraping of responses and the scraping of websites occupy the same location in the stack of each. It's very comparable.
drillsteps5 5 hours ago||
I'm looking forward to the trial where Anthropic will have to disclose sources of their training data, and then explain why they are entitled to charging customers for using regurgitated training data but Alibaba which trains their models on Anthropic's models are not.

Should be fun.

Edit: clarification

conception 52 minutes ago||
They already did and paid 1.5B https://authorsguild.org/advocacy/artificial-intelligence/wh...
ninefathom 57 minutes ago|||
While I love the sentiment, I feel like the odds of this actually ever reaching a trial are low, given the international positioning of the parties, and the... um... complex relationships involved.

Anthropic's actions seem performative. Others have already speculated on the likely audience(s).

appplication 56 minutes ago||
Being logically consistent isn’t as profitable as being aggressive and loud.
amazingamazing 46 minutes ago||
Distillation is fundamentally impossible to protect against. All you can do is slow them down. Change my view.

Eventually these Chinese companies will release some extension like Honey, which will sit on top real, non-Chinese clients and send everything to China anyway.

It's over.

lebovic 22 minutes ago||
It's too late to prevent distillation of some capabilities, like finding writing code or finding vulnerabilities [1].

But an AI lab can continue to produce immense economic value without releasing the model publicly for possible distillation. For example, it could use a future model solely in-house to develop therapeutics.

Hopefully there's a future where others can access frontier models, but it's not neccessary if preventing proliferation through distillation is considered more important.

[1]: See the notes on distillation in https://dualuse.dev/posts/export-controls-on-fable

nonethewiser 23 minutes ago|||
Distilled models are necessarily behind so long as models are progressing. Models are progressing. Maybe it will be over some time in the future.

And Berkeley’s “False Promise of Imitating Proprietary LLMs” found imitation closes the style gap fast but there is a large capability gap.

https://arxiv.org/abs/2305.15717

lebovic 15 minutes ago||
Curiously, this isn't always true.

For example, GLM 5.1 is more capable at pentesting than the model from which it is alleged to have been distilled [1].

Intuitively, this makes some sense: you can "distill" from multiple frontier models, and you can further post-train the distilled model. But I'm not sure exactly what happened with GLM 5.1.

[1]: https://dualuse.dev/posts/chinese-models-are-sometimes-bette...

nonethewiser 29 minutes ago|||
Im not so sure because we only seem to see distillation from China. What’s preventing tech companies from the UK, Germany, etc. from distilling Claude, GPT, etc. Do they simply lack the ability to?

Point being there may be no technical solution but there may be a political one (theoretically).

HaloZero 36 minutes ago|||
Doesn’t that require them to register an account using the browsers they’ve compromised? If anthropic adds identity verification won’t that cut that down. Maybe it will let them use Gemini inside of chrome
amazingamazing 26 minutes ago||
No, they could easily buy legitimate, already registered accounts and use VPNs.
seany 39 minutes ago|||
I can't even come up with a reason to find it wrong.
IncreasePosts 27 minutes ago||
I personally bristle at the corporate espionage and IP theft that China has undertaken the last few decades. I can't help but respond here whenever anyone brings up the inane comparison to Samuel Slater.

But with this, I don't have an issue. There is no theft since what is being used is the exact product that is being delivered. Yes, it's breaking the ToS, but ToS are generally bullshit. Anthropic surely broke thousands of ToS or other legal terms while it was scraping for content to train on. Which is why they had to pay $1.5B

redwood 30 minutes ago||
One simplistic way to describe distillation would be to try everything imaginable and cache the response. But trying everything imaginable is hardly trivial
0xbadcafebee 50 minutes ago||
There's two basic kinds of distillation: 1) the massive [and dumb] method where you ask a question and use the answer as reinforcement (Black Box), and 2) more targeted distillation where you use one model to directly inform/train/guide another model (RLAIF).

The latter is basically fine-tuning the model with direction from another model. Thousands of businesses do this every day to fine-tune. This is almost certainly what the Chinese labs are doing, since it has a much better effect on the end result than just getting simple answers to simple questions.

These complaints of distillation are inflating the problem to make it sound worse than it is, because they want the USG to block/ban Chinese model providers as protectionism. They have already called for more export controls on chips (which is funny because DeepSeek v4 was designed to run on Huawei chips and now the other Chinese providers are following suit). But they can't come right out and say that, so their claim is that they're asking for more export controls because distilled models might not be as safe as their own. But if you show them a jailbreak of their model that bypasses their safety, they'll tell you that any model can eventually be jailbroken so don't worry about safety.

Mr_Xpes 23 minutes ago||
Classic example of why better API key management and abuse-resistant proxy layers matter... AgentKey-style tools help mitigate exactly this kind of large-scale credential abuse and distillation attempts
zakkl 5 hours ago||
It sounds like Anthropic is eagerly trying to show to USG that they are willing to heavily monitor ‘foreign adversaries’ on their platforms.

This combined with no implementation of KYC makes it seem like they want to find a middle ground with Fable where its off of export controls but they promise to prevent China and specific others from using.

ninefathom 53 minutes ago||
This seems to me like a stab in the right direction.

Obviously their actions are going to be fiscally motivated at the root, but sussing out how they intend the precise dynamics to play out is more nuanced.

Thinking of this as an effort to woo the defense hawks cuts a very clear path.

verdverm 44 minutes ago||
This is not the first time it happened. What have they done to improve the situation? I suspect it more a cat & mouse game, with a lot more cats playing.
paxys 25 minutes ago||
Warn everyone that your models are so good they will wreck cybersecurity.

Complain/brag that chinese firms are illegally using the models and bypassing export controls.

Be surprised when your model gets banned by the government.

thadk 33 minutes ago||
Does anyone have hints on what kinds of prompts are most used for a distillation like this—SWE-Bench sorts of things?

Is reconstructing the compressed knowledge in the model like reconstructing a lossy JPG or MP3 a reasonable analogy?

BigTTYGothGF 38 minutes ago|
If you're an AI booster surely you'd think this was a good thing as it means more models are available in more places to more people more easily. I'm exactly the opposite, and I think this is a good thing because I want Anthropic to suffer.
rikima_ 16 minutes ago||
so it’s a good thing whichever way you look at it
nonethewiser 22 minutes ago||
That doesnt follow.
BigTTYGothGF 19 minutes ago||
Which part?
More comments...