Top
Best
New

Posted by htrp 1 day ago

Anthropic says Alibaba illicitly extracted Claude AI model capabilities(www.reuters.com)
740 points | 1194 commentspage 14
rvba 10 hours ago|
Why is it called "distillation" when it seems to be "scraping"? (as in web scraping)

When bots open the same board 1 million times per day it is web scraping to train the AI model and OK. When someone asks 150 thousand questions it is now distilling.

On an unrleated note, 150k qieries feels like nothing?

Scrapers seem to account for 50% total internet trafic.

Do they use different methodology since it is suddenly bad when scraping happens to them?

bparsons 10 hours ago||
Where did Anthropic get all their training data? Funny that these companies care about the sanctity of IP all of a sudden.
freejazz 10 hours ago||
Why would it not be fair use?
nacozarina 7 hours ago||
Thieves complaining about theft and then gaslighting the victims; rich, but not smooth.
otikik 10 hours ago||
If it's out there on the internet it's ok to use it for training, independently of what the licenses or the TOS say.

If not, then we should look at Alibaba, but we should look at Anthropic as well.

Groxx 19 hours ago||
Perhaps this is related to the "Mythos is too dangerous and cannot be exported" movements? It'd be a fairly effective way to justify extreme actions in combating it.

One could even wonder if they requested it, as a tactic to support their eventual IPO valuation.

Which is part of the problem of such an obviously-corrupt government: conspiracy theories are somewhat reasonable, as they keep getting validated.

hirako2000 15 hours ago||
Karma is a thing.
dev1ycan 11 hours ago||
It's so funny how LLMs, which trained on millions of books, stolen (and even if they weren't, which they were, pirated from online pirate sites like libg and annas, they didn't have consent for the VAST majority of them), and stolen code, and stolen comments, etc.

Now complain about their stuff getting "stolen"... lol.

bilsbie 9 hours ago||
Can we finally just nope out of this closed model of AI development?

It should all be open source with each gain shared and celebrated by all.

yogthos 21 hours ago|
So let me get this straight, a company which built its whole business on ignoring IP is all of a sudden upset that somebody is not respecting their IP?
More comments...