Posted by htrp 1 day ago
When bots open the same board 1 million times per day, it is web scraping to train an AI model and that is OK. When someone asks 150 thousand questions, it is now distilling.
On an unrelated note, 150k queries feels like nothing?
Scrapers seem to account for 50% of total internet traffic.
Do they use a different methodology, since it is suddenly bad when scraping happens to them?
If not, then we should look at Alibaba, but we should look at Anthropic as well.
One could even wonder if they requested it, as a tactic to support their eventual IPO valuation.
Which is part of the problem of such an obviously-corrupt government: conspiracy theories are somewhat reasonable, as they keep getting validated.
Now they complain about their stuff getting "stolen"... lol.
It should all be open source with each gain shared and celebrated by all.