Posted by T-A 20 hours ago
Nvidia Nemotron is also an open training source model, though a portion of its dataset remains proprietary.
Quoting lambda's comment:
> Note that the Nemotron models are generally stronger than Olmo and K2 Think V2 (according to Artificial Analysis benchmarks), and there is a lot of overlap in their datasets (lots of datasets are based on the same sources with different filtering, Olmo and K2 Think V2 both have used some Nemotron datasets).
> But yeah, Nemotron is a modern and fairly capable LLM, even the 122b is more capable than Deepseek R1 (a 671b model) on most benchmarks, and there's also the recently released 550b Ultra now.
In fact, if the frontier companies had taken their approach, it would have started much slower, but I think we would be far more advanced by 2035. Instead we have a majority of society that wants to see AI fail.
Do you talk to regular people? I work out of coffee shops routinely and literally like 90% of laptops have ChatGPT or Claude open. I was shocked at how many of my friends love the silliest of AI features (like Slack bot summarizing your day or your upcoming meetings), and a lot of decks, proposals, SOW's, etc. are (at least in part) generated with AI these days.
Young people who want to have secure jobs and who have any kind of experience with creativity see AI coming for their livelihoods and their joy simultaneously.
Middle-aged IT industry people like me, many of us are grudgingly learning it but believe it to be an obvious net negative the way it is currently deployed; it feels like we're automating all the wrong stuff.
I wouldn't go around talking as if people think AI is great. A solid proportion of the population would be tempted to push AI influencers under buses and trains.
Everyone wants at least some of the utility.
Few want to reach the end of the road we’re said to be walking. AI companies and the CEOs of megacorps. Everyone else is being sold a doomsday scenario (true or not).
https://www.pewresearch.org/chart/about-a-quarter-of-u-s-adu...
It all, of course, depends on what people mean by "AI" (I think the question basically defeats itself, it's akin to asking someone about "databases", given that it covers image generation, self driving cars, TikTok feeds, drug discovery and chatbots) but AI sentiment at large is more negative than positive.
https://www.pewresearch.org/chart/americans-predict-ais-impa...
So, depending on where you sit: Sure, most people will use "AI", meaning a chatbot (probably ChatGPT: https://www.pewresearch.org/chart/americans-report-using-cha...). 90% in coffee shop land, why not.
But that does not mean that they are not weary of the consequences, and are growing more weary. I think, predictably, the better situated you are and the more your direct livelihood is at stake. That's just the animal we are.
Does that mean that we should have slowed down? Matter of opinion. My take: Absolutely not. The people who need it the most around the world will have dramatically improved lives, because of access to better medical advice or information about institutions and systems, to start things and help them in their daily lives.
But this is one of those unique situations where wary (cautiousness, concernedness, preparedness, tinged with fear) and weary (exhaustion with a mental component) are overlapping into one horrible thing.
So I'm not correcting you because I think basically both are right: we are going through both of these at once because anxiety is what Scam and Wario in particular are selling.
I was at my daughter's football game, and another father from the club came up to me and asked if I were in IT and knew how AI worked. He then asked if I could help him setup an AI agent to generate passive income.
We're at the equivalent of December 2017 for crypto. Hang on to your hats!
Was it a two part question converted into one with a gate at the beginning, or was a general question about occupations and abilities?
I hate cars but I still drive to the office 1x / week because I have to.
Or is it just vibes?
https://gizmodo.com/people-hate-ai-even-more-than-they-hate-...
Was discussed just recently, and there are multiple articles and surveys on AI sentiment.
IsaacSim was (and might still be) the best robotic learning sim and I ran MLAgents.
It's always funny to see people tempted to call open-blobs/open-weights, which are literally shareware like WinRAR or Adobe PDF Viewer, open source, and then need to invent a new term for what is actually open source.
I empathize with this but curious what would make any other country a better safehaven for your data? I personally like the EU's approach to data safeguards, but are there other locales/data protections you have in mind that would keep your data "safe".
I purchase open model tokens for agent programming assistance, and I like lumo+ for everything else.
Another option is DuckDuckGo’s Duck.ai subscription, but I slightly prefer ProtonMail’s lumo+ packaging as a product.
How about deporting people without a hearing or opportunity to present evidence about their charges. And then violating the judges order to turn the planes around.
How about systematically ignoring judicial rulings.
How about detaining people based on the color of their skin and spoken language/accent.
How about violating the emoluments clause of the constitution by accepting a personal airplane.
How about sending your son in-law, who hasn’t been appointed to any office with the advice and consent of congress as required by the constitution.
How about refusing to seat elected congress members for reasons for months.
How about singling out companies like intel for targeted trade restrictions and then demanding equity in order to lift them.
What about threatening to delay or deny a merger of a media company unless your ally is allowed to buy them.
What about refusing to enforce the TikTok ban until you can arrange a buy out to an ally.
What about a formal market with a known price for pardons and commutations.
What about stating multiple wars without congressional approval.
What about creating a fake department named Doge that withholds funds apportioned by congress and breaks contracts that have explicit obligations for payment that results in more termination fees and losses than the savings. All without congressional approval.
How about threatening to withhold federal funds from states with governors of the opposing political party but not your own? Remember the president is supposed to execute the law congress passes not make law or arbitrarily enforce it based on their own political needs or values.
Not to detract from your general point about the US, your first point is something that's happened recently in Switzerland:
https://truthout.org/articles/swiss-police-arrest-deport-pal...
[1] https://www.bvger.ch/en/newsroom/media-releases/fedpol-must-...
There are always incidents in all democracies with millions of people, that contravene the expectations of rule of law and integrity of its systems.
The US has degenerated significantly in the past few years, to the point that when someone asks “can you give examples”, I expect a disingenuous ploy more than genuine ignorance. The list of breaches is so long, that listing it results in numbness and exhaustion of the mental muscles responsible for being aghast.
Compel you to reveal your secrets, including your passwords by threatening to arrest and detain you without legal proceedings for an unspecified period.
Deny your basic human rights, particularly at the borders, especially if you aren’t a citizen.
And more.
It is a commonly accepted "fact" right now, outside the US, that the US is not to be trusted (right now), due to some orange guy, and his mates, manipulating markets, running their mouths, doing all kinds of criminal and/or infantile shit.
I'd say there is quite a bit of evidence for this all around.
I think it’s valid to not trust the US with your data. But if the reason is some TDS “Orange Man Bad”, it’s you that’s acting infantile.
Ask intel, paramount, TikTok or anthropic if they feel law will be applied equally to all companies.
Ask the blue states that had fema funding withheld when it went to red states.
Ask black families that haven’t gotten reparations when Jan 6 rioters that beat and killed cops to over turn an election will get almost $2b in reparations and then had the Supreme Court throw out their votes in Louisiana in the middle of an election to overturn the voting rights act, redraw districts, overturn their own case law and the principle that judicial review shouldn’t happen too close to an election so they could redraw the districts.
Business leaders are sucking up to curry favor. That by definition isn’t the rule of law it’s the rule of dispensation. It’s the spoils system.
If you have a counter argument you’d better make it now or you will tip your hand.
Frankly, I'm surprised there's not more urgency on the part of Europeans to reduce dependence on US tech. I don't like it. I'm an American in tech. But, the US can't be trusted, at this time. And, given how irresponsible tech leadership has been, in kowtowing to Trump, I don't see how they can reasonably be trusted, either.
I invest in startups and companies at every stage are losing contracts in Europe specifically for this risk. I can’t say who but it’s a multi front trend.
I am also assembling the largest in home robotics training data set available which will be open source.
Want to help?
I was hoping the European AI companies and projects like Mistral and Apertus would, you know, do something good. But, their models trail not only US models, but Chinese models, including smaller ones, by a significant amount. I guess there's also the ethical component. Mistral is reportedly not plagiarizing like US companies, and isn't distilling US models like the Chines companies. Cheating gives one a leg up if there are no referees.
Anyway, I work for a robotics company, and I'm always interested in what's happening with open robotics stuff, including AI.
and really, the topic here is reducing a transgressive President from infringing tech activities elsewhere (used to be mainly about surveillance, but then trump happened).
They decided that spying on me in a commune in Hawaii, and then following me after to other public spaces was fine. I'm certain something was put in my food based on behavior I saw in communal meals, and I can't say I took video or photo evidence though I wish I did.
I'm of Pakistani descent, held a former secret clearance, and I did not break any oaths or violate any laws though the way I was treated was certainly how the above person described rule of law: our spy agencies for example operate completely without accountability and regularly commit atrocious behavior against US citizens beyond just me.
Let's say Gemini gets to AGI by tomorrow, will my Google account access, or Gemini apps access and data be blocked if I'm not a US citizen? (Anthropic did it with a 5% better model).
If US is classifying the model access based on citizenship, that's similar to treating it as a Defense capability.
You can already imagine Anthropic working with a bunch of shady brokers to "remedy" this situation.
This particular order wasn't actually about citizenship at all. It seems the administration simply believed restricting the order to non-citizens would make it easier to defend in court, but they made it knowing full well that the only way to implement it would be to completely shut off access for everyone.
Stallman was correct in the 80s and is correct now about libre software
From a practical perspective, I'm not sure any servers are safe anywhere...depending on who may want your data.
I'm surprised there isn't a lot more attention to encrypted, distributed, erasure-encoded stores.
> What most people miss IMO is that this is not a team who is doing this for the fourth time like virtually any other LLM provider and who could learn from its own past experiences. I bet if the team would do another model training they could get way better results at one fourth of the costs.
i doubt they are including a lot of training data labeled with the language.
"how to say X in language Y" is a different task from saying X in language Y
My last hope for soverign AI is from Chinese open models
If you want to mix models like this, check out https://github.com/deepbluedynamics/nemesis8
Going forward would be such open source, open data and open recipe models possibly someday even with the training being crowd sourced if not inference like the BitTorrent model.
Lastly, even Chinese models (GLM, Deepseek, MiMax) work really really good and any user would testify that they do not miss OpenAI/Anthropic/Gemini at all if they're using those Chinese models which is argument enough that with such models, no one is going to miss Chinese models as well.
The training data and the Apertus LLM may contain or generate information that directly or indirectly refers to an identifiable individual (Personal Data). You process Personal Data as independent controller in accordance with applicable data protection law. SNAI will regularly provide a file with hash values for download which you can apply as an output filter to your use of our Apertus LLM. The file reflects data protection deletion requests which have been addressed to SNAI as the developer of the Apertus LLM. It allows you to remove Personal Data contained in the model output. We strongly advise downloading and applying this output filter from SNAI every six months following the release of the model.
Also even after you do that, and start a chat, you currently get:
"JSON.parse: unexpected character at line 1 column 1 of the JSON data"
so it's not quite there yet.