I’ve joined Anthropic

Posted by dmarcos 13 hours ago

1238 points | 515 commentspage 2

aykutseker 10 hours ago|

the glorified marketer framing in this thread is missing the bigger signal. karpathy publicly pausing eureka labs to join anthropic is an ai founder of his caliber effectively conceding that verticals get eaten by frontier upgrades.for everyone here building on top of foundation, that's the actual news

aabhay 10 hours ago||

How serious was Eureka labs anyway? It seemed like essentially a banner for him messing around with content creation

behnamoh 9 hours ago||

people often found businesses to write off expensive purchases. my friend has a "company" which does nothing but he wrote off a $5000 MBP for this business expenses...

deadbabe 9 hours ago||

Get him audited.

achierius 8 hours ago|||

"that's the actual news", "the bigger signal", etc. -- this has a lot of the hallmarks of AI generated text but with overt stylistic simplification layered on top (nocaps, some weird spacing)

trollbridge 7 hours ago||

I mean, who hasn't put "stop using emdashes" into their prompt yet?

jonnyasmar 4 hours ago||

[flagged]

dang 28 minutes ago||

Could you please not post generated comments to HN? It's not allowed here. See https://news.ycombinator.com/newsguidelines.html#generated and https://news.ycombinator.com/item?id=47340079.

We ban accounts that do this and I don't want to ban you, so please write everything that you post to HN by hand.

Of course, it's impossible to know for sure what was LLM processed or not, but we're getting complaints about some of your posts and, upon inspection, the complaints seem justified.

markerbrod 13 hours ago||

I wonder what will happen with EurekaLabs now. I checked their X account, but the posts are now restricted. However, the background picture... that old AI-generated image feels surprisingly cringe (https://x.com/EurekaLabsAI/header_photo), incredible how much GenAI has improved since that image was created.

bilsbie 13 hours ago||

He should have done his own lab. He seems like someone capable of it and might bring some unique ideas.

qq66 12 hours ago||

If you don't actually have the desire to build, lead, and manage a large organization, this is a terrible idea for technical geniuses. A guy like him will instantly raise $1 billion which means hiring dozens of people, which means tons of interviews, management, performance review, planning, board meetings, etc etc.

It's good that there are avenues today for people to make tens or hundreds of $m in salaried positions at companies so that they don't have to do that stuff to get paid their value if they don't genuinely want to.

gdiamos 6 hours ago||

I’m seeing founders being encouraged to run their business with AI and cut out the etc etc

dalyons 1 hour ago||

Sure, that’s capitals’ dream but how does that actually work out in practice

Aboutplants 13 hours ago|||

Two years ago I’d agree, now he probably wants access to the immense capacity they have where if he were to start a lab from zero now, the ramp up to frontier pushing would require a lot more time. I don’t he needs the money as it is, and wherever he were to go would certainly make it worthwhile financially. Some people may just be cool with a couple hundred million dollars in their lifetime

JumpCrisscross 13 hours ago|||

> He should have done his own lab

Which raises the question: what can he do at Anthropic that he couldn't on his own?

TrackerFF 13 hours ago|||

Seems to me that you need incredible amounts of money to be competitive in the frontier model arena. I don't know how much money Karpathy has to spend, but I'd imagine that the money needed would almost certainly mean investors with deep pockets.

And then there's the uncertainty, will the AI "wars" be some winner-takes-all situation? Will the smaller labs eventually be acquired by the bigger ones, will they simply wash away if there's a crash?

I don't know. If you can land some exceptional gig at the big firms, maybe the financials are good enough to not start your own lab. Minimizing risk, and all that.

EDIT: Assuming such a startup would focus on frontier models.

JumpCrisscross 13 hours ago||

> you need incredible amounts of money to be competitive in the frontier model arena

This is my assumption.

> there's the uncertainty, will the AI "wars" be some winner-takes-all situation? Will the smaller labs eventually be acquired by the bigger ones, will they simply wash away if there's a crash?

He's Andrej Karpathy. He could wait to let the winner surface. Obviously better to get in with the winner earlier. But worse to get on the wrong team versus on the right team late.

shuckles 12 hours ago||||

He can be at the frontier while just having a regular job. Every other option is a lot more work.

skywhopper 13 hours ago||||

Make a lot of money.

UltraSane 12 hours ago|||

Access to a million GPUs?

gk1 13 hours ago|||

It’s not enough to have unique ideas. You need capital, compute, people, distribution, customers… There’s huge appeal to joining a place that has all those things and lets you pursue your unique ideas without worrying about all that.

amunozo 13 hours ago||

I'm pretty sure Karpathy can have billions of capital if he wanted to.

conductr 10 hours ago||

Mo money, mo problems. Just let the dude work, he’s not starving and he’s probably enjoying his life not completely wrapped up in the stress that running a company in this market must be.

slashdave 7 hours ago||

I am not entirely sure you understand how expensive it is to train these models

travisporter 7 hours ago||

Sir, please make it easy to break the twitter/x moat. Why don't we have an app that just posts to all the socials like bluesky/mastodon/threads etc

jimmydoe 6 hours ago||

Money ppl are on x, hence founders, hence the rest.

rdme 7 hours ago||

might want to check https://github.com/alexandru/social-publish

1970-01-01 11 hours ago||

He moves around quite a bit. Less than 2 years on average if you take away the longest and shortest jobs. It feels like this is just a celebrity hire to help raise IPO value, and then he'll move again when the tech hits another real-world scaling wall. Expect another short stint (stunt) with Anthropic.

renecito 10 hours ago||

rest & vest if he makes it to IPO

Yajirobe 10 hours ago||

I wonder what's his net worth - probably 100M+?

modeless 6 hours ago|||

I have to imagine that Anthropic is giving him an equity grant worth $1B or more in their upcoming IPO. And having him will increase their market cap more than that, if for no other reason than that people trust him as a judge of who is winning the AI race. So it's already worth it even before he does any work.

seattle_spring 9 hours ago||||

didn't FB grant someone else $1B over 4 years or something for some AI lead role? Wouldn't be surprised if this guy is getting similar offers, which could explode even further with the stratospheric valuations of top AI companies.

United857 8 hours ago||||

He’s a founder of OpenAI —- likely a billionaire if he held on to his share

trollbridge 7 hours ago||

It's still quite interesting to me how a founder of a nonprofit can become a billionaire by having shares in it.

nikcub 6 hours ago||

Because they later founded a for-profit to raise the capital required in the AI talent and compute race.

blazespin 2 hours ago|||

yes, pure engagement farmer. Marketing yourself is a big part of the biz these days. Thanks, social media

Rover222 10 hours ago|||

5 years at Tesla is an eternity in the tech world.

And 2 years is probably pretty average for the whole tech industry.

volkk 10 hours ago|||

> And 2 years is probably pretty average for the whole tech industry.

maybe for a fungible CRUD engineer. I think Karpathy is in a different league and I'm certainly surprised to hear this fact. I would expect someone like him to sit within a certain lab for a long time

cameldrv 10 hours ago|||

He's an extraordinarily bright guy. He can get a lot more done in two years than most people, and he can get up to speed with a new organization and a new task and be productive much faster than most people.

My impression with no inside knowledge, but understanding what Elon companies are like, is that he was assigned essentially an impossible task at Tesla and tried his very best, but it could not be done, and he semi-burned out. It makes sense for him to be getting back on the horse now.

The Elon approach to management as I see it is to assign what normally would be totally unreasonable goals to a small group of extremely bright people, and they work their asses off and somehow find a way. Sometimes this works, and sometimes it doesn't. If it works and the impossible was in fact, just barely possible, you dominate the market, everyone gets rich, and the people see it as the most exciting, intense, and rewarding part of their career. If it doesn't, they get depressed, divorced, and looking for other work. The Elon magic is threading the needle closely enough that a lot of the seemingly impossible things are in fact possible with enough hard work and brainpower, but although Elon is extremely good at this, the nature of the thing is that you can't predict which side you'll wind up on fully accurately.

andrewrn 9 hours ago||

And in the case where a team gets overworked, there are legions more fresh and bright engineers to burn.

redsocksfan45 9 hours ago||

[dead]

staticassertion 10 hours ago||||

That seems like the opposite. Why would someone with high market value stay in one place? 2 years is basically optimal - you vest 50%, maybe collect a promotion, do some good work and learn a lot, and then get to move on for another solid bump/ promotion and a new set of stocks.

I expect the people with low market value to be the ones sticking around labs for long periods of time, they don't have the option to move and they aren't getting poached.

jackling 8 hours ago||

It's incredibly hard to do good, novel work in 2 years for engineering. You'll likely not learn much either.

worik 9 hours ago||||

> And 2 years is probably pretty average for the whole tech industry.

Yes, and it is a problem

> maybe for a fungible CRUD engineer

And there's the cause.

We're in a meat grinder, and there is no $100M payout in sight for most of us

Rover222 7 hours ago|||

I mean just because OP wanted to "ignore" that he was at Tesla for 5 years, he was... still at Tesla for 5 years.

prerok 10 hours ago||||

Depends on the country, I guess. In Europe, it would definitely not be the norm and I would definitely call previous employers if it was several 2 year stints.

jstummbillig 10 hours ago|||

2-3 years is pretty average tenure inside the EU tech sector for the past few years [1], but regardless I don't know what that tells us, given that nothing else about this is average. The sample size of Andrej-sized talent in an ongoing tech revolution of epic proportions is just very small.

[1] https://ravio.com/blog/employee-tenure-trends

FabCH 10 hours ago||||

While I absolutely confirm that everything you said is true, it’s interesting that that call would be illegal in many European countries. And in many more you would at best get a „I can confirm this person worked on this position in this timeframe“

prerok 10 hours ago||

Hmm, interesting point, I did not know that. But, aside from that, while they cannot tell you that the candidate would not be ok, I did get hints on what to look out for.

I mean, you always have to take the previous employers' statements with a grain of salt, but if they say they really employed for just that project, it's also good info.

dukeyukey 9 hours ago||||

Europe is not a monolith. Lots of short stints is not unusual in London.

prerok 9 hours ago|||

At startups? Sure. Several stints at non-startups? Well, how much of that time was spent learning the domain? Is that knowledge transferrable (probably not because of non-competition clause)? Why were you not happy at the previous employer?

I am not saying any of these don't have valid answers. What I am saying is that we would prefer juniors that are commited and do the hard work when the work gets hard. And, at least where I work now, this gets recognized, and they become seniors in time.

worik 9 hours ago||||

London has not been part of Europe for a decade now. They locked themselves out in a bout of insanity

maleldil 7 hours ago||

Europe and the European Union are not the same thing. The UK is definitely European.

jen20 9 hours ago|||

Indeed it used to be the norm since basically everyone worth hiring was a contractor prior to the IR35 debacle.

barrenko 10 hours ago||||

And we have the stock market to show for it.

prerok 9 hours ago||

I would not say that this is because of this point. It's because investors in Europe are more conservative. Employees are as well, it's true, so it's strange to have someone out of the norm. It's not a red flag per se, but it's a thing to be evaluated.

toephu2 10 hours ago||||

Andrej Karpathy is from Europe (Slovakia).

prerok 9 hours ago||

His point of origin is immaterial to my point. I was not commenting specifically for Mr. Karpathy. My point was generic.

sneak 10 hours ago||||

Europe’s labor market is sadly still mostly out of touch with how startups work. It’s stuck in the last century. I’m not sure if this is due to tradition, or due to the fact that startups are much harder to start in Europe in general, so people on both sides of the hiring process have less experience with it.

Two years is more than long enough to join a startup, build 3 things, and see that your equity is never going to be worth anything, and find a new job. This isn’t anomalous or weird.

dukeyukey 9 hours ago||

Europe is not a monolith and I don't know why people think it is. 2 years stints are not unusual at all in London.

sneak 9 hours ago|||

London is economically, politically, geographically, and culturally fairly distinct from continental Europe.

holoduke 9 hours ago|||

It's about equity worth nothing jn Europe as an employee. Europe is a bad place for employees to join a startup. Lots of time people are attracted with shares/stars whatever. Only to find out they get nothing or are taxed to hell. There is a reason why Europe is failing.

gambiting 10 hours ago|||

If you're really in Europe then I'm sure you know that calling previous employers is completely pointless, the best you'll get is "yes this person has worked here before".

And I work in games and 2-3 years at each company is pretty normal, with some exceptions people just finish a project and then move(or are let go, unfortunately). YMMV of course.

prerok 10 hours ago|||

Depends, but definitely not pointless. Though, I do have the benefit of working in a small country, so chances are that I will know of the company and perchance know folks that work there. Even if not, employers will still see fit to help each other, at least if they are not direct competitors.

johnnyanmac 9 hours ago|||

> I work in games and 2-3 years at each company is pretty normal, with some exceptions people just finish a project and then move(or are let go, unfortunately). YMMV of course.

Yeah, being laid off every 2-3 years is a lot different from job hopping and shows exactly why the games industry is in its own little pocket of screwed in this market. Especially with games taking 3-5+ years to be made. How do you keep institutional knowledge when you kick it all out and basically start from scratch every cycle.

-sincerely, another game dev

StilesCrisis 10 hours ago|||

The folks I know who bounce that often are generally mis-hires:

- barely qualified, leaving to avoid getting PIP'ed

- overqualified/under-leveled, and moving is faster than getting promoted

Rover222 7 hours ago||

I think it happens on both ends of the spectrum, the folks you described, and also the ones with a good reputation and network who get recruited into new opportunities.

tty456 8 hours ago|||

Interesting methodology

clear-octopus 10 hours ago|||

[dead]

Aaronneyer 10 hours ago||

Even if that's true, 2 years is a huge amount of time to make real impact right now. By 2 years, we could have a clear winner of the AGI/ASI race.

tsunamifury 9 hours ago||

This is such a disappointing reality that people believe this.

1) advancements in AI are made by large teams of brilliant people (and individuals who take outsized credit)

2) AGI is defintionless buzz word

3) advancements in AI will need significant changes in either how the model works or fundamentally new non existent datasets.

bpodgursky 9 hours ago||

lol this is just not true sorry.

Claude code was one person's idea as a pet project and now it's singlehandedly 5x'd Anthropic's valuation. Sometimes single people matter, that's life.

HarHarVeryFunny 8 hours ago|||

I think both are true. Claude Code was one man's experimental project, but it's an application of AI, not AI itself.

Anthropic is a large company, with thousands of employees, and seems to be 100% (maybe 200%) LLM and scale pilled. All the advances from one model generation to the next are the result of dozens of experiments first at small scale then at larger scale, all competing for the same "development compute" portion of their overall "development + inference" compute resource.

In this environment, even if there are researchers who have ideas not on the "LLM + scale is all you need" path that Dario seems hell bent on, there seems to be not much chance that these ideas can compete for resources and compute with the mainstream experiments that the company believes their future depends on.

Maybe an individual developer like Sutskever, engaged purely in research rather than manning a barely turnable oil tanker, can make a difference, but at a company like Anthropic it seems way less likely. Cherny's baby is 100% aligned with Anthropic's mission of selling LLM tokens. Someone else fighting the mission, trying to pivot Anthropic in a new better direction is not likely to have such luck.

tsunamifury 5 hours ago|||

I was unaware that one persons idea invented LLMs and their coding abilities.

I strong suggest you better learn your recent history —- and how generally these things work

Traster 13 hours ago||

Karpathy is probably one of the biggest names in AI, I do wonder where he fits now. He's sort of bounced around Tesla back to OpenAI back to independent. He sort of left OpenAI before it really hit the inflection point, and he was at Tesla for a long time and they didn't really deliver what they wanted on the AI side. Now he's bounced around a few places. I understand that the leaders in this market play this silly game of trying to buy up the names like trading cards but I wonder what this turns into.

prodigycorp 13 hours ago||

i wouldn't be surprised if he just becomes a glorified marketer for anthro.

im also going to guess that whatever research he does would be free roam research that primarily serves to market the fact that claude was able to help perform the research.

the visible stuff he's been working on has been mostly agent soft skills. off the top of my head is autoresearch and his the wiki knowledge stuff. nothing particularly groundbreaking, but has helped devs expand their understanding of the utility that these models can provide.

not a diss to andrej i know he's reading this now

resiros 13 hours ago|||

I think you are underestimating both the value of both projects (autoresearch and personal wiki) just because they are simple. I see both POCs for continuous learning / optmization on the harness layer, which in my opinion is a very interesting direction.

I think Andrej has the experience (and now ressources) to productionize this research into something very interesting.

p.s. called it

> Karpathy will help launch a new team focused on using Claude itself to accelerate pretraining research — an increasingly important frontier as AI companies race to automate parts of AI development. (https://www.axios.com/2026/05/19/anthropic-openai-karpathy-a...)

DiscourseFan 12 hours ago|||

No, these are developed off of the assumed uses of the models (predictive autofiller) rather than their actual, cognitive and potential industrial use (developing large scale frameworks for industrial production, automating systems that normally require human monitoring), and uses that we have not yet discovered, because we have not figured out all the constraints and limitations of these models. If Karpathy was in the game like he used to be, he would be on real product. Right now he’s probably so lost by the very thing he helped create that he is stuck doing these mini projects for his own personal interest, without anyone really critically engaging with his work.

charcircuit 7 hours ago||||

Just because something is not ground breaking that doesn't mean that technology path isn't valuable.

saberience 10 hours ago|||

Those projects are a complete joke. Neither of them were even original, people have been playing around with those ideas for well over a year.

They just became "famous" because Karpathy is effectively an AI celebrity, so he could throw shit at a wall and post it on X and it would get 10k Github stars.

But seriously, people have been using the models to tweak hyperparemeters, or using LLMs to help create a second brain using markdown or json files or 100X other combinations of files, for a long time already.

fennecbutt 3 hours ago||

Agree. Watch people glaze him tho. Same as John Carmack who is supposedly brilliant but hasn't actually really done anything since Doom.

canada_dry 13 hours ago||||

> just becomes a glorified marketer

That implies Karpathy is either dumb or desperate and he is neither of those by a long shot.

noufalibrahim 12 hours ago|||

I don't think that's the parents implication.

Generally, when a "good" developer has a huge public presence and reputation, that's quite valuable to a company when they're competing in a tough space. Many a time, more so than the (very high) technical skill of the developer in question.

I've seen large funded companies gather good popular developers like pokemon cards and just have them go around give talks and write blog posts. It creates an aura around them which makes things like hiring, fund raising etc. much easier.

So, it's not really a statement about Karpathy himself. It's more about the company hiring him.

newppc 12 hours ago||

Yea, I say this as a marketing agency owner, not a developer or AI researcher, that besides Sam Altman, Dario, Demis and Elon, that Karpathy is one of the most influential I follow.

There’s a lot of value for the business world in learning AI from someone who has been at the top of their game but now is doing a general service by being a great educator and translator between the fields.

His recent Wiki approach may be simple to devs but is certainly an aha moment for the rest of the peanut gallery paying attention!

swiftcoder 12 hours ago||||

> That implies Karpathy is either dumb or desperate

This kind of thing happens to big names in software all the time. Carmack going to Facebook is a prime example - he joined with the idea of using all those resources to build world-changing tech, and instead he ended up headlining conferences, and fighting a losing battle against the corporate types who were put in charge of Oculus.

nine_k 12 hours ago||

Hasn't Carmack solved a few serious engineering problems, making Oculus more or less the most advanced VR device? (The fact that an advanced VR device does not seem to be needed by the mass market is not an engineering problem.)

gruturo 11 hours ago||

Yes - but - ironically - he did that _before_ joining them. IIRC he literally started collaborating and helping them while being at a different company.

StilesCrisis 10 hours ago||

That seems surprisingly common to me. Visionary engineer has solution to problem, gets hired, solves the broad strokes in the first year, then spends N more years in meetings with exec stakeholders and worrying about schedules/hiring/financials instead of _doing the vision work_.

shuckles 13 hours ago||||

No it doesn’t? It matches his skills to the lab’s needs. Karpathy is a media personality, manager, and educator far more than he is a hands-on researcher.

bdangubic 9 hours ago|||

he’s not a hands-in researcher just like lebron is not a basketball player but media personality :)

shuckles 8 hours ago|||

It’s kind of useless to argue through metaphors here. There are a hundred researchers with more significant contributions to theory and practice than Karpathy. If you disagree, I’d love to see what papers or implementations you think he’s offered that pushed SOTA.

nozzlegear 8 hours ago|||

Lebron can still dunk when he needs to!

HarHarVeryFunny 12 hours ago||||

He already stated his motivation a few months ago in an interview with Dwarkesh - basically saying that he might join one of the big labs, for a while, to keep in touch with frontier research.

Andrej seems like a great guy, but him joining Anthropic feels a bit like a transactional relationship (rich old guy marries hot young chick). Anthropic get a "glorified marketer", and he gets a front row seat at SOTA LLM dev 2026. I don't think they hired him expecting he's going to change the direction/pace of their research.

jimbokun 11 hours ago||

An employment relationship is transactional??? Like the employer pays money and the employee provides labor???

Scandalous!

HarHarVeryFunny 11 hours ago||

Maybe poor choice of words on my part - what I meant was that this doesn't appear to be a case of AI research co. hires AI researcher to do AI research.

A regular marriage is transactional to some extent too right, but not quite the same as Anna Nicole Smith marrying a 90yr old.

As an aside, an Indian guy I used to work with once explained to me how traditional Indian arranged marriages, like his own, work, and they are HIGHLY transactional. It's not just a matter of same caste, same social status etc, but an explicit trade off. In my co-worker's case he cheerfully told me how his wife was very dark skinned, therefore considered not that attractive/desirable (to other Indians!), but her family had money and social status so it was considered a fair trade for a nice looking boy like himself!

Swizec 12 hours ago||||

> That implies Karpathy is either dumb or desperate and he is neither of those by a long shot.

No it implies that he is more valuable for being famous than the hands-on work he can produce. This is the IC endgame

afavour 13 hours ago||||

I don’t think it does. I think it’s better phrased that he is marketing rather than a marketer. He can do whatever he wants to do, in return Anthropic gets to say “hey, this guy works with us!”

ghaff 13 hours ago||

Different people have different wants and needs. It's perfectly reasonable to work on some interesting projects and to be something of a figurehead.

nozzlegear 12 hours ago||||

I don't know anything about this person, but want to point out that renown and validation is something that most (all?) humans crave. That doesn't make them dumb or desperate, it makes them normal.

kmaitreys 13 hours ago||||

> https://gist.github.com/karpathy/442a6bf555914893e9891c11519...

Last thing I saw Karpathy talk about was this, which I find hard to believe that it came from a smart person.

carterschonwald 13 hours ago|||

oh my, i see what youre saying. at this point youd hope everyone has realized that the best way to keep models more reliable is to force them to stay honest via very very string static typing as a feedback loop. bags of text with hyperlinks certainly fail that measure

pixelsort 11 hours ago||||

Yes, that's probably his dumbest public idea to date. Given that this GPT repos and parts of autoresearch are brilliant I'm sympathetic. I think he's earned the right to exhibit mild expressions of AI psychosis at this point.

And, my objection was that he clearly had no understanding of the supply-chain risk he was worsening by advocating widespread use of Obsidian for agentic engineering tasks.

Since his announcement, Obsidian has taken proactive steps to mitigate the risks, or at least study threat model. Hopefully, they will implement proper RBAC or something before someone else with his visibility announces an even more irresponsible half-baked idea.

blackqueeriroh 10 hours ago||

Why is this dumb? Please educate those of us less brilliant than you!

redsocksfan45 9 hours ago||

[dead]

ModernMech 12 hours ago||||

I love how a ton of the replies after it are "I built exactly this with an LLM", even using his name in the repo.

redsocksfan45 12 hours ago|||

[dead]

fennecbutt 3 hours ago||||

CEOs also get paid millions and billions to do nothing. Sooooo...

"Improve yourself, no mistakes" in a loop. Woooah sooo revolutionary...

piker 13 hours ago||||

Being a singular influencer in this space, at this time, may be more valuable than a lot of successful VC-backed startups over the last few decades.

UncleMeat 12 hours ago||||

Andrej is a smart guy. You don't get into Stanford for grad school without that.

But he has always been known for his communication rather than his research. He got famous by putting out a (very well made) course on machine learning that was available to the public. Since graduating he hasn't exactly delivered on revolutionary new stuff at the businesses that employed him but he has continued to be extremely good at communicating thoughts about the current and future state of AI. Businesses want that and he knows that he can deliver that.

prodigycorp 13 hours ago||||

i mean he did publicly openly solicit interest to work at a frontier lab so he can be closer to what's going on https://www.youtube.com/watch?v=kwSVtQ7dziU&t=2870s

alfonsodev 12 hours ago||

And it makes a lot of sense, doesn't?.

There are things that you can only explore and learn in those places, for obvious reasons.

I don't know his personal life goals but he's a great communicator and educator, if this decision makes him more up to date, and allows him to create even more relevant content then is something everyone will benefit. I understand the risks of being bias toward one company and not the other, but if you look at the content he created so far, he always talk principle first and specific tool later.

I think people here should give him the benefit of the doubt.

prodigycorp 12 hours ago||

i meant everything out of respect for andrej. it's no different from how a visiting scholar can be great marketing for an institution

Barrin92 9 hours ago||||

he's not dumb or desperate compared to the average person, but it's very possible to be dumb and desperate compared to the delusional promises and outsized amounts of money in the industry. Manages to make smart people look extremely stupid every day.

coldtea 13 hours ago||||

Greedy is enough. Neither dumb nor desperate needed for this.

foobiekr 12 hours ago|||

Anyone who would voluntarily work for Musk when he went obviously has things going on that aren't great.

tayo42 12 hours ago||||

It's also hard to any hard research on your own without resources. At best a few gpus can only go so far right now.

AIorNot 12 hours ago||||

yes stop kidding yourself that he is going in as a tech leader in terms of providing technical innovation..at that stage its your persona that matters not the tech (sure I think Anthropic is going to listen to his advice..but its a transactional marketing win primarly)

his value to Anthropic is his influence..he has over 2 million followers, and value is that he is the Top influencer for AI right now, like it or not. just like Selena Gomez might be for top for women age 21-29...

Every AI nerd I know reposts his (very thoughtful posts and projects mind you) like religon

0123456789ABCDE 12 hours ago|||

> i know he's reading this now

meanwhile in the real world:

  claude --permission-mode=auto --model=opus -p '/onboard --user=karpathy'

baq 12 hours ago||

expectation: in the real world the CLI will be replaced by an agent prompt and to get to the shell you'll have to ask 'get me bash dammit'

Ifkaluva 12 hours ago|||

He may not be a brilliant researcher, but he is a brilliant teacher. I am glad he is joining Anthropic so he can stay up to date with the next round of things that he will teach :)

amelius 10 hours ago||

It is a pity we don't hear more about the truly brilliant researchers.

All we hear is Altman, Musk, ...

gist 9 hours ago||

> It is a pity we don't hear more about the truly brilliant researchers.

Reason? What is the value of that other than entertainment? And it's not in the interest of companies to make celebrities that then are poach targets (if they can avoid they would yes there are exceptions as noted elsewhere in this thread).

And if you did 'hear' (via articles) to what extent what was said even be correct vs. a writer just fluffing things up to the max.

Tech is not sports where you can actually see the superlatives and know that the person who praise is being lavished on actually won or threw or caught and so on. (Or even music where you can hear it and see the stadium that is packed with fans..)

hart_russell 12 hours ago|||

The self drive on my Tesla is damn near perfect. I haven’t driven my car in around 6 months.

HarHarVeryFunny 11 hours ago||

FWIW while Karpathy was at Tesla he was basically working on the vision component. The actual driving component (using vision as an input) was originally all C++. They may have started migrating parts of the driving component from C++ to neural networks while Karpathy was there, but most of it happened after he left in 2022, with the big switch being FSD 12 in 2024. User reports from before/after FSD 12 are like night and day.

B1FF_PSUVM 11 hours ago||

I was never convinced by the "vision only" approach - I don't see the point of throwing out or refusing to have additional data from other sensors.

I suppose that with modern ML they can just toss it in the blender and reap the benefits ...

HarHarVeryFunny 11 hours ago||

I'm not talking about vision (cameras) vs lidar etc, just the Tesla FSD architecture that separates the "vision" component (turning camera/sensor inputs into symbolic road/sign/vehicle/pedestrian/etc data), and the driving component which takes the vision data, plus current location and destination, and uses that to actually drive the car - switch lanes, make turns, avoid obstacles etc etc.

efavdb 12 hours ago|||

Tesla self driving works. I don’t know if Karpathy deserves credit for that or not.

Traster 10 hours ago|||

Tesla self driving kind of works. In a very similar way to how it kind of worked back in 2016. It's better than it was in 2016, don't get me wrong. But even today they haven't solved the problem and Karpathy left in 2022. And other companies notably have actually surpassed Tesla over that time. I don't think anyone could reasonably say he walked away in 2022 because he thought the job was finished.

comboy 9 hours ago||

Who surpassed Tesla that doesn't operate in some very limited region?

slaw 9 hours ago||

Waymo 70k driverless rides a day.

Apollo Go 100k driverless rides a day.

Tesla 0-5? driverless rides a day.

trollbridge 7 hours ago||

It's a bit of an exaggeration to call Waymo "driverless". All of their cars are supervised.

slaw 11 hours ago|||

Yes, no driver needed at all. Your Tesla makes money for you while you sleep at home.

random3 1 hour ago|||

he'd probably be a great face for developer relations or whatever Antropic calls the role

pier25 13 hours ago|||

> He sort of left OpenAI before it really hit the inflection point

Sorry I'm out of the loop... What inflection point are you referring to?

HarHarVeryFunny 9 hours ago|||

When OpenAI was founded, the mission was to develop AI, but nobody (anywhere) knew how to do AI, so OpenAI did ML research on games instead, which is what DeepMind was doing (with Google's perceived AI/ML dominance being the raison d'etre for OpenAI, and Google having just bought DeepMind). This was the era when Karpathy was at OpenAI.

Around the time Karpathy left, Ilya Sutskever, another OpenAI founder, started playing with Google's new "Transformer" architecture, which was the beginning of the "GPT" series, GPT-1, GPT-2 and eventually ChatGPT (GPT 3.5 + RLHF). In retrospect OpenAI's early Transformer experiments and GPT-1 was the inflection point that moved OpenAI from a company that wanted to build AI, as soon as anyone else did, to one that was actually doing so, although I think it would be revisionist for anyone to claim they knew what they were doing at the time. The early GPT-1 and GPT-2 papers read more like "wow, this is a bit unexpected, look at all of the things it can do!".

helloplanets 13 hours ago||||

Karpathy left OpenAI in 2017 for Tesla, came back from Tesla in 2023 and left again in 2024.

So pretty sure the original poster is talking about 2017.

shuckles 13 hours ago||||

GPT-1 presumably, which was released a year after he left. Prior to focusing on GPT, OpenAI was pursuing a lot of research directions.

Traster 10 hours ago||||

Well Karpathy left in 2017, and all the sort of commercial stuff didn't happen till a while later - for example they set up the structure to take external money in 2019 and that's obviously the point at which they'd found the pathway that justified doing massive training runs and all that. So Karpathy was out very early (left at the point that Musk thought OpenAI had basically failed).

nashashmi 13 hours ago|||

The inflection is Right before its meteoric rise.

ArchieScrivener 12 hours ago|||

Or they collude by hiring each others engineers as a way to create manageable competition and information sharing outside their fiduciary duty to shareholders.

synergy20 12 hours ago|||

I somehow felt he, along with Andrew Ng, are very few well-known AI experts that are left behind on the money side during the AI-gets-me-super-rich crazy time, unfortunately.

liuliu 12 hours ago|||

Only if you think B is an important thing. He is easily > $100M from Tesla.

gordonhart 12 hours ago||||

Andrew Ng has been investing in AI startups for almost a decade, I would be very surprised if this rising tide left him behind.

dzonga 12 hours ago|||

I can't speak for Andrew Ng - but my take is he did out of pure altruism - love. just in terms of advancing free education e.g coursera & the free machine learning courses etc he brought to the masses.

not everyone does things to be rich.

outside1234 13 hours ago|||

DevRel or whatever we call that now

nashashmi 13 hours ago|||

Some people are good at developing the sciences. Others are good at developing commercial products.

And tesla is not a good place for science development. Tesla is structured from narcissistic mindset: results driven, cynical, and position-based. This doesn’t bode well for long term sciences.

I dont see how he could be helping anthropic

redanddead 13 hours ago|||

I read this as a bad sign for Anthropic. Relying yet again on more hype instead of improving products.

OpenAI’s hiring recently has been much stronger, whether through luck or structure. The “no-name” guys have actual taste. I love that. I don’t care that they’re no-names.

I don’t know Karpathy personally, I won’t speak bad about a man I don’t know. I hope he makes CC better. I just read this as hype. My point is that there’s nothing he has that an empowered no-name product manager doesn’t. It’s like Alex Wang at Meta. That acq didn’t redeem Meta. They actually lost LeCun. Where’s Llama today?

Regardless of what Anthropic’s share price is, OpenAI has been fucking killing it recently. I don’t take particular pleasure in saying that, i’ve been a google and gemini guy for years

My lens is meritocratic. My experience is as an extremely heavy user of both company’s full suite of products in the range of 5 digits per month. My interest is better products not hype.

sigmar 13 hours ago|||

>OpenAI’s hiring recently has been much stronger, the guys have actual taste.

Can you cite specifics? "I won't speak bad about someone, but also won't speak good about others" resulted in a comment that seems to contribute nothing

redanddead 12 hours ago||

I judge them from a meritocratic lens.

I’m hoping Karpathy will make Claude Code better, in the meantime I’m super happy seeing a small product manager like Tibo fucking crushing it on Codex

woah 11 hours ago||

Where are you following the comings and goings of small no name product managers like it's a baseball team?

sigmar 10 hours ago|||

clearly the employees that tweet the most must also have the best contributions to R&D... </sarcasm>

redanddead 6 hours ago||

That’s not my claim.

My point is that product velocity is visible in shipped workflow improvements, not prestige hires

Prestige is fickle, look at academia today

redanddead 11 hours ago|||

Hahaha wdym? Where have you been dude?

Joking aside, there are small communities pushing codex and AI to the bleeding edge of what's possible.

Here I'll give you an example. The last few updates from Boris at CC have been tweaks to the system prompt to make it use less compute, effectively making the system dumber, making it tell you to go to bed. I mean come on! Tibo has been impressing me, bc they're building the things these small communities are building.

One of the things these bleeding edge guys and girls have been working on is a /goal feature, essentially ralph loops. Codex released it as a feature the other day. I can't help but be impressed. As an ex-pm, this is product management.

Then you take a look at what the Chinese are doing on their own forums, and it just makes what Google and Anthropic are doing look outdated. OpenAI feels competitive, which I like. What's coming will not be kind to us, we adapt or we die.

blackqueeriroh 10 hours ago||

Anthropic has had goal for a bit.

vondur 13 hours ago||||

It feels like these companies are constantly going back and forth on who has the best product constantly. It's such a dynamic time with how fast they are both working.

scottyah 13 hours ago||||

OpenAI seems to be dumping a LOT of money into marketing on social media at least.

xmcp123 12 hours ago|||

To be fair, Mythos is probably one of the most significant marketing pushes in the industry in both impact and investment.

I am sure there is an element of reality in it's capabilities, but there's also a significant amount of "We don't have the compute to handle this at scale", and "look look, we have the best model. It's so good that you can't even compare it to other models. That is how good we are."

redanddead 11 hours ago||

I’m noticing a real disconnect in the user base about this

The Claude maximalists that can never see any wrong in anything and the users that care about actual capability

These guys are going to be in for a rude awakening when the Chinese are steamrolling us with data centers you can see from space and better models, Amodei will tell you that himself

xmcp123 11 hours ago||

Hey, it's not like the Chinese have a serious demographic crisis they can't cope with, and their only hope is to significantly increase productivity per worker.

trollbridge 7 hours ago||

The USA does too, but it seems all we can talk about in America is how workers are "obsolete".

redanddead 6 hours ago||

Anyways, what do you think the solution is to that?

redanddead 12 hours ago||||

I’ve been using Claude and Codex extremely heavily and use adblockers so I don’t see them

Avicebron 12 hours ago||

I think they mean the paid shills

redanddead 12 hours ago||

Whenever I see a user base turn against actual users or imply censorship or discredit actual experiences it always ends in a death spiral: Deny -> Product stops improving -> Censor -> Die

Adapt or die

munk-a 10 hours ago|||

Anthropic as well. The private equity partnerships for guaranteed users are going to make their numbers look great.

j_bum 13 hours ago||||

Curious what you mean by killing it? Products? Model quality?

redanddead 13 hours ago||

Dude, both! Codex is going to eat Openclaw… i don’t love saying that.

What codex is a few steps away from doing is changing fundamentally a lot of workflows.

Remote codex with their computer use is basically you at your computer doing things, 24/7.

Then they added gpt images 2.0

what codex can do, in a few more product iterations, is show you visually side by side “would you prefer this (A) or that (B)” in a series of questions. This is what some open source researchers have been up to. That’s no longer guessing.

I’m not trying to hype a company i have no stake in, but they’ve been killing it.

It’s extremely compute intensive, but also very satisfying.

scottyah 12 hours ago||

Codex and openclaw are both "owned" by openai, and most of the features have been in claude code for awhile now.

redanddead 12 hours ago||

To be fair, Claude Dispatch was really cool. I had to wait a good 3 weeks for Codex to come out with remote

misiti3780 13 hours ago||||

really - what am i missing?

redanddead 13 hours ago||

It just feels like more hype instead of product focus.

Example 1, just from top of my mind, Composer 2.5 released today. Go look at their benchmark.

Composer 2.5 and Opus 4.7 ranked around the same, meanwhile gpt-5.5 was miles ahead.

You wouldn’t have caught me dead using a gpt model 2 years ago

worik 9 hours ago||||

This is true for all the UASanian frontier model owners

They are all going to get their lunch eaten by the Chinese.

In the USA with access to most of the world's capital, they've succumbed to the temptation of "bigger, faster, harder"

Whilst the Chinese, with enough capital only, have had to think.

The Chinese models are already miles ahead on cost/inference basis and will probably pass all the USAnian companies in five years

The age of UASnian engineering dominance are coming to an end.

Let's all hope she goes quietly - not at the moment

felixgallo 13 hours ago|||

Out here in the actual demonstrated world, OpenAI has been leaking quality people like a sieve, has not yet demonstrated anything remotely similar to 'taste', and is led by a sociopath (https://www.newyorker.com/magazine/2026/04/13/sam-altman-may...), so I think you can rest easy.

napierzaza 13 hours ago|||

[dead]

Veserv 13 hours ago|||

I mean, you would think that all those people he killed as the person in charge of deploying knowingly dangerously defective self-driving software for profit would have had a impact. But executives seem to just skate on killing customers to line their own pockets these days. Just "following orders" I guess.

Barbing 13 hours ago|||

He deployed, not just developed?

Veserv 12 hours ago||

Yes, he was [1] director of AI and Autopilot Vision at Tesla, directly poached and reporting to Elon Musk on the most important headline feature of Tesla directly managed by Elon Musk.

He had both the technical and executive authority to determine if the product was fit for customer usage. He had direct executive responsibility for the product on the road between 2017-2022.

If he, the lead architect and executive responsible felt the product was dangerous and then he was overridden, he can not get away with claiming he was “just following orders”, he had a moral duty to not sign-off or quit otherwise he is clearly complicit in deploying a dangerous product for his own self-enrichment.

When people talk about engineering ethics, this is literally a completely uncontroversial textbook example. The only way you accept this is if you do not want ethics in engineering.

Furthermore, he was extremely hireable with numerous job opportunitys available to him. He would not be destitute or even particularly worse off if he did quit for ethical reasons. Any self-preservation defense is also invalid.

[1] https://techcrunch.com/2017/06/20/tesla-hires-deep-learning-...

Barbing 12 hours ago|||

Andrej Karpathy is a reason* Tesla doesn’t have Lidar and thus is a reason Tesla self driving isn’t nearly as safe as it could be?

He heard Elon say “I drive with eyes, so cars just need eyes” & shipped?

:( happy to have my impressions corrected (but I was kind of pretending it’s a 2026 scenario where you could slap Lidar, ship a Waymo, if you were just willing to spend the friggin MONEY - 2017 was too early for most any “self” driving IIRC)

*edit - in a scenario where his refusal to skip Lidar catalyzed change

Avicebron 12 hours ago|||

I don't the comp sci has the same requirements for ethics coursework like mechanical, aerospace, etc..

ahartman00 10 hours ago|||

According to ABET they do if they want the degree to be accredited. We had two classes for my SE degree. From Criterion 3. Student Outcomes:

"2. an ability to apply engineering design to produce solutions that meet specified needs with consideration of public health, safety, and welfare, as well as global, cultural, social, environmental, and economic factors." "4. an ability to recognize ethical and professional responsibilities in engineering situations and make informed judgments, which must consider the impact of engineering solutions in global, economic, environmental, and societal contexts."

https://www.abet.org/accreditation/accreditation-criteria/cr...

browsingonly 12 hours ago|||

Passing a mandatory class != believing in its message and acting on it.

Unfortunately, rather important courses like engineering ethics have become lumped in with mandatory DEI objectives and similar 'grievance studies' requirements, classes which many suffer through quietly, regurgitating the Correct responses while they count the minutes until they can get back to more substantive classwork. Some undergraduates may unfortunately gloss over ethics just as they gloss over lectures on privilege.

Barbing 5 hours ago||

The privilege stuff feel zero sum?

clear-octopus 9 hours ago|||

[dead]

espadrine 12 hours ago||

His goal could simply be to learn SOTA architectures.

When rumors started that GPT-4 design would be kept secret, he likely wanted to know what architecture it would be. Perhaps he left Tesla, waited out the non-compete clause, and joined OpenAI to learn its details.

When Mythos dropped, there were hints that it had a new architecture. He might similarly want to know how it works.

Either way, there is enough cross-lab hiring that those secrets eventually get known, but only by the labs.

zitoshi 12 hours ago||

interesting signal about where AI is...

It actually feels like a signal that it is in a tapering phase.

As in, if it was in a growth phase a freeform, solo - collab with who you want, would be more beneficial. But in a tapering phase you'd want structure and to be in the private formal meetings.

just an idea

TeMPOraL 11 hours ago||

Or, you could fit the exactly opposite story to the same data:

Growth is when you want to have institutional support, to be at the tip, backed by infinite money and best compute infra, and benefit disproportionally from compounding. Conversely tapering is when you're best flying under the radar, and there's plenty of value both in ideas and in hardware, as the leading players shed excess they can't support anymore, ...

zitoshi 1 hour ago||

Yes, that makes sense.

But to your point, then the growth is not in the ideas that can be generated with AI, and more in the structure. Which feels like a different stage. Maybe "growth", wasn't a good word.

KeplerBoy 11 hours ago||

Feels like the opposite.

Stuff is still happening and you need to be part of a big lab to see it. NanoGPT is fun but at some point you need that datacenter.

zitoshi 1 hour ago||

I would guess he's never left out at whats happening, because he's still highly connected with those people even when not at the company.

I also feel like there are many ways he can access compute for use of his own ideas.

423abaf 13 hours ago||

He is citing R&D? I have always been under the impression that he is an image recognition etc. expert rather than an LLM expert.

So, does Anthropic pivot to military tech or pretend to do so before the IPO?

Or is this simply a deal where he uses his formidable influencer skills for Anthropic and gets to cash in on the IPO?

gyomu 12 hours ago||

We are in the early stages of AI. Anthropic is Altavista and OpenAI is AskJeeves or something. 10-20 years from now the scene will be unrecognizable and all of this will be inconsequential but at the same time it is the fondation on which tomorrow is built.

nilkn 9 hours ago||

Anthropic looks a lot more like early Google -- not the first mover, but "lightning in a bottle" culture, talent, focus, and product direction that causes them to become a dominant, enduring figure.

OpenAI looks a lot more like early Yahoo -- earlier, quite a spectacle at first, definitely a game-changer and disruptor, but overspent, less focused, and subject to slow collapse under its own fragmentation and lack of overwhelming clarity of mission and purpose.

All that said, history rhymes but does not repeat, and trying to map present-day companies onto previous generations is an exercise in futility. The future is fundamentally unique.

TrackerFF 11 hours ago|||

Well, one big difference now is that you need to billions to become the next big player. The barriers to entry are incredibly high, if you plan on competing against the big players.

Of course, there could be some future lab or startup which completely revolutionizes the field by going for some approach that doesn't require a boatload of money to train a model, but for now, we're stuck with the LLMs and the costs they come with..

PUSH_AX 8 hours ago||

People say deepseek is about 5 months behind frontier, they claim their final training run was 7 figures. The trail blazing is likely making it cheaper to follow not more expensive.

Barbing 12 hours ago|||

But do I leave all my money in US index funds?

jjordan 12 hours ago|||

It's the safer bet.

Barbing 12 hours ago||

You are now my financial advisor

brcmthrowaway 12 hours ago|||

Which funds?

Barbing 12 hours ago|||

Searching “invest $10[0]k into USA index funds low fees”, the Vanguard funds that come up! (Vanguard sounds a little special, maybe they do good marketing. Ah, per Wiki: “Vanguard is owned by the funds managed by the company and is therefore owned by its customers.”)

Looking familiar: VTI or VOO, VTSAX or VFIAX

littlexsparkee 11 hours ago||

you might consider VXUS for int'l / hardware exposure (20-40% of total)

moffers 12 hours ago||||

Maybe just like one of each.

sieabahlpark 12 hours ago|||

[dead]

Sohcahtoa82 12 hours ago|||

OpenAI will be the Yahoo of AI. Starting off as a household name, but fades to irrelevancy as competitors take over.

UltraSane 12 hours ago|||

Google is much better positioned long term with their TPUs and separate enormous revenue from advertising.

destring 11 hours ago|||

Not so sure on the advertising front. B2C is now mostly social media, and Google doesn't own any. That's why the pivoted hard to YouTube shorts to try and capture that segment, but it is nowhere near TikTok or Instagram. Case in point, Meta's advertising revenue is predicted to surpass Google's this year.

munk-a 10 hours ago||

You underestimate that YouTube has become what TV was for the majority of young people. Premium is relatively lucrative - but the ad revenue is insane. If Google can succeed in building an AI to generate slop to hold eyeballs fixed on the screen and cut out creators it will be a highly profitable dystopia. Facebook is similarly positioned (via Instagram - not Facebook itself) while TikTok is in a highly unpredictable state with the recent acquisition. Oracle may stay hands off and treat it as a golden goose but that hasn't been the recent track record for anyone with the surname Ellison.

arealaccount 12 hours ago|||

So Google remains as Google

croes 11 hours ago|||

So we get ad flooded useless AIs?

make3 10 hours ago||

That's a funny thing to say as time is infinite, and we're at the early stages of every single thing. Reasoning in time dynamics is useful though to be clear

thoughtpeddler 10 hours ago|

Wondering what the plan is to steward Eureka Labs, LLM101n, and whatever else was being cooked up. As a fellow educator, was very much looking forward to seeing how this would have evolved things.

More comments...