Claude for Excel - Hacker News

Posted by meetpateltech 2 days ago

667 points | 450 commentspage 3

scrappyjoe 1 day ago|

Maybe this is how we get code versioning for Excel.

Git LFS for workbook + the following prompt :

“Create a commit explains what has changed in the workbook since the last commit. Be brief, but explain the change in business terms as well as code change terms.”

vjvjvjvjghv 2 days ago||

Hope it’s better than what MS is currently shipping as AI. Everything I try to do something, the response is “sorry, I can’t do this”.

smithkl42 2 days ago|

Copilot is getting better - I'm getting fewer of those than I used to - but it's still significantly more stupid than other agents, even when in theory it's using the same model.

theshrike79 1 day ago||

The best thing that can come from this is unit tests for Excel.

LLMs work best when they can call tools (edit the sheet) and test their results in a loop.

It's like the "value seek" thing Excel has had since forever; "adjust these values until this cell is X"

Excel doesn't have any way to verify that every formula in that 60k line sheet is correct and someone hasn't accidentally replaced one with a static number for example.

filearts 1 day ago|

In a previous professional life, I did financial modelling for a big 4 accounting firm. We had tooling that allowed us to visualize contiguous ranges of identical formulas (if you convert formulas to R1C1 addressing, similar formulas have the same representation). This allowed for overrides to stick out like a sore thumb.

I suspect similar tools could be made for Claude and other LLMs except that it wouldn't be plagued by the mind-numbing tedium of doing this sort of audit.

rahimnathwani 2 days ago||

How is this different from the existing Claude skill, that uses a prompt and pandas to edit an Excel file?

https://github.com/anthropics/skills/blob/main/document-skil...

shooker435 2 days ago|

This isn't built for Excel users who use Github and Claude Skills, it's built for Excel users who would run away from Git commands.

rahimnathwani 2 days ago||

The Claude skill I linked to is built into the Claude desktop client. You just attach an Excel file to your chat and ask away.

I linked to the skill prompt just to more clearly explain the approach that's currently available to all Claude users.

It requires zero familiarity with git or command line.

jawns 2 days ago||

Gemini already has its hooks in Google Sheets, and to be honest, I've found it very helpful in constructing semi-complicated Excel formulas.

Being able to select a few rows and then use plain language to describe what I want done is a time saver, even though I could probably muddle through the formulas if I needed to.

break_the_bank 2 days ago||

I would recommend trying TabTabTab at https://tabtabtab.ai/

It is an entire agent loop. You can ask it to build a multi sheet analysis of your favorite stock and it will. We are seeing a lot of early adopters use it for financial modeling, research automation, and internal reporting tasks that used to take hours.

frankfrank13 2 days ago|||

I have had the opposite experience. I've never had Gemini give me something useful in sheets, and I'm not asking for complicated things. Like "group this data by day" or "give me p50 and p90"

gumby271 2 days ago|||

Last time I tried using Gemini in Google Sheets it hallucinated a bunch of fake data, then gave me a summary that included all that fake data. I'd given it a bunch of transaction data, and asked it to group the records into different categories for budgeting. When asking it to give the largest values in each category, all the values that came back were fake. I'm not sure I'd really trust it to touch a spreadsheet after that.

genrader 2 days ago||

you should:

-stop using the free plan -don't use gemini flash for these tasks -learn how to do things over time and know that all ai models have improved significantly every few months

ipaddr 2 days ago||

Or not use it.

break_the_bank 2 days ago|||

I forgot to add, you can try TabTabTab, without installing anything as well.

To see something much more powerful on Google Sheets than Gemini for free, you can add "try@tabtabtab.ai" to your sheet, and make a comment tagging "try@tabtabtab.ai" and see it in action.

If that is too much just go to ttt.new!

dangoodmanUT 2 days ago||

Gemini integratoins to Google workspace feels like it's using Gemini 1.5 flash, it's so comically bad at understanding and generating

teddyh 1 day ago||

“Copilot in Excel is a global financial crisis waiting to happen.”

— Zack Korman, <https://x.com/ZackKorman/status/1974828240679166396>

hufdr 1 day ago||

AI can definitely save time, but sometimes it hides the real problems. Most spreadsheet issues aren’t math errors they’re logic messes. Claude can fix your sheet, but it can’t fix your company culture.

fudged71 2 days ago||

Interesting their X post mentions "pre-built Agent Skills" but it's not on the webpage. I wonder if they will give you the ability to edit/add/delete Skills, that would be phenomenal.

Edit: found it on their other blog post https://www.anthropic.com/news/advancing-claude-for-financia...

luccasiau 2 days ago|

You can add and customize skills in claude.ai and other surfaces

fudged71 18 hours ago||

Sorry, I'm well aware of skills, but wanted to know how specifically they would be used in this Excel extension

asdev 2 days ago||

George Hotz said there's 5 tiers of AI systems, Tier 1 - Data centers, Tier 2 - fabs, Tier 3 - chip makers, Tier 4 - frontier labs, Tier 5 - Model wrappers. He said Tier 4 is going to eat all the value of Tier 5, and that Tier 5 is worthless. It's looking like that's going to be the case

mediaman 2 days ago||

That is a common refrain by people who have no domain expertise in anything outside of tech.

Spend a few years in an insurance company, a manufacturing plant, or a hospital, and then the assertion that the frontier labs will figure it out appears patently absurd. (After all, it takes humans years to understand just a part of these institutions, and they have good-functioning memory.)

This belief that tier 5 is useless is itself a tell of a vulnerability: the LLMs are advancing fastest in domain-expertise-free generalized technical knowledge; if you have no domain expertise outside of tech, you are most vulnerable to their march of capability, and it is those with domain expertise who will rely increasingly less on those who have nothing to offer but generalized technical knowledge.

asdev 2 days ago|||

yeah but if Anthropic/OpenAI dedicate resources to gaining domain expertise then any tier 5 is dead in the water. For example, they recently hired a bunch of finance professionals to make specialized models for financial modeling. Any startup in that space will be wiped out

HDThoreaun 2 days ago|||

I dont think the claim is exactly that tier 5 is useless more that tier 5 synergizes so well with tier 4 that all the popular tier 5 products will eventually be made by the tier 4 companies.

extr 2 days ago|||

George Hotz says a lot of things. I think he's directionally correct but you could apply this argument to tech as a whole. Even outside of AI, there are plenty of niches where domain-specific solutions matter quite a bit but are too small for the big players to focus on.

rudedogg 2 days ago|||

Tier 5 requires domain expertise until we reach AGI or something very different from the latest LLMs.

I don’t think the frontier labs have the bandwidth or domain knowledge (or dare I say skills) to do tier 5 tasks well. Even their chat UIs leave a lot to be desired and that should be their core competency.

benatkin 2 days ago|||

Interesting. I found a reference to this in a tweet [1], and it looks to be a podcast. While I'm not extremely knowledgable. I'd put it like this: Tier 1 - fabs, Tier 2 - chip makers, Tier 3 - data centers, Tier 4 - frontier labs, Tier 5 - Model wrappers

However I would think more of elite data centers rather than commodity data centers. That's because I see Tier 4 being deeply involved in their data centers and thinking of buying the chips to feed their data centers. I wouldn't be so inclined to throw in my opinion immediately if I found an article showing this ordering of the tiers, but being a tweet of a podcast it might have just been a rough draft.

1: https://x.com/tbpn/status/1935072881425400016

mitjam 2 days ago|||

Andrew Ng argumented in 2023 (https://www.youtube.com/watch?v=5p248yoa3oE ) that the underlying tiers depend on the app tier‘s success.

That OpenAI is now apparantly striving to become the next big app layer company could hint at George Hotz being right but only if the bets work out. I‘m glad that there is competition on the frontier labs tier.

matsur 2 days ago|||

People were saying the same thing about AWS vs SaaS ("AWS wrappers") a decade ago and none of that came to pass. Same will be true here.

tln 2 days ago||

Claude is a model wrapper, no?

piperswe 2 days ago||

Anthropic is a frontier lab, and Claude is a frontier model

tln 1 day ago||

Anthropic models are Sonnet / Haiku / Opus

https://docs.claude.com/en/docs/about-claude/models/overview

piperswe 1 day ago||

Okay, Claude is a _family_ of frontier models then. IMO that's a pedantic distinction in this context.

More comments...