Claude, please stop trying to memorize random crap

Posted by theahura 4 hours ago

Claude, please stop trying to memorize random crap(12gramsofcarbon.com)

150 points | 112 commentspage 2

trjordan 2 hours ago|

It's because it mostly doesn't matter what you are trying to get the code to do. What matters is what the code does.

Session logs can absolutely be useful, but not when building further. It's just that that the place they slot in is during validation. You know, that place between the markdown plan and CI passing, where there's 800 new lines of code and it all seems sort of fine when you click around?

Session logs can show you what sort of manual validation happened. CI will run the tests you had, and the code will show you what new unit tests were added, but session logs can show you that the agent drove the app with Playwright, or that the agent read and considered the prod config as well as the dev config.

Nothing bulletproof, but not every piece of validation work merits a test in the repo that lives forever. We've gotten a lot of mileage out of re-analyzing the sessions, figuring out where the agent made decisions without asking, and forcing the agent to consider validation for those decisions. That's the sort of thing that's hard to dictate up front but easy to highlight with the session logs.

8note 1 hour ago||

Thats interesting, but what was the methodology?

is the conclusion really that its just more important to create proper artifacts from any tricks that got the llm to understand the code better?

is the tool for searching the history just bad?

adverbly 2 hours ago||

I'm not sure how well this take will age.

Its certainly true at the moment, but give it 10 years and we might have systems that are much cheaper and much better at context management than they are now.

(Apologies to anyone who is under the impression that we were very likely going to be at the singularity in 10 years time. Possible != very likely)

FuckButtons 1 hour ago|

Sure, but it’s equally likely that we hit a point where scaling becomes economically unviable because we can’t come up with enough algorithmic improvements to break free of the tyranny of log linear scaling. (I’m not sure how many 2x in token cost people would be willing to pay)

mastax 2 hours ago||

I found that if you allow any low value things into memory, Claude will notice that established pattern and start trying to add low value memories at an ever increasing pace.

jvuygbbkuurx 2 hours ago|

Explains why it made so many memories on my work machine, but never made one on my personal one. Maybe the project size also affects it.

linsomniac 2 hours ago||

I like the memory system, in general. For reference I'm using mostly Opus 4.8 + Max effort. It will often pull things out of memory that are relevant. Like I'll ask it to come up with a few options I should consider for, say, a self-hosted OIDC provider and it'll say things like "Considering the size of your operations team, this might be a better fit because of X and Y".

Now, I'll agree that this is probably the sort of thing I should put in the CLAUDE.md, but in this case it wasn't on my radar to put that in my CLAUDE.md, so it was nice that it surfaced that.

It does sometimes go awry though. Today I was asking about a problem I was having authenticating, and it said "you may be running into this trusted proxy setting because you put your apps behind an haproxy". That is true of 95% of our apps, so it was worth mentioning, but in this case it was not so I had to correct it. But, I'm glad it mentioned it because if we did have it proxied it could have saved me a lot of time.

zahirbmirza 2 hours ago||

Even with memory off this occurs within a conversation.

It is like an annoying friend, who remembers something from a past conversation, that you have grown and developed from, but they still want to hold it against you.

Fabricio20 2 hours ago||

I specifically disabled claude memory in a project because it kept writing down thigns to memory that didn't need to be in memory, including severly wrong statements that then would confuse it later. At some point it got re-enabled automatically which had me ask claude itself to "turn it the fuck off" by which it promptly figured out that both ("autoMemoryEnabled": false, "autoDreamEnabled": false) are necessary and need to be at the user home settings, not in a project override (which is what I had with the original setup that eventually got ignored by a CC update).

I agree with other commenters here, if anything is worth being rememebered, it will be in code comments, git commit messages, CLAUDE.md or other formal documentation. The auto memory system just causes confusion and leaves stale and outdated information written down.

Its an interesting thought experiment as well, I originally thought that having the model write down memory files by itself would be a nice addition, but after playing around with it, it became clear to me that good as an idea turns out bad in practice because the model can't correctly gauge what deserves being stored as a memory.

andai 2 hours ago||

> "turn it the fuck off" -> "autoDreamEnabled": false

So you told it don't go the fuck to sleep ;)

estetlinus 2 hours ago||

Ugh, agentic _dreaming_. Why on earth would I want that?

grimcompanion 2 hours ago||

> I believed this so strongly that my company built an entire product around this concept. I used to tell folks that "session transcripts were the new oil," that they were more valuable than the code itself.

This is infuriatingly common wrt talking/writing about how to use AI effectively. All of the "this is how you write an AGENTS.md" and "you need to talk to it like X to optimize it". Like sure, you can believe that as much as you want but unless you provide some evidence you can keep your shitty CLAUDE.md to yourself and don't pollute the whole company's git repo, thanks.

estetlinus 2 hours ago|

When nobody actually knows (how to write a CLAUDE.md), everyone’s an expert. Infuriating, indeed. Even more so when people vibe code those files without proofreading.

bluegatty 2 hours ago||

There's a lot of valuable information in there, its' too noisy.

oefrha 3 hours ago|

I have this in my global CLAUDE.md after being annoyed by all the random crap memories.

> Don't start generating an auto-memory entry before asking me. Ask first, write only if I confirm — no speculative drafting.

No more crap after this.

Incidentally I don’t recall Opus 4.8 asking me once in the past few weeks. Older models did ask semi-frequently.

More comments...