Codex logging bug may write TBs to local SSDs

Posted by vantareed 12 hours ago

Codex logging bug may write TBs to local SSDs(github.com)

366 points | 198 commentspage 2

joelthelion 3 hours ago|

A good moment to switch to an open solution like opencode or pi.

purpleidea 9 hours ago||

I want to like codex, but the quality is just not very good, especially when compared to Claude.

It used to work okay, but a while back they landed a major regression for an entire team of folks I work with.

No response, no workaround.

https://github.com/openai/codex/issues/23762

newtwilly 2 hours ago||

Decent sandbox + sandbox override experience with pi coding agent... pi-sandbox uses the same sandbox tech that claude code uses, although it uses a fork that's a little behind, and I'm not sure exactly why it uses a fork.

You can install pi, then install pi-sandbox locked to the current version. Here it is described how pi-sandbox plus an additional extension allow you to have the experience where a sandbox is used, but you can fall back to unsandboxed with approval required. https://github.com/carderne/pi-sandbox/issues/50

christophilus 7 hours ago||

I don’t trust any agent to respect any boundaries. They might today. But tomorrow’s vibe coded slip update might break it in subtle ways.

My solution to this is to only run agents in a sandbox of my own making (a locked down Podman container).

drakythe 4 hours ago|||

They can't respect boundaries as long as those boundaries exist only in the LLM instruction set. A human being who follows rules long enough the rules will become second nature (usually), almost to the point where long running companies are known for having rules no one understands (Chesterton's Fence is alive and well).

But an LLM have a limited "memory" and while the instructions might land in there and be of sufficient priority to be "respected" a single instance of that memory getting too full or the LLM autocompleting the work around because that was the statistical "best" solution and any barriers that exist only in LLM instructions and not in hardcoded guards will evaporate like so much morning fog.

matheusmoreira 6 hours ago|||

I went the full virtual machine route. Just finished hardening the setup and firewalling it off my local network. Not perfect but it does make me feel much safer.

altcognito 6 hours ago||

I think part of the question should be, why is there no QA or test that catches this? It's one thing to be slopware, but why didn't anything run a test that catches this?

theowaway213456 6 hours ago||

Every time you write a test that handles some data, you write an assertion about how much data is handled?

Come on, this is such an easy thing to forget to test. Don't act like there is some magical testing strategy that would have caught this

altcognito 6 hours ago||

I'll acknowledge that this is probably not likely to get caught.

Integration testing could/should catch this, especially for a client side app.

A simple constraints is a good thing. "Our app shouldn't use more than 50mb of ram, or use 3gb of disk space."

java-man 2 hours ago||

what QA?

ares623 11 hours ago||

i hope they find the smoking gun, the key insight, the kicker.

59nadir 11 hours ago||

Then they can apply a clean solve, the cleanest solution.

It's fascinating how offensive some of this verbiage becomes to you when you see it attached to LLM output too much.

jofzar 10 hours ago||

Ugh this one's gets me so bad, same with "wire" and "wired" everything is wired to something.

never_inline 3 hours ago||

that's a real gap

wrxd 7 hours ago||

At least they could call someone who’s is absolutely right so that the tool can see its mistakes now

sigbottle 8 hours ago||

I have noticed absurd lag from the browser usage and sometimes complete bricking of my network too on my computer. I thought it was just my computer getting old, but possibly it's ChatGPT.

xfgong 8 hours ago||

Same issue with Claude Code btw — it writes massive debug logs to ~/.claude/logs. Had to symlink it to a tmpfs to stop wearing out my SSD.

eddyfromtheblok 7 hours ago|

I don't see this. According to their docs, logs are no longer written: https://code.claude.com/docs/en/claude-directory

dundercoder 11 hours ago||

If something like this is helpful or necessary, that’s what ram backed tmpfs is for.

mrweasel 11 hours ago|

Using a RAM backed tmpfs would be a work-around as to not trash your SSD. It's doesn't fix underlying problem. It's incredibly poor design on OpenAIs part.

bob1029 11 hours ago||

I'm struggling with how this much logging information could be generated at any level of verbosity. Is codex writing log entries while it's sitting idle? Why would someone want to look at these logs?

More comments...