Top
Best
New

Posted by HellsMaddy 15 hours ago

Claude Opus 4.6(www.anthropic.com)
1842 points | 766 commentspage 5
petters 12 hours ago|
> We build Claude with Claude.

Yes and it shows. Gemini CLI often hangs and enters infinite loops. I bet the engineers at Google use something else internally.

AstroBen 13 hours ago||
Are these the coding tasks the highlighted terminal-bench 2.0 is referring to? https://www.tbench.ai/registry/terminal-bench/2.0?categories...

I'm curious what others think about these? There are only 8 tasks there specifically for coding

sutterd 4 hours ago||
I thought Opus 4.5 was an incredible quantum leap forward. I have used Opus 4.6 for a few hours and I hate it. Opus 4.5 would work interactively with me and ask questions. I loved that it would not do things you didn't ask it to do. If it found a bug, it would tell me and ask me if I wanted to fix it. One time there was an obvious one and I didn't want it to fix it. It left the bug. A lot of modesl could not have done that. The problem here is that sometimes when model think is a bug, they are breaking the code buyu fixing it. In my limited usage of Opus 4.6, it is not asking me clarifying questions and anything it comes across that it doesn't like, it changes. It is not working with me. The magic is gone. It feels just like those other models I had used.

I will try again tomorrow and see how it goes.

sega_sai 11 hours ago||
Based on these news it seems that Google is losing this game. I like Gemini and their CLI has been getting better, but not enough to catch up. I don't know if it is lack of dedicated models that is problem (my understanding Google's CLI just relies on regular Gemini) or something else.
laxk 6 hours ago|
Google knows how to wait. Let's give them a chance.
jonatron 10 hours ago||
Can someone ask: "what is the current carrying capacity of 25mm multicore armoured thermoplastic insulated cables with aluminium conductors, on perforated cable tray?" just to see how well it can look up information in BS 7671?
jaggederest 10 hours ago|
> what is the current carrying capacity of 25mm multicore armoured thermoplastic insulated cables with aluminium conductors, on perforated cable tray?

  This is an electrical engineering question rather than a software engineering one, but let me look it up for you.

  WEB SEARCH ("current carrying capacity 25mm multicore armoured thermoplastic insulated cable aluminium conductors perforated cable tray BS 7671")
   ↳ Found 10 results.

   Based on BS 7671 Table 4D4B (multicore armoured thermoplastic insulated cables with aluminium conductors), the current-carrying capacity for a 25mm cable on a perforated cable tray (Reference Method E) is approximately 80A for a 3- or 4-core cable at 30C ambient temperature and 70C conductor operating temperature.

    Note: The exact value depends on the number of cores (2-core vs 3/4-core) and whether any rating/correction factors (grouping, ambient temperature, thermal insulation) apply. I'd recommend confirming against the actual BS 7671:2018 Table 4D4B for your specific configuration, as the aluminium conductor ratings are lower than the copper equivalents (~102A for copper in the same conditions).

    This is outside my core domain of software engineering -- for critical electrical installations, always verify with the relevant standards documentation or a qualified electrical engineer.
jonatron 10 hours ago||
That's useless, unexpectedly just Google gives the correct answer.
esafak 7 hours ago||
Great. So it did not waste its memory on trivia. Don't memorize what you can look up.
1970-01-01 6 hours ago||
Here's one I've been using for awhile. The 'smarter' LLMs will overconfidently spit out 7. The dumber ones ask for more info. Opus 4.6 fails.

     A round drink coaster with a diameter of 9 sits between a beer glass and a wood table. The glass has a wall thickness of 1. What is the inner diameter of the glass?
raincole 6 hours ago||
Gemini 3 Pro gives me this:

> Based on the information provided, it is impossible to determine the inner diameter of the glass.

Here is why:

The Coaster Dimension: Knowing the coaster has a diameter of 9 only tells us the size of the object under the glass. It does not mean the glass is the same size; the glass could be smaller (fitting entirely on the coaster) or larger (overhanging the coaster). The Glass Dimensions: While you provided the wall thickness (1), you did not provide the outer diameter of the glass. To find the inner diameter, the formula is: Inner Diameter = Outer Diameter - (2 × Wall Thickness)

Without knowing the Outer Diameter of the glass, the calculation cannot be completed.

raincole 6 hours ago||
GPT 5.2 Chat:

> There isn’t enough information to determine the inner diameter *for sure*.

All we’re told is: - The coaster has a diameter of 9. - The glass sits on the coaster. - The glass wall thickness is 1.

Nothing explicitly states that the *outer diameter of the glass equals the coaster’s diameter*. The glass could be smaller than the coaster, which is very common.

*However*, if we make the usual implicit assumption in this kind of problem—that the coaster’s diameter matches the *outer diameter of the glass base*—then:

- Outer diameter of glass = 9 - Wall thickness = 1 on each side

So: \[ \text{Inner diameter} = 9 - 2(1) = 7 \]

*Inner diameter = 7 (same units as the coaster)*

Without that assumption, the problem is underdetermined.

tototrains 6 hours ago||
Opus 4.6 Extended thinking:

```

Finding the Inner Diameter The coaster sits beneath the glass on the table, meaning the glass rests on top of it. Assuming the coaster matches the outer diameter of the glass base:

Outer diameter of glass = diameter of coaster = 9 Wall thickness = 1 (on each side, so subtract twice)

dinner=9−2(1)=7d_{\text{inner}} = 9 - 2(1) = 7dinner =9−2(1)=7 The inner diameter of the glass is 7.

```

Makes its assumption clear, seems reasonable?

1970-01-01 5 hours ago||
Assumptions need to be stated or you're solving only a discreet part of the problem! Try this, see if you get another deadpan assumption.

     A solar system has 3 planets in concentric orbit. PlanetZ is the farthest with an orbit diameter of 9. PlanetY has an obit diameter one greater than PlanetX. What is the orbit diameter of PlanetX?
mikalauskas 6 hours ago|||
Minimax M2.1:

The inner diameter of the glass is *7*.

Here's the reasoning: - The coaster (diameter 9) sits between the glass and table, meaning the glass sits directly on the coaster - This means the *outer diameter of the glass equals the coaster diameter = 9* - The glass has a wall thickness of 1 on each side - *Inner diameter = Outer diameter - 2 × wall thickness* - Inner diameter = 9 - 2(1) = 9 - 2 = *7*

ra 9 hours ago||
Why are Anthropic such a horrible company to deal with?
danielbln 4 hours ago|
Care to elaborate?
ra 2 hours ago||
obscure billing, unreachable customer support gatekeeped by an overzealous chatbot, no transparency about inclusions, or changes to inclusions over time... just from recent experience.
psim1 14 hours ago||
I need an agent to summarize the buzzwordjargonsynergistic word salad into something understandable.
fhd2 14 hours ago|
That's a job for a multi agent system.
cyanydeez 12 hours ago||
yEAH, he should use a couple of agents to decode this.
winterrx 15 hours ago|
Agentic search benchmarks are a big gap up. let's see Codex release later today
More comments...