
Posted by andsoitis 1 day ago

China is having another AI moment (www.economist.com)
19 points | 9 comments
solarkraft 1 day ago
Fine, I’ll try GLM 5.2 since it has arrived in my OpenCode subscription. I haven’t used it much, but GLM 5.1 hasn’t blown me away. If it really is that good, a minor release seems to undersell it. I found Anthropic’s .5 jump a bit silly, but it does its job in communicating the advancement.
aerodexis 1 day ago
How do you go about comparing models? Personally, I've found that the quality of output is overwhelmingly dictated by the quality of the input I give it - and that the quality of input differs massively depending on the task at hand, how much documentation exists, what kind of mood I'm in, etc.

Hence, I'm beginning to believe that, unless you're doing very specialized work, the model itself has become a commodity, incremental quality is irrelevant, and economics and privacy are what actually matter now.

ignoramous 5 hours ago
> How do you go about comparing models?

I do this often: I have a prompt in mind before I manually find the fix for a subtle bug or plan out how I'd implement a feature. Then I give that same prompt to the models I want to test and see if any of them get it right.

To my surprise, the Claudes always perform better, so it seems that either my prompting behaviour is attuned to them or the Claudes are good at "figuring it out".

That said, in my experiments, I've always found that given a "good" prompt (with just the right detail), the top coding models (including those from Chinese labs, which I now increasingly prefer) show no discernible difference; in which case, I usually defer to the cheapest & fastest of those models (presently, that's DeepSeek v4 Flash & Pro, MiMo v2.5 & Pro, or MiniMax M3; these are a coding tier below GLM5.2, Kimi K2.7, and Qwen 3.7 Max, imo) as ranked by OpenRouter: https://openrouter.ai/rankings?view=month
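
A minimal sketch of that workflow, in case it helps: send the same prompt to each candidate, check the answers against the fix you already know, then take the cheapest model that passed. Everything here is a hypothetical stand-in, not any provider's API: the "models" are plain functions from prompt to answer, `is_correct` is whatever check you'd apply by hand, and the cost table is something you'd fill in yourself.

```python
from typing import Callable, Dict, Optional

# Hypothetical stand-in: a "model" is any function from prompt text to answer text.
ModelFn = Callable[[str], str]


def compare_models(
    prompt: str,
    models: Dict[str, ModelFn],
    is_correct: Callable[[str], bool],
) -> Dict[str, bool]:
    """Send the same prompt to every model and record which answers pass the check."""
    return {name: is_correct(fn(prompt)) for name, fn in models.items()}


def pick_cheapest(
    results: Dict[str, bool],
    cost: Dict[str, float],
) -> Optional[str]:
    """Among the models that passed, return the one with the lowest cost (None if none passed)."""
    passing = [name for name, ok in results.items() if ok]
    if not passing:
        return None
    return min(passing, key=lambda name: cost[name])
```

In practice each entry in `models` would wrap a real API call, and the check is usually the hard part: for a bug fix it could be "does the test suite pass with this patch applied".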

dsign 1 day ago
> the model itself has become a commodity

Indeed. So the winner is going to be whoever can also commoditize the rest, i.e. hardware and energy costs. In the long run, whoever has the factories to make chips, windmills and solar panels is going to be ahead. I think this is well understood by the American elites, and it may well explain their heightened hawkishness.

sleepyguy 1 day ago
http://archive.today/uELSX
dieselgate 1 day ago
What the heck, after selecting the checkbox to "Verify I am Human" on this URL I am prompted to scan a QR code. I believe I've read mentions of this around HN but have never seen it before. I'm definitely not willing to scan a QR code to prove anything, but is this becoming standard practice now?

No criticism of the above poster, because they are helping us all out here, but I am astounded.

netruk44 1 day ago
You can select a different test instead of scanning the QR code.

There's a button under the QR code for a "visual test" which is just the usual "pick pictures containing a bicycle" thing.

dieselgate 1 day ago
Thank you. After revisiting, I was only presented with the standard bicycle picture test.
jmathai 1 day ago
Odd. I receive a non-English website when clicking on that link.
abick91 1 day ago
[flagged]