A recent experience with ChatGPT 5.5 Pro

Posted by _alternator_ 12 hours ago

A recent experience with ChatGPT 5.5 Pro(gowers.wordpress.com)

https://twitter.com/wtgowers/status/2052830948685676605

https://xcancel.com/wtgowers/status/2052830948685676605

439 points | 292 commentspage 5

jdw64 9 hours ago|

[dead]

bananaflag 9 hours ago||

> it sounds like there were already precedents or existing pieces of knowledge, but humans had not thought to connect them

A lot of math research is like that. And, like the blog post suggests, problems one gives PhD students are 95% like that.

jdw64 9 hours ago||

Maybe I am still fortunate to have become a programmer.

Most of what I do is just assemble things that other people have already built.

tanepiper 8 hours ago|||

Basically medical science too. My wife was able to diagnose her own anemia that the doctors kept missing, and has since been able to have iron infusions.

The human doctors kept ignoring the signals, kept putting it down to 'diet' and 'exercise' (even though she does plenty of both)

agiipullor 6 hours ago||

no blood tests were done?

themafia 9 hours ago||

> there were already precedents or existing pieces of knowledge, but humans had not thought to connect them

We used to call that "low hanging fruit."

shevy-java 9 hours ago||

[flagged]

Smaug123 8 hours ago|

Tim Gowers is a Fields medallist who has supervised 13 students (https://www.genealogy.math.ndsu.nodak.edu/id.php?id=67729). He's totally capable of gauging what a piece of PhD-level research is.

verisimi 9 hours ago||

[flagged]

slopinthebag 8 hours ago||

AI generated article btw.

Maybe if you find AI to be doing stuff you find impressive, the stuff you were doing wasn't that impressive? Worth ruminating on your priors at least.

hodgehog11 7 hours ago||

This is beyond ridiculous to say considering whose blog this is.

For those that don't know, this is Timothy Gowers. He is one of the most accomplished mathematicians in the world. Like Terence Tao, he is considered one of the world leaders in mathematics and tends to have good judgement in where the field is going.

Even without that knowledge, no, this article is certainly not AI generated. It has none of the tells.

reasonableklout 8 hours ago||

What makes you think either the tweet or blog post are AI generated?

bambax 7 hours ago||

> quite a lot of perfectly good human mathematics consists in putting together existing knowledge and proof techniques

Creativity is connecting ideas from different domains and see if something from one field applies to another. I do think AI is overhyped generally; but a major benefit from AI could be that after ingesting all the existing human knowledge (something no single human can ever hope to achieve) it would "mix and connect" it and come up with novel insights.

Most published research sits ignored and unread; AI can uncover and use everything.

imiric 7 hours ago|

> Creativity is connecting ideas from different domains and see if something from one field applies to another.

That's true. The question is whether the produced pattern has any value. LLMs are incapable of determining this, and will still often hallucinate, and make random baseless claims that can convince anyone except human domain experts. And that's still a difficult challenge: a domain expert is still needed to verify the output, which in some fields is very labor intensive, especially if the subject is at the edge of human knowledge.

The second related issue is the lack of reproducibility. The same LLM given the same prompt and context can produce different results. This probability increases with more input and output tokens, and with more obscure subjects.

The tools are certainly improving, but these two issues are still a major hurdle that don't get nearly as much attention as "agents", "skills", and whatever adjacent trend influencers are pushing today.

And can we please stop calling pattern matching and generation "intelligence"? This farce has gone on long enough.

agiipullor 6 hours ago||

> And can we please stop calling pattern matching and generation "intelligence"

thats literally what an IQ test tests - abstract pattern matching. but I guess you dont like IQ tests either

einrealist 8 hours ago|

"After 16 minutes and 41 seconds, it came back" ... "further 47 minutes and 39 seconds" ... "After 13 minutes and 33 seconds" ... "After 9 minutes and 12 seconds" ... "After 31 minutes and 40 seconds" ... plus other computations

Anyone spotting the issue here? What did that really cost?

I am not against compute being used for scientific or other important problems. We did that before LLMs. However, the major LLM gatekeepers want to make all industries and companies dependent on their models. And, at some point, they need to charge them the actual, unsubsidized costs for the compute. In the meantime, companies restructure in the hopes that the compute costs remain cheap.

sidkshatriya 7 hours ago||

> "After 16 minutes and 41 seconds, it came back" ... "further 47 minutes and 39 seconds" ... "After 13 minutes and 33 seconds" ... "After 9 minutes and 12 seconds" ... "After 31 minutes and 40 seconds" ... plus other computations Anyone spotting the issue here? What did that really cost?

Whatever the Joules... (convert to $ using your preferred benchmark price) it is a fraction to what it might take a human Ph. D. weeks to feed and sustain themselves when working on the same problem. The economics on LLMs is just unbeatable (sadly) when compared to us humans.

einrealist 7 hours ago||

Compute in science was already subsidized by public funding or by donations. Most supercomputers are financed this way. And that's a good thing. If you have a good science problem that can be computed, apply for compute time. There is nothing wrong to apply that to LLMs as well, like I wrote in my initial post. The human is still required to identity problems that are worth to be computed, to create prompts that the LLM can act on, and to verify results. But, OpenAI providing compute for basically free is still tied to a different incentive: to fuel the hype and to capture the market, while distorting/obfuscating the real costs. That's also the reason for why we cannot claim that 'economics on LLMs is just unbeatable'. It depends on the problem, the reason for a prompt.

colordrops 7 hours ago||

Still not as bad for the environment as animal agriculture, and animal agriculture is absolutely not necessary and only causes harm and suffering for taste pleasure. At least with LLMs we get many positive advancements from them. I don't see these sorts of comments every time someone posts a burger review.

einrealist 7 hours ago||

Did I praise our animal agriculture anywhere?

colordrops 5 hours ago||

We have a wide audience.