Amateur armed with ChatGPT solves an Erdős problem

Posted by pr337h4m 20 hours ago

Amateur armed with ChatGPT solves an Erdős problem(www.scientificamerican.com)

https://www.erdosproblems.com/1196

453 points | 288 commentspage 2

Eufrat 11 hours ago|

Humans and very often the machines we create solve problems additively. Meaning we build on top of existing foundations and we can get stuck in a way of thinking as a result of this because people are loathe to reinvent the wheel. So, I don’t think it’s surprising to take a naïve LLM and find out that because of the way it’s trained that it came up with something that many experts in the field didn’t try.

I think LLMs can help in limited cases like this by just coming up with a different way of approaching a problem. It doesn’t have to be right, it just needs to give someone an alternative and maybe that will shake things up to get a solution.

That said, I have no idea what the practical value of this Erdős problem is. If you asked me if this demonstrates that LLMs are not junk. My general impression is that is like asking me in 1928 if we should spent millions of dollars of research money on number theory. The answer is no and get out of my office.

traes 5 hours ago||

Discussed at the time: https://news.ycombinator.com/item?id=47774494

yrds96 7 hours ago||

Given by the fact that the problem is 60 year old, isn't there a chance this was indirect solved already and the model just crossed informations to figure out the problem?

By looking the website this problem was never discussed by humans. The last comments were about gpt discovering it. I was expecting older comments coming to a 60 year old problem.

Am I missing something?

Great discovery though, there might be problems like that same case that worth a try for a "gpt check"

traes 5 hours ago||

Exceedingly unlikely. This was one of the more discussed Erdos problems, and multiple experts have attested to the technique's novelty. If you're referring to the lack of comments on the erdosproblems website, that doesn't really mean much. From its own blog[0], the site was only started in 2023 and only really gained momentum as a place to discuss AI solving attempts, you aren't going to see serious mathematicians discussing the problems there even if there have been significant efforts to solve it.

[0]: https://www.erdosproblems.com/forum/thread/blog:1

whiplash451 6 hours ago||

To some extent, does it matter?

If models are able to pull and join information that already existed in pieces but humankind never discovered by itself, doesn’t this count towards progress anyways?

fuglede_ 5 hours ago||

It would be very helpful to know in understanding the capabilities of the models; and in getting intuition about where they are best applicable.

If the reason it was able to output the proof is that it happened to be included in an in-house university report written in Georgian, then that would make it less useful for research than if it's new entirely.

jzer0cool 9 hours ago||

Could someone share a bit into the problem and the key portion from proof? For someone just knowing basics on proofs.

nomilk 5 hours ago||

A similar announcement was made a few months ago, and Terence Tao came out a few days later and said it wasn't what it seemed at first, in that it was a rediscovery of an already known (albeit esoteric) result...

logicprog 2 hours ago|

They literally have a quote from Tao in the article saying it was a novel approach humans hadn't tried, and that the problem hadn't been solved even after a lot of professional attention.

mrabcx 3 hours ago||

Can the other AI agents such as Gemini, Calude or Deepseek etc also solve this problem?

winwang 9 hours ago||

Obviously nowhere near Erdos problem complexity but I've been using GPT (in Codex) to prove a couple theorems (for algos) and I've found it a bit better than Claude (Code) in this aspect.

iqihs 10 hours ago||

referring to Tao as just a 'mathematician' gave me a good chuckle

cubefox 5 hours ago||

Current headline:

"An amateur just solved a 60-year-old math problem—by asking AI"

A more honest title would be:

"An AI just solved a 60-year-old math problem—after being asked by amateur"

(Imagine the headline claimed instead that a professor just solved a math problem by asking a grad student.)

ngruhn 4 hours ago|

Previous problems solved by AI had some amount of expert guidance/steering. Here, I guess the emphasis is that there was none of that.

More comments...