Forum: War Ensemble BBS

Could AlphaEvolve find the sixth busy beaver ?

From Mild Shock@janburse@fastmail.fm to comp.lang.prolog,sci.logic,sci.math on Sun Nov 30 13:36:52 2025

From Newsgroup: comp.lang.prolog

Hi,

What we thought:

Prediction 5 . It will never be proved that
Σ(5) = 4,098 and S(5) = 47,176,870.
-- Allen H. Brady, 1990 .

How it started:

To investigate AlphaEvolve’s breadth, we applied
the system to over 50 open problems in mathematical
analysis, geometry, combinatorics and number theory.
The system’s flexibility enabled us to set up most
experiments in a matter of hours. In roughly 75% of
cases, it rediscovered state-of-the-art solutions, to
the best of our knowledge. https://deepmind.google/blog/alphaevolve-a-gemini-powered-coding-agent-for-designing-advanced-algorithms/

How its going:

We prove that S(5) = 47, 176, 870 using the Coq proof
assistant. The Busy Beaver value S(n) is the maximum
number of steps that an n-state 2-symbol Turing machine
can perform from the all-zero tape before halting, and
S was historically introduced by Tibor Radó in 1962 as
one of the simplest examples of an uncomputable function.
The proof enumerates 181,385,789 Turing machines with 5
states and, for each machine, decides whether it halts or
not. Our result marks the first determination of a new
Busy Beaver value in over 40 years and the first Busy
Beaver value ever to be formally verified, attesting to the
effectiveness of massively collaborative online research https://arxiv.org/pdf/2509.12337

They claim not having used much AI. But could for
example AlphaEvolve do it somehow nevertheless, more or
less autonomously, and find the sixth busy beaver?

Bye
--- Synchronet 3.21a-Linux NewsLink 1.2

From Mild Shock@janburse@fastmail.fm to comp.lang.prolog,sci.logic,sci.math on Sun Nov 30 13:56:52 2025

From Newsgroup: comp.lang.prolog

Hi,

Wonder why the Coq proof even should be
different from anything that AI could produce.
Its not a typical Euclid proof in a few steps,

it rather uses also enumeration, just like the
Fly Speck proof, for the Keppler Conjecture. So
lets see what happens next, could AlphaEvolve

find the sixth busy beaver?

Bye

P.S.: Here picture of an old Busy Beaver ASIC
(Application-Specific Integrated Circuit)

Application Fun
Technology 1500
Manufacturer VLSI Tech
Type Semester Thesis
Package DIP64
Dimensions 3200μm x 3200μm
Gates 2 kGE
Voltage 5 V
Clock 20 MHz

The Busy Beaver Coprocessor has been designed to solve the Busy Beaver Function for 5 states. This function (also known as the Rado's Sigma
Function) is an uncomputable problem from information theory. The input argument is a natural number 'n' that represents the complexity of an algorithm described as a Turing Machine. http://asic.ethz.ch/cg/1990/Busy_Beaver.html

Mild Shock schrieb:

Hi,

What we thought:

Prediction 5 . It will never be proved that
Σ(5) = 4,098 and S(5) = 47,176,870.
-- Allen H. Brady, 1990 .

How it started:

To investigate AlphaEvolve’s breadth, we applied
the system to over 50 open problems in mathematical
analysis, geometry, combinatorics and number theory.
The system’s flexibility enabled us to set up most
experiments in a matter of hours. In roughly 75% of
cases, it rediscovered state-of-the-art solutions, to
the best of our knowledge. https://deepmind.google/blog/alphaevolve-a-gemini-powered-coding-agent-for-designing-advanced-algorithms/

How its going:

We prove that S(5) = 47, 176, 870 using the Coq proof
assistant. The Busy Beaver value S(n) is the maximum
number of steps that an n-state 2-symbol Turing machine
can perform from the all-zero tape before halting, and
S was historically introduced by Tibor Radó in 1962 as
one of the simplest examples of an uncomputable function.
The proof enumerates 181,385,789 Turing machines with 5
states and, for each machine, decides whether it halts or
not. Our result marks the first determination of a new
Busy Beaver value in over 40 years and the first Busy
Beaver value ever to be formally verified, attesting to the
effectiveness of massively collaborative online research https://arxiv.org/pdf/2509.12337

They claim not having used much AI. But could for
example AlphaEvolve do it somehow nevertheless, more or
less autonomously, and find the sixth busy beaver?

Bye

--- Synchronet 3.21a-Linux NewsLink 1.2

From Jeff Barnett@jbb@notatt.com to comp.lang.prolog,sci.logic,sci.math on Mon Dec 1 14:29:19 2025

From Newsgroup: comp.lang.prolog

On 11/30/2025 5:36 AM, Mild Shock wrote:

Hi,

What we thought:

Prediction 5 . It will never be proved that
Σ(5) = 4,098 and S(5) = 47,176,870.
-- Allen H. Brady, 1990 .

How it started:

To investigate AlphaEvolve’s breadth, we applied
the system to over 50 open problems in mathematical
analysis, geometry, combinatorics and number theory.
The system’s flexibility enabled us to set up most
experiments in a matter of hours. In roughly 75% of
cases, it rediscovered state-of-the-art solutions, to
the best of our knowledge. https://deepmind.google/blog/alphaevolve-a-gemini-powered-coding-agent- for-designing-advanced-algorithms/

How its going:

We prove that S(5) = 47, 176, 870 using the Coq proof
assistant. The Busy Beaver value S(n) is the maximum
number of steps that an n-state 2-symbol Turing machine
can perform from the all-zero tape before halting, and
S was historically introduced by Tibor Radó in 1962 as
one of the simplest examples of an uncomputable function.
The proof enumerates 181,385,789 Turing machines with 5
states and, for each machine, decides whether it halts or
not. Our result marks the first determination of a new
Busy Beaver value in over 40 years and the first Busy
Beaver value ever to be formally verified, attesting to the
effectiveness of massively collaborative online research https://arxiv.org/pdf/2509.12337

They claim not having used much AI. But could for
example AlphaEvolve do it somehow nevertheless, more or
less autonomously, and find the sixth busy beaver?

I'm fascinated by this result and I'd appreciate it if you could
elaborate more. Is the problem presented to the automation:

1. Prove "S(5) = 47,176,870" along with a 'def' of S?
2. Enumerate & check behavior or 47,176,870 machines?
3. Like 2 above but supplied with lemmas such as prove this case halts
implies a large number of other cases halt faster?
4. Like 3 above but lemmas discovered, perhaps with 'encouragement'?
5. other approaches or other chore splits between man and machine?
6. etc?

I think what I'm asking is for the work flow that led to the result.
--
Jeff Barnett

--- Synchronet 3.21a-Linux NewsLink 1.2

From Mild Shock@janburse@fastmail.fm to comp.lang.prolog,sci.logic,sci.math on Tue Dec 2 00:14:18 2025

From Newsgroup: comp.lang.prolog

Hi,

Meanwhile I have found some papers where some
earlier lemmas are proved, that didn't make it
into the Coq proof. So I am not sure

whether Coq is the first. Seems there are
different proofs possible. But I didn't spend
enough time on the matter, to explain

details. Still in the collection phase.

Sorry that I am not an excellent help here.

Bye

Jeff Barnett schrieb:

On 11/30/2025 5:36 AM, Mild Shock wrote:

Hi,

What we thought:

Prediction 5 . It will never be proved that
Σ(5) = 4,098 and S(5) = 47,176,870.
-- Allen H. Brady, 1990 .

How it started:

To investigate AlphaEvolve’s breadth, we applied
the system to over 50 open problems in mathematical
analysis, geometry, combinatorics and number theory.
The system’s flexibility enabled us to set up most
experiments in a matter of hours. In roughly 75% of
cases, it rediscovered state-of-the-art solutions, to
the best of our knowledge.
https://deepmind.google/blog/alphaevolve-a-gemini-powered-coding-agent- for-designing-advanced-algorithms/

How its going:

We prove that S(5) = 47, 176, 870 using the Coq proof
assistant. The Busy Beaver value S(n) is the maximum
number of steps that an n-state 2-symbol Turing machine
can perform from the all-zero tape before halting, and
S was historically introduced by Tibor Radó in 1962 as
one of the simplest examples of an uncomputable function.
The proof enumerates 181,385,789 Turing machines with 5
states and, for each machine, decides whether it halts or
not. Our result marks the first determination of a new
Busy Beaver value in over 40 years and the first Busy
Beaver value ever to be formally verified, attesting to the
effectiveness of massively collaborative online research
https://arxiv.org/pdf/2509.12337

They claim not having used much AI. But could for
example AlphaEvolve do it somehow nevertheless, more or
less autonomously, and find the sixth busy beaver?

I'm fascinated by this result and I'd appreciate it if you could
elaborate more. Is the problem presented to the automation:

1. Prove "S(5) = 47,176,870" along with a 'def' of S?
2. Enumerate & check behavior or 47,176,870 machines?
3. Like 2 above but supplied with lemmas such as prove this case halts
implies a large number of other cases halt faster?
4. Like 3 above but lemmas discovered, perhaps with 'encouragement'?
5. other approaches or other chore splits between man and machine?
6. etc?

I think what I'm asking is for the work flow that led to the result.

--- Synchronet 3.21a-Linux NewsLink 1.2

From Mild Shock@janburse@fastmail.fm to comp.lang.prolog,sci.logic,sci.math on Tue Dec 2 00:17:59 2025

From Newsgroup: comp.lang.prolog

Hi,

I suspect to make a serious Coq endeavour,
I would still study first:

The Busy Beaver Frontier / Scott Aaronson - 2022 https://www.scottaaronson.com/papers/bb.pdf

But the above paper is also 22 pages. So
not a 5 minute read,

Bye

Mild Shock schrieb:

Hi,

Meanwhile I have found some papers where some
earlier lemmas are proved, that didn't make it
into the Coq proof. So I am not sure

whether Coq is the first. Seems there are
different proofs possible. But I didn't spend
enough time on the matter, to explain

details. Still in the collection phase.

Sorry that I am not an excellent help here.

Bye

Jeff Barnett schrieb:

On 11/30/2025 5:36 AM, Mild Shock wrote:

Hi,

What we thought:

Prediction 5 . It will never be proved that
Σ(5) = 4,098 and S(5) = 47,176,870.
-- Allen H. Brady, 1990 .

How it started:

To investigate AlphaEvolve’s breadth, we applied
the system to over 50 open problems in mathematical
analysis, geometry, combinatorics and number theory.
The system’s flexibility enabled us to set up most
experiments in a matter of hours. In roughly 75% of
cases, it rediscovered state-of-the-art solutions, to
the best of our knowledge.
https://deepmind.google/blog/alphaevolve-a-gemini-powered-coding-agent- >>> for-designing-advanced-algorithms/

How its going:

We prove that S(5) = 47, 176, 870 using the Coq proof
assistant. The Busy Beaver value S(n) is the maximum
number of steps that an n-state 2-symbol Turing machine
can perform from the all-zero tape before halting, and
S was historically introduced by Tibor Radó in 1962 as
one of the simplest examples of an uncomputable function.
The proof enumerates 181,385,789 Turing machines with 5
states and, for each machine, decides whether it halts or
not. Our result marks the first determination of a new
Busy Beaver value in over 40 years and the first Busy
Beaver value ever to be formally verified, attesting to the
effectiveness of massively collaborative online research
https://arxiv.org/pdf/2509.12337

They claim not having used much AI. But could for
example AlphaEvolve do it somehow nevertheless, more or
less autonomously, and find the sixth busy beaver?

I'm fascinated by this result and I'd appreciate it if you could
elaborate more. Is the problem presented to the automation:

  1. Prove "S(5) = 47,176,870" along with a 'def' of S?
  2. Enumerate & check behavior or 47,176,870 machines?
  3. Like 2 above but supplied with lemmas such as prove this case halts >>      implies a large number of other cases halt faster?
  4. Like 3 above but lemmas discovered, perhaps with 'encouragement'?
  5. other approaches or other chore splits between man and machine?
  6. etc?

I think what I'm asking is for the work flow that led to the result.

--- Synchronet 3.21a-Linux NewsLink 1.2

From Mild Shock@janburse@fastmail.fm to comp.lang.prolog,sci.logic,sci.math on Tue Dec 2 00:22:25 2025

From Newsgroup: comp.lang.prolog

Hi,

Here is a 2024 claim of BB(5)

Skelet #17 and the fifth Busy Beaver number
Chris Xu
We prove nonhalting of the Turing machine dubbed "Skelet #17", known to
be one of the toughest 5-state, 2-symbol Turing machines to analyze.
Combined with the efforts of The Busy Beaver Challenge, we are therefore
able to show that BB(5), the fifth Busy Beaver number, equals 47,176,870. https://arxiv.org/abs/2407.02426

Thats before 2025. But dunno whether its flawed.
Maybe the 2025 paper has has the meric that some proof
was computerized. While the above paper carries

proofs more informally. Maybe feed it into an AI
and you get formal proofs, dunno. Maybe?

Bye

Mild Shock schrieb:

Hi,

I suspect to make a serious Coq endeavour,
I would still study first:

The Busy Beaver Frontier / Scott Aaronson - 2022 https://www.scottaaronson.com/papers/bb.pdf

But the above paper is also 22 pages. So
not a 5 minute read,

Bye

Mild Shock schrieb:

Hi,

Meanwhile I have found some papers where some
earlier lemmas are proved, that didn't make it
into the Coq proof. So I am not sure

whether Coq is the first. Seems there are
different proofs possible. But I didn't spend
enough time on the matter, to explain

details. Still in the collection phase.

Sorry that I am not an excellent help here.

Bye

Jeff Barnett schrieb:

On 11/30/2025 5:36 AM, Mild Shock wrote:

Hi,

What we thought:

Prediction 5 . It will never be proved that
Σ(5) = 4,098 and S(5) = 47,176,870.
-- Allen H. Brady, 1990 .

How it started:

To investigate AlphaEvolve’s breadth, we applied
the system to over 50 open problems in mathematical
analysis, geometry, combinatorics and number theory.
The system’s flexibility enabled us to set up most
experiments in a matter of hours. In roughly 75% of
cases, it rediscovered state-of-the-art solutions, to
the best of our knowledge.
https://deepmind.google/blog/alphaevolve-a-gemini-powered-coding-agent- >>>> for-designing-advanced-algorithms/

How its going:

We prove that S(5) = 47, 176, 870 using the Coq proof
assistant. The Busy Beaver value S(n) is the maximum
number of steps that an n-state 2-symbol Turing machine
can perform from the all-zero tape before halting, and
S was historically introduced by Tibor Radó in 1962 as
one of the simplest examples of an uncomputable function.
The proof enumerates 181,385,789 Turing machines with 5
states and, for each machine, decides whether it halts or
not. Our result marks the first determination of a new
Busy Beaver value in over 40 years and the first Busy
Beaver value ever to be formally verified, attesting to the
effectiveness of massively collaborative online research
https://arxiv.org/pdf/2509.12337

They claim not having used much AI. But could for
example AlphaEvolve do it somehow nevertheless, more or
less autonomously, and find the sixth busy beaver?

I'm fascinated by this result and I'd appreciate it if you could
elaborate more. Is the problem presented to the automation:

  1. Prove "S(5) = 47,176,870" along with a 'def' of S?
  2. Enumerate & check behavior or 47,176,870 machines?
  3. Like 2 above but supplied with lemmas such as prove this case halts >>>      implies a large number of other cases halt faster?
  4. Like 3 above but lemmas discovered, perhaps with 'encouragement'? >>>   5. other approaches or other chore splits between man and machine?
  6. etc?

I think what I'm asking is for the work flow that led to the result.

--- Synchronet 3.21a-Linux NewsLink 1.2

Who's Online

System Info

Sysop:	DaiTengu
Location:	Appleton, WI
Users:	1,090
Nodes:	10 (0 / 10)
Uptime:	158:12:45
Calls:	13,922
Files:	187,021
D/L today:	221 files (58,560K bytes)
Messages:	2,457,273

Could AlphaEvolve find the sixth busy beaver ?

Who's Online

System Info