AI Breakthrough: Solving Million-Step Math Problems

Artificial intelligence systems have made breakthrough after breakthrough mastering chess, in which games typically last about 40 moves. Now, to help solve the world’s toughest math problems, researchers have developed a new AI model that finds complex solutions requiring thousands to millions of steps. They suggest the new algorithms they built for the task might one day help detect events such as hurricanes and financial crashes that are rare but have disastrous impacts when they do happen.

Scientists are increasingly exploring how well AI can solve math problems. For example, Google DeepMind’s AlphaProof performed as well as a silver medalist in the 2024 International Mathematical Olympiad, a high-school level math competition, and OpenAI’s o3 system recently debuted with a strong showing on benchmark problems in math, science, and computer programming.

In a new study, which has not yet been peer reviewed, researchers at the California Institute of Technology and their colleagues tackled more challenging math problems, the kind that have perplexed professional mathematicians for decades.

“When it comes to the kind of problems you might find in math olympiads, they’re typically proofs involving 30 or 40 steps, on the same order of magnitude as an average game of chess,” says Sergei Gukov, a professor of theoretical physics and mathematics at the California Institute of Technology, in Pasadena. “We’re focusing on sophisticated research-level math problems with solutions involving thousands or millions or even billions of steps.”

Ultimately, “I’m hoping that we’ll be able to solve Millennium Prize problems using AI,” Gukov says, referring to a contest involving the most difficult mathematical problems in the world. “This is probably too optimistic on my part, but it’s good to have some north stars. At the moment, we’re trying to focus on problems one level down instead, the kind that have remained open for many years.”

AI Tackles the Andrews-Curtis Conjecture

In the new study, Gukov and his colleagues focused on the Andrews-Curtis conjecture, a combinatorial group theory problem first proposed 60 years ago. “Combinatorial group theory is about transformations of objects,” Gukov says. “Think about a Rubik’s cube. It’s a very simple group with basic operations and transformations—you can rotate different planes of a Rubik’s cube vertically and horizontally. The Andrews-Curtis conjecture is like a Rubik’s cube on steroids—instead of a 3 by 3 by 3 group, it’s more like a 100 by 100 by 100 group.”

Although the researchers did not prove the main conjecture, their new system disproved related families of problems known as potential counterexamples that had remained open for about 25 years. These counterexamples are essentially mathematical cases that would disprove the conjecture. Ruling out these counterexamples increases the likelihood that the conjecture is true.

To attack these problems, Gukov and his colleagues adopted a strategy where they looked for unexpected, convoluted solutions. “If you were to ask DeepSeek or o3 or ChatGPT or similar models to solve any of the problems we studied, they wouldn’t be able find answers,” he says. “They’re good at producing solutions that are expected or typical, at parroting what’s seen before. They’re meant to be general-purpose models. We’re looking at long sequences of steps that are hard to find, that are outliers among the statistical distribution of solutions.”

To develop these “super-moves,” as the researchers call them, Gukov and his colleagues used the approach known as reinforcement learning. They first gave the AI easy problems to solve, and then gave it progressively more difficult problems. The scientists looked for strategies that didn’t require large amounts of computing power; Gukov says that all the training was done on a single GPU.

Gukov notes that in reinforcement learning research, people typically use the same 10 to 15 algorithms. “What I find most exciting is that by thinking about solving these problems with these very long horizons, we developed new algorithms for AI,” he says.

Potential Applications Beyond Mathematics

These new algorithms “might have many applications for them outside of pure mathematics,” Gukov says. “They can find outliers, anomalies, black swans—events that are very, very rare, but can have very heavy price tags if they do happen.” The rarity of these events has made them difficult for AI to accurately forecast—their unusual nature means there is little historical data for predictive models to learn from. The ability to forecast the most likely and catastrophic of these scenarios could help societies prepare optimal mitigation strategies.

Gukov’s team is now investigating other longstanding math problems to help them develop the algorithms. They detailed their findings 13 February in a study on the ArXiv preprint server.

From Your Site Articles

Topics

Sections

More

For IEEE Members

For IEEE Members

IEEE Spectrum

Follow IEEE Spectrum

Support IEEE Spectrum

AI Solves Million-Step Math Problems

New algorithms developed for complex math might help find black swan outlier events

AI Tackles the Andrews-Curtis Conjecture

Potential Applications Beyond Mathematics

A Self-Balancing Exoskeleton Strides Toward Market

IEEE President’s Note: Working Toward One IEEE

Video Friday: Good Over All Terrains

Related Stories

MIT and IBM Find Clever AI Ways Around Brute-Force Math

Brain-inspired Computing Is Ready for the Big Time

"Thinking" Visually Boosts AI Problem Solving

Topics

Sections

More

For IEEE Members

For IEEE Members

IEEE Spectrum

Follow IEEE Spectrum

Support IEEE Spectrum

Access Thousands of Articles — Completely Free

Create an account and get exclusive content and features: Save articles, download collections, and talk to tech insiders — all free! For full access and benefits, join IEEE as a paying member.

AI Solves Million-Step Math Problems

New algorithms developed for complex math might help find black swan outlier events

AI Tackles the Andrews-Curtis Conjecture

Potential Applications Beyond Mathematics

A Self-Balancing Exoskeleton Strides Toward Market

IEEE President’s Note: Working Toward One IEEE

Video Friday: Good Over All Terrains

Related Stories

MIT and IBM Find Clever AI Ways Around Brute-Force Math

Brain-inspired Computing Is Ready for the Big Time

"Thinking" Visually Boosts AI Problem Solving