A key question in artificial intelligence is how often models go beyond just regurgitating and remixing what they have learned and produce truly novel ideas or insights.
A new project from Google DeepMind shows that with a few clever tweaks these models can at least surpass human expertise designing certain types of algorithms—including ones that are useful for advancing AI itself.
The company’s latest AI project, called AlphaEvolve, combines the coding skills of its Gemini AI model with a method for testing the effectiveness of new algorithms and an evolutionary method for producing new designs.
AlphaEvolve came up with more efficient algorithms for several kinds of computation, including a method for calculations involving matrices that betters an approach called the Strassen algorithm that has been relied upon for 56 years. The new approach improves the computational efficiency by reducing the number of calculations required to produce a result.
DeepMind also used AlphaEvolve to come up with better algorithms for several real-world problems including scheduling tasks inside datacenters, sketching out the design of computer chips, and optimizing the design of the algorithms used to build large language models like Gemini itself.
“These are three critical elements of the modern AI ecosystem,” says Pushmeet Kohli, head of AI for science at DeepMind. “This superhuman coding agent is able to take on certain tasks and go much beyond what is known in terms of solutions for them.”
Matej Balog, one of the research leads on AlphaEvolve, says that it is often difficult to know if a large language model has come up with a truly novel piece of writing or code, but it is possible to show that no person has come up with a better solution to certain problems. “We have shown very precisely that you can discover something that’s provably new and provably correct,” Balog says. “You can be really certain that what you have found couldn’t have been in the training data.”
Sanjeev Arora, a scientist at Princeton University specializing in algorithm design, says that the advancements made by AlphaEvolve are relatively small and only apply to algorithms that involve searching through a space of potential answers. But he adds: “search is a pretty general idea applicable to many settings.”
AI-powered coding is starting to change the way developers and companies write software. The latest AI models make it trivial for novices to build simple apps and websites, and some experienced developers are using AI to automate more of their work.
AlphaEvolve demonstrates the potential for AI to come up with completely novel ideas through continual experimentation and evaluation. DeepMind and other AI companies hope that AI agents will gradually learn to exhibit more general ingenuity in many areas, perhaps eventually generating ingenious solutions to a business problem or novel insights when given a particular problem.
Josh Alman, an assistant professor at Columbia University who works on algorithm design, says that AlphaEvolve does appear to be generating novel ideas rather than remixing stuff it’s learned during training. “It has to be doing something new and not just regurgitating,” he says.