* . *
  • About
  • Advertise
  • Privacy & Policy
  • Contact
Thursday, October 2, 2025
Earth-News
  • Home
  • Business
  • Entertainment
    Toni Braxton Is Turning Her Biggest Hits Into Lifetime Movies – Yahoo

    Toni Braxton Is Turning Her Biggest Hits Into Lifetime Movies – Yahoo

    Major airline to offer new in-flight entertainment options for passengers – PennLive.com

    Major airline to offer new in-flight entertainment options for passengers – PennLive.com

    Penn State-Themed Restaurant and Entertainment Spot Happy Valley Live Set to Open in State College – StateCollege.com

    Penn State-Themed Restaurant and Entertainment Spot Happy Valley Live Set to Open in State College – StateCollege.com

    The Police Made Chart History With This 1979 Hit Nearly 50 Years Ago – Yahoo

    How The Police Changed Music Forever with Their Iconic 1979 Hit Nearly 50 Years Ago

    Good Deed Entertainment Acquires Worldwide Rights To Liza Mandelup’s Documentary ‘Caterpillar’ – Deadline

    Good Deed Entertainment Lands Global Rights to Liza Mandelup’s Captivating Documentary ‘Caterpillar

    Danielle Fishel Explains Why Being on “DWTS” Makes Her Feel ‘Like It’s 1994 Again’ Filming “Boy Meets World” (Exclusive) – Yahoo

    Danielle Fishel Explains Why Being on “DWTS” Makes Her Feel ‘Like It’s 1994 Again’ Filming “Boy Meets World” (Exclusive) – Yahoo

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    A Tech Expo Shows What China Can Make, but Not Who’ll Buy It All – The New York Times

    Inside China’s Tech Expo: Cutting-Edge Innovations Face Uncertain Demand

    Steampunk Metal Oval Technology Sense Sunglasses Personality Handmade Chain Multicolor Sunglasses UV400 – The San Joaquin Valley Sun

    Steampunk Metal Oval Sunglasses with Handmade Multicolor Chain – Bold UV400 Protection and Unique Style

    STELLA Automotive AI Appoints Fred Seidelman as Chief Technology Officer – Yahoo Finance

    STELLA Automotive AI Appoints Fred Seidelman as New Chief Technology Officer

    Saving Energy and Money with Smart Technology – Terms of Service with Clare Duffy – Podcast on CNN Podcasts – CNN

    Saving Energy and Money with Smart Technology – Terms of Service with Clare Duffy – Podcast on CNN Podcasts – CNN

    Four Strategic Signals Technology Leaders Are Tuning In To – SPONSOR CONTENT FROM ARM – Harvard Business Review

    Four Essential Strategic Signals Every Technology Leader Should Watch

    Virginia Tech hosts annual New Music + Technology Festival this week – Cardinal News

    Virginia Tech Kicks Off Exciting Annual New Music and Technology Festival This Week

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
  • Home
  • Business
  • Entertainment
    Toni Braxton Is Turning Her Biggest Hits Into Lifetime Movies – Yahoo

    Toni Braxton Is Turning Her Biggest Hits Into Lifetime Movies – Yahoo

    Major airline to offer new in-flight entertainment options for passengers – PennLive.com

    Major airline to offer new in-flight entertainment options for passengers – PennLive.com

    Penn State-Themed Restaurant and Entertainment Spot Happy Valley Live Set to Open in State College – StateCollege.com

    Penn State-Themed Restaurant and Entertainment Spot Happy Valley Live Set to Open in State College – StateCollege.com

    The Police Made Chart History With This 1979 Hit Nearly 50 Years Ago – Yahoo

    How The Police Changed Music Forever with Their Iconic 1979 Hit Nearly 50 Years Ago

    Good Deed Entertainment Acquires Worldwide Rights To Liza Mandelup’s Documentary ‘Caterpillar’ – Deadline

    Good Deed Entertainment Lands Global Rights to Liza Mandelup’s Captivating Documentary ‘Caterpillar

    Danielle Fishel Explains Why Being on “DWTS” Makes Her Feel ‘Like It’s 1994 Again’ Filming “Boy Meets World” (Exclusive) – Yahoo

    Danielle Fishel Explains Why Being on “DWTS” Makes Her Feel ‘Like It’s 1994 Again’ Filming “Boy Meets World” (Exclusive) – Yahoo

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    A Tech Expo Shows What China Can Make, but Not Who’ll Buy It All – The New York Times

    Inside China’s Tech Expo: Cutting-Edge Innovations Face Uncertain Demand

    Steampunk Metal Oval Technology Sense Sunglasses Personality Handmade Chain Multicolor Sunglasses UV400 – The San Joaquin Valley Sun

    Steampunk Metal Oval Sunglasses with Handmade Multicolor Chain – Bold UV400 Protection and Unique Style

    STELLA Automotive AI Appoints Fred Seidelman as Chief Technology Officer – Yahoo Finance

    STELLA Automotive AI Appoints Fred Seidelman as New Chief Technology Officer

    Saving Energy and Money with Smart Technology – Terms of Service with Clare Duffy – Podcast on CNN Podcasts – CNN

    Saving Energy and Money with Smart Technology – Terms of Service with Clare Duffy – Podcast on CNN Podcasts – CNN

    Four Strategic Signals Technology Leaders Are Tuning In To – SPONSOR CONTENT FROM ARM – Harvard Business Review

    Four Essential Strategic Signals Every Technology Leader Should Watch

    Virginia Tech hosts annual New Music + Technology Festival this week – Cardinal News

    Virginia Tech Kicks Off Exciting Annual New Music and Technology Festival This Week

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
Earth-News
No Result
View All Result
Home Science

AI System Beats Chess Puzzles With ‘Artificial Brainstorming’

November 15, 2023
in Science
AI System Beats Chess Puzzles With ‘Artificial Brainstorming’
Share on FacebookShare on Twitter

When Covid-19 sent people home in early 2020, the computer scientist Tom Zahavy rediscovered chess. He had played as a kid and had recently read Garry Kasparov’s Deep Thinking, a memoir of the grandmaster’s 1997 matches against IBM’s chess-playing computer, Deep Blue. He watched chess videos on YouTube and The Queen’s Gambit on Netflix.

Despite his renewed interest, Zahavy wasn’t looking for ways to improve his game. “I’m not a great player,” he said. “I’m better at chess puzzles” — arrangements of pieces, often contrived and unlikely to occur during a real game, that challenge a player to find creative ways to gain the advantage.

The puzzles can help players sharpen their skills, but more recently they’ve helped reveal the hidden limitations of chess programs. One of the most notorious puzzles, devised by the mathematician Sir Roger Penrose in 2017, puts stronger black pieces (such as the queen and rooks) on the board, but in awkward positions. An experienced human player, playing white, could readily steer the game into a draw, but powerful computer chess programs would say black had a clear advantage. That difference, Zahavy said, suggested that even though computers could defeat the world’s best human players, they couldn’t yet recognize and work through every kind of tough problem. Since then, Penrose and others have devised sprawling collections of puzzles that computers struggle to solve.

Chess has long been a touchstone for testing new ideas in artificial intelligence, and Penrose’s puzzles piqued Zahavy’s interest. “I was trying to understand what makes these positions so hard for computers when at least some of them we can solve as humans,” he said. “I was completely fascinated.” It soon evolved into a professional interest: As a research scientist at Google DeepMind, Zahavy explores creative problem-solving approaches. The goal is to devise AI systems with a spectrum of possible behaviors beyond performing a single task.

A traditional AI chess program, trained to win, may not make sense of a Penrose puzzle, but Zahavy suspected that a program made up of many diverse systems, working together as a group, could make headway. So he and his colleagues developed a way to weave together multiple (up to 10) decision-making AI systems, each optimized and trained for different strategies, starting with AlphaZero, DeepMind’s powerful chess program. The new system, they reported in August, played better than AlphaZero alone, and it showed more skill — and more creativity — in dealing with Penrose’s puzzles. These abilities came, in a sense, from self-collaboration: If one approach hit a wall, the program simply turned to another.

That approach fundamentally makes sense, said Allison Liemhetcharat, a computer scientist at DoorDash who has worked with multi-agent approaches to problem-solving in robotics. “With a population of agents, there’s a higher probability that the puzzles are in the domain that at least one of the agents was trained in.”

The work suggests that teams of diverse AI systems could efficiently tackle hard problems well beyond the game board. “This is a great example that looking for more than one way to solve a problem — like winning a chess game — provides a lot of benefits,” said Antoine Cully, an AI researcher at Imperial College London who was not involved with the DeepMind project. He compared it to an artificial version of human brainstorming sessions. “This thought process leads to creative and effective solutions that one would miss without doing this exercise.”

Chasing Failures

Before joining DeepMind, Zahavy was interested in deep reinforcement learning, an area of artificial intelligence in which a system uses neural networks to learn some task through trial and error. It’s the basis for the most powerful chess programs (and used in other AI applications like self-driving cars). The system starts with its environment. In chess, for example, the environment includes the game board and possible moves. If the task is to drive a car, the environment includes everything around the vehicle. The system then makes decisions, takes actions and evaluates how close it came to its goal. As it gets closer to the goal, it accumulates rewards, and as the system racks up rewards it improves its performance. The “deep” part of this approach describes the neural networks used to analyze and assess behaviors.

Reinforcement learning is how AlphaZero learned to become a chess master. DeepMind reported that during the program’s first nine hours of training, in December 2017, it played 44 million games against itself. At first, its moves were randomly determined, but over time it learned to select moves more likely to lead toward checkmate. After just hours of training, AlphaZero developed the ability to defeat any human chess player.

But as successful as reinforcement learning can be, it doesn’t always lead to strategies that reflected a general understanding of the game. Over the last half-decade or so, Zahavy and others noticed an uptick in the peculiar glitches that could happen on systems trained with trial and error. A system that plays video games, for example, might find a loophole and figure out how to cheat or skip a level, or it could just as easily get stuck in a repetitive loop. Penrose-style puzzles similarly suggested a kind of blind spot, or glitch, in AlphaZero — it couldn’t figure out how to approach a problem it had never seen before.

But maybe not all glitches are just errors. Zahavy suspected that AlphaZero’s blind spots might actually be something else in disguise — decisions and behaviors tied to the system’s internal rewards. Deep reinforcement learning systems, he said, don’t know how to fail — or even how to recognize failure. The ability to fail has long been linked to creative problem-solving. “Creativity has a human quality,” Kasparov wrote in Deep Thinking. “It accepts the notion of failure.”

AI systems typically don’t. And if a system doesn’t recognize that it’s failed to complete its task, then it may not try something else. Instead, it will just keep trying to do what it’s already done. That’s likely what led to those dead ends in video games — or to getting stuck on some Penrose challenges, Zahavy said. The system was chasing “weird kinds of intrinsic rewards,” he said, that it had developed during its training. Things that looked like mistakes from the outside were likely the consequence of developing specific but ultimately unsuccessful strategies.

The system regarded these weird rewards as steps toward the greater goal, which it couldn’t actually achieve, and it didn’t know to try something new. “I was trying to make sense of them,” Zahavy said.

A Better Game

Part of the reason these glitches can prove so consequential — and so useful — comes from what researchers recognize as a problem with generalization. While reinforcement learning systems can develop an effective strategy for connecting a given situation to a specific action — which researchers call a “policy” — they can’t apply it to different problems. “What normally tends to happen with reinforcement learning, almost regardless of the method, is that you get the policy that solves the particular instance of the problem you’ve been training on, but it doesn’t generalize,” said Julian Togelius, a computer scientist at New York University and research director at modl.ai.

Zahavy saw the Penrose puzzles as requiring just this sort of generalization. Maybe AlphaZero couldn’t solve most puzzles because it was so focused on winning entire games, start to finish. But that approach introduced blind spots exposed by the unlikely arrangements of pieces in Penrose puzzles. Maybe, he reasoned, the program could learn to beat the puzzle if it had enough creative room to brainstorm and access different training methods.

So he and his colleagues first collected a set of 53 Penrose puzzles and 15 additional challenge puzzles. On its own, AlphaZero solved less 4% of the Penrose puzzles and under 12% of the rest. Zahavy wasn’t surprised: Many of these puzzles were designed by chess masters to intentionally confuse computers.

As a test, the researchers tried training AlphaZero to play against itself using the Penrose puzzle arrangement as the starting position, instead of the full board of typical games. Its performance improved dramatically: It solved 96% of the Penrose puzzles and 76% of the challenge set. In general, when AlphaZero trained on a specific puzzle, it could solve that puzzle, just as it could win when it trained on a full game. Perhaps, Zahavy thought, if a chess program could somehow have access to all those different versions of AlphaZero, trained on those different positions, then that diversity could spark the ability to approach new problems productively. Perhaps it could generalize, in other words, solving not only the Penrose puzzles, but any broader chess problem.

His group decided to find out. They built the new, diversified version of AlphaZero, which includes multiple AI systems that trained independently and on a variety of situations. The algorithm that governs the overall system acts as a kind of virtual matchmaker, Zahavy said: one designed to identify which agent has the best chance of succeeding when it’s time to make a move. He and his colleagues also coded in a “diversity bonus” — a reward for the system whenever it pulled strategies from a large selection of choices.

When the new system was set loose to play its own games, the team observed a lot of variety. The diversified AI player experimented with new, effective openings and novel — but sound — decisions about specific strategies, such as when and where to castle. In most matches, it defeated the original AlphaZero. The team also found that the diversified version could solve twice as many challenge puzzles as the original and could solve more than half of the total catalog of Penrose puzzles.

“The idea is that instead of finding one solution, or one single policy, that would beat any player, here [it uses] the idea of creative diversity,” Cully said.

With access to more and different played games, Zahavy said, the diversified AlphaZero had more options for sticky situations when they arose. “If you can control the kind of games that it sees, you basically control how it will generalize,” he said. Those weird intrinsic rewards (and their associated moves) could become strengths for diverse behaviors. Then the system could learn to assess and value the disparate approaches and see when they were most successful. “We found that this group of agents can actually come to an agreement on these positions.”

And, crucially, the implications extend beyond chess.

Real-Life Creativity

Cully said a diversified approach can help any AI system, not just those based on reinforcement learning. He’s long used diversity to train physical systems, including a six-legged robot that was allowed to explore various kinds of movement, before he intentionally “injured” it, allowing it to continue moving using some of the techniques it had developed before. “We were just trying to find solutions that were different from all previous solutions we have found so far.” Recently, he’s also been collaborating with researchers to use diversity to identify promising new drug candidates and develop effective stock-trading strategies.

“The goal is to generate a large collection of potentially thousands of different solutions, where every solution is very different from the next,” Cully said. So — just as the diversified chess player learned to do — for every type of problem, the overall system could choose the best possible solution. Zahavy’s AI system, he said, clearly shows how “searching for diverse strategies helps to think outside the box and find solutions.”

Zahavy suspects that in order for AI systems to think creatively, researchers simply have to get them to consider more options. That hypothesis suggests a curious connection between humans and machines: Maybe intelligence is just a matter of computational power. For an AI system, maybe creativity boils down to the ability to consider and select from a large enough buffet of options. As the system gains rewards for selecting a variety of optimal strategies, this kind of creative problem-solving gets reinforced and strengthened. Ultimately, in theory, it could emulate any kind of problem-solving strategy recognized as a creative one in humans. Creativity would become a computational problem.

Liemhetcharat noted that a diversified AI system is unlikely to completely resolve the broader generalization problem in machine learning. But it’s a step in the right direction. “It’s mitigating one of the shortcomings,” she said.

More practically, Zahavy’s results resonate with recent efforts that show how cooperation can lead to better performance on hard tasks among humans. Most of the hits on the Billboard 100 list were written by teams of songwriters, for example, not individuals. And there’s still room for improvement. The diverse approach is currently computationally expensive, since it must consider so many more possibilities than a typical system. Zahavy is also not convinced that even the diversified AlphaZero captures the entire spectrum of possibilities.

“I still [think] there is room to find different solutions,” he said. “It’s not clear to me that given all the data in the world, there is [only] one answer to every question.”

Quanta is conducting a series of surveys to better serve our audience. Take our computer science reader survey and you will be entered to win free Quanta merchandise.

>>> Read full article>>>
Copyright for syndicated content belongs to the linked Source : Quanta Magazine – https://www.quantamagazine.org/google-deepmind-trains-artificial-brainstorming-in-chess-ai-20231115/

Tags: beatssciencesystem
Previous Post

During Pregnancy, a Fake ‘Infection’ Protects the Fetus

Next Post

2°C Rise 8 Years Sooner: Arctic’s Role in Rapid Global Warming

Alabama man earns world record for 3-foot, 6-inch beard locks – upi.com

Alabama man earns world record for 3-foot, 6-inch beard locks – upi.com

October 2, 2025
How Trump could use a government shutdown to turbocharge his economic agenda – Yahoo Finance

How Trump could use a government shutdown to turbocharge his economic agenda – Yahoo Finance

October 2, 2025
Toni Braxton Is Turning Her Biggest Hits Into Lifetime Movies – Yahoo

Toni Braxton Is Turning Her Biggest Hits Into Lifetime Movies – Yahoo

October 2, 2025
Reproductive Health Emergency Kits To Be Distributed Saturday At Jacksonville Really Really Free Market – Center for Biological Diversity

Reproductive Health Emergency Kits To Be Distributed Saturday At Jacksonville Really Really Free Market – Center for Biological Diversity

October 2, 2025
Times/Siena Survey: Americans Worry Divisions Cannot Be Overcome – The New York Times

Americans Fear Deep Divisions May Be Impossible to Overcome

October 2, 2025
Oak Ridge Reservation Set for $42M Ecological Restoration, Balancing – Hoodline

Oak Ridge Reservation to Undergo $42M Ecological Restoration and Balancing Effort

October 2, 2025
Mayor green lights Science Center development; residents call it ‘giant win’ for St. Pete – WFLA

Mayor green lights Science Center development; residents call it ‘giant win’ for St. Pete – WFLA

October 2, 2025
A ‘Great Wave’ is rippling through our galaxy, pushing thousands of stars out of place – Live Science

A ‘Great Wave’ is rippling through our galaxy, pushing thousands of stars out of place – Live Science

October 2, 2025
These are the best breweries in New Jersey, according to recent online ranking – Yahoo

These are the best breweries in New Jersey, according to recent online ranking – Yahoo

October 2, 2025
A Tech Expo Shows What China Can Make, but Not Who’ll Buy It All – The New York Times

Inside China’s Tech Expo: Cutting-Edge Innovations Face Uncertain Demand

October 2, 2025

Categories

Archives

October 2025
M T W T F S S
 12345
6789101112
13141516171819
20212223242526
2728293031  
« Sep    
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

Browse by Category

  • Business (20,132)
  • Ecology (847)
  • Economy (868)
  • Entertainment (21,742)
  • General (17,369)
  • Health (9,911)
  • Lifestyle (881)
  • News (22,149)
  • People (870)
  • Politics (879)
  • Science (16,078)
  • Sports (21,368)
  • Technology (15,851)
  • World (851)

Recent News

Alabama man earns world record for 3-foot, 6-inch beard locks – upi.com

Alabama man earns world record for 3-foot, 6-inch beard locks – upi.com

October 2, 2025
How Trump could use a government shutdown to turbocharge his economic agenda – Yahoo Finance

How Trump could use a government shutdown to turbocharge his economic agenda – Yahoo Finance

October 2, 2025
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version