Google’s AlphaGo Zero AI quickly masters ancient board game with no human help

Share

Why it matters to you

AlphaGo Zero can learn on its own, marking a significant step forward for the capabilities of AI algorithms.

Google shocked the world in 2016 when AlphaGo, an artificial intelligence program created specifically to play the ancient board game Go, defeated one of the game’s top competitors in a five-game match. Such a feat wasn’t predicted to occur for at least another decade, leaving tech types and laymen alike wondering just how intelligent AI has become.

A little over one year later, AlphaGo again competed in a high-profile match, this time against the world’s top Go player, a 19-year-old prodigy named Ke Jie. The machine shut the human out, three games to none. With these victories under its belt, Google announced in May that it would retire AlphaGo.

But Google’s AI group, DeepMind, has just unveiled a newer, shinier, smarter version of AlphaGo dubbed AlphaGo Zero, which has pushed beyond the capabilities of its predecessor by mastering the ancient board game without any help from humans. Equipped with just the rules of the game, AlphaGo Zero managed to learn Go from scratch, create its own knowledge along the way, and ultimately defeat its predecessor 100 games to zero.

Both the old and new AlphaGo learned through a process called reinforcement learning, which encourages good moves that are more likely to be rewarded with a win. However, the way DeepMind trained the systems differed, and that’s where AlphaGo Zero really shined.

To train the original AlphaGo, DeepMind researchers fed the system thousands of games that were played by amateur and professional human Go players. These games helped the system develop winning strategies and identify good and bad moves. AlphaGo Zero, on the other hand, only played by itself (albeit millions of time), making moves at random until it recognized strategies. The new system had no help from humans beyond its initial startup.

What’s truly astonishing about AlphaGo Zero’s self-schooling is that it went from chump to champ in just a few days. The system started off as a completely incompetent player. By the third day, after only playing against itself, the system was capable of defeating its predecessor. By day 40, DeepMind suggests the system became the greatest Go player ever.

Where the original AlphaGo was little more than an exceptionally talented board game player, the advances made by AlphaGo Zero — specifically it’s ability to teach itself from scratch — makes the system relevant to a wide range of real-world applications. The same principles that help AlphaGo Zero learn from just the rules could be applied to other rules-based task.

“For us, AlphaGo wasn’t just about winning the game of Go,” Demis Hassabis, CEO of DeepMind, told The Guardian. “It was also a big step for us towards building these general-purpose algorithms.”

DeepMind published a paper detailing the development of AlphaGo Zero in the journal Nature.

News

Company:

The Redmi Pad SE is a new entry-level Android tablet for less than $200

Enthusiast audio brand Moondrop is making a phone, and it doesn’t make sense

Android 15 could make app notification management a breeze

Samsung Galaxy A35 vs. Nothing Phone 2a: Which one is worth your money?

Apple is about to do the unthinkable to its iPads

I reviewed the Samsung Galaxy A55. It didn’t go as expected

The Insta360 X4 360-degree action camera is pocket-sized perfection for vloggers

Withings ScanWatch 2 review: Should you buy it?

A content creation laptop for $1,000 isn’t impossible after all

Spigen Liquid Air Samsung Galaxy S24 case review: Should you buy it?

I’ve worn two of the best smart rings. Here’s which one you should buy

I did a camera test with two $1,800 phones. Then something annoying happened

Google Pixel 7a vs. Pixel 7: don’t buy the wrong Pixel

This is the most unusual Galaxy S23 Ultra camera test I’ve ever done

I tested the Galaxy S23 Ultra and iPhone 14 Pro cameras. Only one is a winner

How to download a video from Facebook

How to do a hanging indent in Microsoft Word

How to use YouTube Music on your Wear OS smartwatch

How to enable 120Hz for all games on the Oculus Quest 2

How to type an em dash on a Mac

8 iPhone browser apps you should use instead of Safari

Are Facebook and Instagram still down? Here’s what we know

Are Facebook and Instagram still down? Here’s what we know

The 1Password Android app just got a huge upgrade

I never knew I needed this mini Mac app, but now I can’t live without it

Google’s AlphaGo Zero AI quickly masters ancient board game with no human help

The Redmi Pad SE is a new entry-level Android tablet for less than $200

Enthusiast audio brand Moondrop is making a phone, and it doesn’t make sense

Android 15 could make app notification management a breeze

Samsung Galaxy A35 vs. Nothing Phone 2a: Which one is worth your money?

Apple is about to do the unthinkable to its iPads

More News

The Redmi Pad SE is a new entry-level Android tablet for less than $200

Enthusiast audio brand Moondrop is making a phone, and it doesn’t make sense

Android 15 could make app notification management a breeze

Samsung Galaxy A35 vs. Nothing Phone 2a: Which one is worth your money?