Mastering perfect and imperfect information systems.
The foundation of modern game intelligence was built on reinforcement learning, progressing from brute-force chess calculations to deep neural networks playing complex, real-time strategy games from self-play.
- ▸Deep Blue (1997): Defeated world champion Garry Kasparov in Chess by calculating 200M positions/sec.
- ▸AlphaGo (2016): Defeated 18-time world champion Lee Sedol 4-1 in Go by combining deep learning with Monte Carlo Tree Search.
- ▸AlphaZero (2017): Mastered Chess, Shogi, and Go entirely from scratch via self-play RL, without using human games.
- ▸AlphaStar (2019): Reached Grandmaster level (top 0.15% of active players) in StarCraft II, mastering imperfect information and real-time planning.
- ▸AlphaDev (2022): Discovered faster sorting algorithms in assembly code, integrated directly into the LLVM libc++ library.