DeepMind introduced Gato, a single transformer model trained to perform hundreds of different tasks including playing Atari games, captioning images, engaging in dialogue, and controlling robotic arms. Using the same network weights across all tasks, Gato demonstrated the possibility of generalist AI agents. It achieved over 50% expert performance on 450 out of 604 tasks.
- Single model for 600+ tasks
- Games, images, text, robotics
- Same weights across all tasks
- Generalist AI approach
- 450/604 tasks at expert level
research-papergeneralist-agentreinforcement-learningdeepmind
DeepMind introduced Flamingo, a visual language model that can rapidly adapt to new tasks with just a few examples (few-shot learning). It combines pretrained vision and language models with new architecture components, achieving state-of-the-art on image and video understanding benchmarks without task-specific fine-tuning. Showed the power of frozen pretrained models with light-weight adapters.
- Few-shot visual understanding
- No task-specific fine-tuning needed
- Frozen pretrained components
- State-of-the-art on multiple benchmarks
- Vision + language integration
research-papermultimodalfew-shotdeepmind
DeepMind's Chinchilla paper revealed that current large models are severely undertrained. A 70B parameter model trained on 1.4 trillion tokens outperformed Gopher (280B params) and GPT-3 (175B params). The paper established new scaling laws showing that model size and training data should scale equally for optimal compute efficiency, fundamentally changing how models are trained.
- 70B model outperforms 280B Gopher
- Data scaling = Parameter scaling
- 1.4 trillion tokens for 70B model
- New scaling laws established
- Fundamentally changed training approaches
research-paperscaling-lawsefficiencydeepmind
DeepMind's AlphaFold2 achieved near-atomic accuracy in predicting protein structures, solving a 50-year-old grand challenge in biology. At CASP14, it achieved a median GDT score of 92.4, competitive with experimental results. This breakthrough has massive implications for drug discovery and understanding disease.
- Solved 50-year protein folding problem
- Near-atomic accuracy (GDT 92.4)
- Competitive with experimental methods
- Opened path to cataloguing 200M+ protein structures
- Massive impact on biology and medicine
google-deepmindresearch-paperalphafoldsciencebenchmark
DeepMind introduced AlphaGo Zero, which achieved superhuman Go performance without any human data. Starting from random play, it surpassed all previous versions within 40 days, demonstrating that AI could achieve mastery purely through self-play reinforcement learning.
- No human data - learned from scratch
- Trained entirely through self-play
- Surpassed previous AlphaGo in 40 days
- Used single neural network (vs. two)
- Demonstrated pure reinforcement learning power
google-deepmindresearch-papergame-playingreinforcement-learningai
In a historic five-game match, AlphaGo defeated legendary Go player Lee Sedol 4-1. The match drew global attention, with over 280 million viewers in China alone. Game 2's move 37 was called 'divine' by commentators, showing creative play beyond human intuition.
- First AI to defeat world champion Go player
- 4-1 victory over Lee Sedol
- Move 37 in Game 2 called 'divine'
- Watched by 280 million in China
- Featured in documentary 'AlphaGo'
google-deepmindgame-playingreinforcement-learningaibenchmark
DeepMind's AlphaGo defeated European Go champion Fan Hui 5-0 in a formal match, the first time a computer program defeated a professional Go player. This was a landmark achievement as Go was considered much more complex than chess.
- First AI to defeat professional Go player
- Combined deep neural networks with Monte Carlo tree search
- Trained on 30 million positions from human games
- Used policy and value networks
- Paved way for Lee Sedol match
google-deepmindproduct-launchgame-playingreinforcement-learningai