Ronald Williams
Biodata
- Profesi: Computer scientist
Kontribusi
- Backpropagation (1986) — paper seminal dengan Rumelhart & Hinton:
“Learning representations by back-propagating errors”
- REINFORCE algorithm (1992) — policy gradient untuk reinforcement learning
- Machine learning di University of Massachusetts
Karier
- UCSD (1980-an)
- University of Massachusetts Amherst (1990-)
Legacy
Williams adalah salah satu 3 penulis paper backpropagation (dengan Rumelhart & Hinton). REINFORCE algorithm-nya menjadi fondasi policy gradient methods di RL modern.