Coursera
Coding
Advanced Certificate
Reinforcement Learning Specialization
University of Alberta's reinforcement learning (RL) course, covering Markov decision processes (MDPs), temporal-difference learning, policy gradient methods, and the Dyna architecture.
by University of Alberta · 4.7 (5,200 reviews) · 95,000 students · 80h · 96 lessons
Technologies & Tools
reinforcement-learning
mdp
policy-gradient
temporal-difference
Dev AI Expertise Covered
MDPs
Temporal Difference
Monte Carlo Methods
Policy Gradient
Dyna Architecture
Function Approximation
Pros
- Gold standard RL course
- From RL textbook authors
- Rigorous theory
Cons
- Math-heavy
- No deep RL coverage (DQN, PPO)
Prerequisites
- Python
- Probability
- Linear Algebra
- Basic ML
Instructor
University of Alberta
Course Details
Language: English
Duration: 80h
Lessons: 96
Certificate: Yes
