Lead AI
Back to Courses
coursera
Coding
advanced
Certificate

Reinforcement Learning Specialization

University of Alberta's RL course — MDPs, temporal difference learning, policy gradient methods, and Dyna architecture.

by University of Alberta4.7 (5,200 reviews)95,000 students80h96 lessons

Technologies & Tools

reinforcement-learning
mdp
policy-gradient
temporal-difference

Dev AI Expertise Covered

MDPs
Temporal Difference
Monte Carlo Methods
Policy Gradient
Dyna Architecture
Function Approximation
Pros
Gold standard RL course
From RL textbook authors
Rigorous theory
Cons
-Math-heavy
-No deep RL (DQN, PPO)

Prerequisites

  • Python
  • Probability
  • Linear Algebra
  • Basic ML

Instructor

U

University of Alberta

Course Details

LanguageEnglish
Duration80h
Lessons96
CertificateYes