In this course, you will be introduced to Reinforcement Learning, an area of Machine Learning. You will learn the Markov Decision Processes, Bandit Algorithms, Dynamic Programming, and Temporal Difference (TD) methods. You will be introduced to Value function, Bellman Equation, and Value iteration. You will also learn Policy Gradient methods. You will learn to make decisions in uncertain environment.
Learning Objectives: The aim of this module is to introduce you to the fundamentals of Reinforcement Learning and its elements. This module also introduces you to OpenAI Gym - a programming environment used for implementing RL agents.
Topics:
Learning Objectives: The aim of this module is to learn Bandit Algorithms and Markov Decision Process.
Topics:
Learning Objectives: The aim of this module is to develop an understanding of Dynamic Programming Algorithms and Temporal Difference Learning methods.
Topics:
Learning Objectives: The aim of this module is to learn Policy Gradients and develop an understanding of Deep Q Learning
Topics:
Goal: The aim of this module is to provide you hands-on experience in Reinforcement Learning.
Required Pre-requisites
Edureka offers you complimentary self-paced courses