- In this course, you will be introduced to Reinforcement Learning, an area of Machine Learning. You will learn the Markov Decision Processes, Bandit Algorithms, Dynamic Programming, and Temporal Difference (TD) methods. You will be introduced to Value function, Bellman Equation, and Value iteration. You will also learn Policy Gradient methods. You will learn to make decisions in uncertain environment.
Reinforcement Learning
- Description
- Curriculum
- FAQ
- Reviews
- Web Developers
- Software Developers
- Programmers
- Anyone who wants to learn reinforcement learning
- Fundamentals in AI & ML, Probability, Python, Neural Networks, Frameworks, Deep Learning library like PyTorch/ Theano/ Tensorflow
- Statistics and Machine learning algorithms
- Python Essentials
-
1Introduction to Reinforcement Learning
Learning Objectives: The aim of this module is to introduce you to the fundamentals of Reinforcement Learning and its elements. This module also introduces you to OpenAI Gym - a programming environment used for implementing RL agents.
Topics:
- Branches of Machine Learning
- What is Reinforcement Learning?
- The Reinforcement Learning Process
- Elements of Reinforcement Learning
- RL Agent Taxonomy
- Reinforcement Learning Problem
- Introduction to OpenAI Gym
-
2Bandit Algorithms and Markov Decision Process
Learning Objectives: The aim of this module is to learn Bandit Algorithms and Markov Decision Process.
Topics:
- Bandit Algorithms
- Markov Process
- Markov Reward Process
- Markov Decision Process
-
3Dynamic Programming
Learning Objectives: The aim of this module is to develop an understanding of Dynamic Programming Algorithms and Temporal Difference Learning methods.
Topics:
- Introduction to Dynamic Programming
- Dynamic Programming Algorithms
- Monte Carlo Methods
- Temporal Difference Learning Methods
-
4Deep Q Learning
Learning Objectives: The aim of this module is to learn Policy Gradients and develop an understanding of Deep Q Learning
Topics:
- Policy Gradients
- Policy Gradients using TensorFlow
- Deep Q learning
- Q learning with replay buffers, target networks, and CNN
-
5In-Class Project
Goal: In this module, you will learn how to approach and implement a Project end to end, and a Subject Matter Expert will share his experience and insights from the industry to help you kickstart your career in this domain. Finally, we will be having a Q&A and doubt clearing session.
Objectives: At the end of this module, you should be able to:
- How to approach a project
- Hands-On project implementation
- What Industry expects
- Industry insights for the Machine Learning domain
- QA and Doubt Clearing Session
View the recorded session of the class available in your LMS.
You can attend the missed session, in any other live batch."