E1 277 January-April 3:1 Reinforcement Learning: Instructor

This document provides information about a reinforcement learning course including the instructor, teaching assistants, schedule, prerequisites, syllabus, course outcomes, grading policy, and resources. The course deals with probabilistic models and algorithms for dynamic decision making under uncertainty, covering topics like stochastic dynamic programming, Q-learning, temporal difference learning, and actor-critic algorithms. Students will learn modeling and analysis techniques that can be applied to problems involving sequential decision making and will gain an understanding of commonly used reinforcement learning algorithms.

Uploaded by

praveen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

168 views2 pages

E1 277 January-April 3:1 Reinforcement Learning: Instructor

Uploaded by

praveen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

E1 277 January-April 3:1

Reinforcement Learning

Instructor
Shalabh Bhatnagar
Email: [email protected]
Teaching Assistant
Sindhu P.R., Raghuram Bharadwaj
Email: [email protected], [email protected]

Department: Computer Science and Automation

Course Time: Tuesday/Thursday 9:30-11:00
Lecture venue: CSA 252
Detailed Course Page:

Announcements

Brief description of the course

The course deals with probabilistic models for problems of dynamic decision making under uncertainty.

Stochastic dynamic programming is a general framework for modelling such problems. However, one requires

knowledge of transition probabilities (i.e., the system dynamics) as well as the associated cost function. Both

of these quantities are normally not known and one only has access to data that is available from the

experiment. For instance, one may not know the transition probabilities but one may see what the next state is

given the current state and the action or control chosen. The course deals with building first the model based

dynamic programming techniques and subsequently the model free, data driven algorithms, and deals with the

theoretical foundations of these.

Prerequisites
Any student who has done the course E0 232 -- Probability and Statistics or an equivalent probability course.
Syllabus
Introduction to reinforcement learning, introduction to stochastic dynamic programming, finite and infinite

horizon models, the dynamic programming algorithm, infinite horizon discounted cost and average cost

Page 1/2
problems, numerical solution methodologies, full state representations, function approximation techniques,

approximate dynamic programming, partially observable Markov decision processes, Q-learning, temporal

difference learning, actor-critic algorithms.

Course outcomes
The students will get to know modelling and analysis tools and techniques for problems of dynamic decision

making under uncertainty. They will know the algorithms they can apply when faced with such problems and

the convergence and accuracy guarantees that such algorithms would provide.
Grading policy
Two mid term exams, One course project, and One final exam
Assignments

Resources

Page 2/2

Puppet Play Therapy - Hulburd
No ratings yet
Puppet Play Therapy - Hulburd
163 pages
List Suggested Books Indian Authors Publishers PDF
No ratings yet
List Suggested Books Indian Authors Publishers PDF
52 pages
STEM Education Teaching and Learning - NSTA
No ratings yet
STEM Education Teaching and Learning - NSTA
7 pages
Control Systems and Reinforcement Learning - Sean Meyn - 2022 - Cambridge University Press - 9781009051873 - Anna's Archive
No ratings yet
Control Systems and Reinforcement Learning - Sean Meyn - 2022 - Cambridge University Press - 9781009051873 - Anna's Archive
454 pages
Cookery 9 COT
100% (2)
Cookery 9 COT
3 pages
Ashwin Rao, Tikhon Jelvis - Foundations of Reinforcement Learning With Applications in Finance-CRC Press - Chapman & Hall (2022)
No ratings yet
Ashwin Rao, Tikhon Jelvis - Foundations of Reinforcement Learning With Applications in Finance-CRC Press - Chapman & Hall (2022)
522 pages
Reinforcement Learning Syllabus
No ratings yet
Reinforcement Learning Syllabus
6 pages
The Most Dangerous Game and The Necklace
No ratings yet
The Most Dangerous Game and The Necklace
8 pages
AbstractDynamic Programming
No ratings yet
AbstractDynamic Programming
422 pages
Helmholtz Pitch Notation
No ratings yet
Helmholtz Pitch Notation
13 pages
Demonstration in Teaching Take-Off/ Motivation
No ratings yet
Demonstration in Teaching Take-Off/ Motivation
2 pages
CVENG 423 - Module 4 - Construction Estimates and Values Engineering
No ratings yet
CVENG 423 - Module 4 - Construction Estimates and Values Engineering
7 pages
Module 2 Psy 122
No ratings yet
Module 2 Psy 122
11 pages
Reinforcement Learning and Optimal Control - Draft Version by Dmitri Bertsekas
No ratings yet
Reinforcement Learning and Optimal Control - Draft Version by Dmitri Bertsekas
268 pages
Unit-6 Reinforcement Learning
No ratings yet
Unit-6 Reinforcement Learning
75 pages
Application of Reinforcement Learning - Finance
No ratings yet
Application of Reinforcement Learning - Finance
540 pages
Plan+Your+Best+Year+Ever+Workbook+ +
No ratings yet
Plan+Your+Best+Year+Ever+Workbook+ +
20 pages
Reinforcement Learning: Foundations
No ratings yet
Reinforcement Learning: Foundations
276 pages
Abstract Dynamic Programming
No ratings yet
Abstract Dynamic Programming
257 pages
Proof of Delivery
No ratings yet
Proof of Delivery
8 pages
RL Test Leif
No ratings yet
RL Test Leif
163 pages
MIT6 231F15 Notes PDF
No ratings yet
MIT6 231F15 Notes PDF
303 pages
Unit 5
No ratings yet
Unit 5
39 pages
Reinforcement Learning An Introduction 2 Trimmed Edition Richard S. Sutton updated 2025
No ratings yet
Reinforcement Learning An Introduction 2 Trimmed Edition Richard S. Sutton updated 2025
113 pages
Five Hagen Exercises
100% (1)
Five Hagen Exercises
6 pages
RL Module 4
No ratings yet
RL Module 4
50 pages
5SC28 L7 Machine Learning
No ratings yet
5SC28 L7 Machine Learning
61 pages
RL Class Notes
No ratings yet
RL Class Notes
68 pages
Powell-Tutorial-ComputationalStochasticOptimization Informs Nov152014
No ratings yet
Powell-Tutorial-ComputationalStochasticOptimization Informs Nov152014
142 pages
RL-Notes Book
No ratings yet
RL-Notes Book
119 pages
5SC28 Machine Learning For Systems and Control
No ratings yet
5SC28 Machine Learning For Systems and Control
68 pages
WGMTA Final (C) Brandtraction 2020
No ratings yet
WGMTA Final (C) Brandtraction 2020
44 pages
Biophilia Hypothesis
100% (1)
Biophilia Hypothesis
23 pages
DLMAIRIL01 Q4-2024 Session2
No ratings yet
DLMAIRIL01 Q4-2024 Session2
68 pages
MIT6 231F15 Complete Slide
No ratings yet
MIT6 231F15 Complete Slide
166 pages
DP by Bellman Functional Equation
No ratings yet
DP by Bellman Functional Equation
296 pages
Protestation at Speyer PDF
No ratings yet
Protestation at Speyer PDF
28 pages
Approximate Dynamic Programming and Reinforcement Learning - Algorithms, Analysis and An Application
No ratings yet
Approximate Dynamic Programming and Reinforcement Learning - Algorithms, Analysis and An Application
139 pages
An Introduction To Reinforcement Learning From Theory To Algorithms (December 19, 2024) - Joon Kwon
No ratings yet
An Introduction To Reinforcement Learning From Theory To Algorithms (December 19, 2024) - Joon Kwon
66 pages
Scan Chain: Scan Chain Is A Technique Used in Design
No ratings yet
Scan Chain: Scan Chain Is A Technique Used in Design
7 pages
Breed Registry: Herdbook, Studbook or Register, in Animal
No ratings yet
Breed Registry: Herdbook, Studbook or Register, in Animal
35 pages
FS1 Activity 1
No ratings yet
FS1 Activity 1
8 pages
Tut21 RL
No ratings yet
Tut21 RL
101 pages
Bayesian Reinforcement Learning
No ratings yet
Bayesian Reinforcement Learning
27 pages
Adprl Chapter Icis
No ratings yet
Adprl Chapter Icis
43 pages
Audio To Text Embedding
No ratings yet
Audio To Text Embedding
144 pages
Algorithms For Reinforcement Learning - Szepesvari
No ratings yet
Algorithms For Reinforcement Learning - Szepesvari
98 pages
Assessment Brief
No ratings yet
Assessment Brief
6 pages
Control Systems and Reinforcement Learning (Sean Meyn) (Z-Library)
No ratings yet
Control Systems and Reinforcement Learning (Sean Meyn) (Z-Library)
453 pages
Cinnamic Acid
No ratings yet
Cinnamic Acid
18 pages
Nautical Time
No ratings yet
Nautical Time
16 pages
RLAlgs in MDPs
No ratings yet
RLAlgs in MDPs
98 pages
MARK5813 Creativity and Innovation in Marketing S22016
No ratings yet
MARK5813 Creativity and Innovation in Marketing S22016
14 pages
CS 4501-Introduction To Reinforcement Learning
No ratings yet
CS 4501-Introduction To Reinforcement Learning
7 pages
Clock Rate: Clock Cycles Per Second or Its Equivalent
No ratings yet
Clock Rate: Clock Cycles Per Second or Its Equivalent
19 pages
Lecture 10
No ratings yet
Lecture 10
25 pages
Lecture 1
No ratings yet
Lecture 1
26 pages
Color Breed PDF
No ratings yet
Color Breed PDF
9 pages
R. Basson. Human Sex-Response Cycles 2001
No ratings yet
R. Basson. Human Sex-Response Cycles 2001
11 pages
Lecture13 Postclass
No ratings yet
Lecture13 Postclass
36 pages
Phye222 13
No ratings yet
Phye222 13
3 pages
Power Affirmations To Spark Charge Success in Your Life Self Carla Da Costa
No ratings yet
Power Affirmations To Spark Charge Success in Your Life Self Carla Da Costa
20 pages
BSN1 Ha Lec 2
No ratings yet
BSN1 Ha Lec 2
8 pages
CSA3003 - REINFORCEMENT-LEARNING - LT - 1.0 - 1 - CSA3003 - Reinforcement Learning
No ratings yet
CSA3003 - REINFORCEMENT-LEARNING - LT - 1.0 - 1 - CSA3003 - Reinforcement Learning
2 pages
Scheduled Power Outages in Northern California Begin
No ratings yet
Scheduled Power Outages in Northern California Begin
8 pages
AIML II Test Scheme and Soluion 2023
No ratings yet
AIML II Test Scheme and Soluion 2023
12 pages
Product Binning Is The Categorizing of
No ratings yet
Product Binning Is The Categorizing of
11 pages
Algorithms For Reinforced Learning
No ratings yet
Algorithms For Reinforced Learning
98 pages
Data Signaling Rate
No ratings yet
Data Signaling Rate
10 pages
TOPIO
No ratings yet
TOPIO
10 pages
The Common and Distinct Neural Bases of Affect Labeling and Reappraisal in Healthy Adults
No ratings yet
The Common and Distinct Neural Bases of Affect Labeling and Reappraisal in Healthy Adults
10 pages
Anglo-French Conference On Time-Keeping at Sea
No ratings yet
Anglo-French Conference On Time-Keeping at Sea
6 pages
Acrl Syllabus
No ratings yet
Acrl Syllabus
2 pages
Gujarat Technological University: Bachelor of Engineering Syllabus Subject Code: Subject Name
No ratings yet
Gujarat Technological University: Bachelor of Engineering Syllabus Subject Code: Subject Name
3 pages
Structure of The Comprehensive Examination in The ME Department For Circulation To Students
No ratings yet
Structure of The Comprehensive Examination in The ME Department For Circulation To Students
4 pages
Soccer Dribbling Lesson
100% (1)
Soccer Dribbling Lesson
7 pages
Markov Decision Processes: Lecture Notes For STP 425: Jay Taylor
100% (1)
Markov Decision Processes: Lecture Notes For STP 425: Jay Taylor
86 pages
Summated Scales
100% (3)
Summated Scales
5 pages
1. Definition of transformations.: The sun disappeared behind a cloud. - Сонце заховалося за хмарою
No ratings yet
1. Definition of transformations.: The sun disappeared behind a cloud. - Сонце заховалося за хмарою
7 pages
What Is Literature?: Truth and Beauty
No ratings yet
What Is Literature?: Truth and Beauty
3 pages
Diversity and Sel Lesson Plan
No ratings yet
Diversity and Sel Lesson Plan
6 pages
MIT6 231F11 Notes Short
No ratings yet
MIT6 231F11 Notes Short
125 pages
Lecture 30 Reinforcement-Learning
No ratings yet
Lecture 30 Reinforcement-Learning
50 pages
RLcourseoutline 2025
No ratings yet
RLcourseoutline 2025
2 pages
DL Unit 6 QP Solution
No ratings yet
DL Unit 6 QP Solution
15 pages
Add-On DRL CS06
No ratings yet
Add-On DRL CS06
23 pages
Syllabus Ob 2HS401
No ratings yet
Syllabus Ob 2HS401
2 pages
RL Monograph1
No ratings yet
RL Monograph1
42 pages
ME 688 Advanced Machining Processes (3-0-0-6) : Textbooks
No ratings yet
ME 688 Advanced Machining Processes (3-0-0-6) : Textbooks
1 page
Thought Progression Adjustment Shakti Master
100% (2)
Thought Progression Adjustment Shakti Master
11 pages
Orientation: Cultural and Heritage Tourism
No ratings yet
Orientation: Cultural and Heritage Tourism
11 pages
Introduction To Stochastic Dynamic Programming: Sheldon Ross
No ratings yet
Introduction To Stochastic Dynamic Programming: Sheldon Ross
4 pages
The Miracle Worker: Anne Sullivan Helen Keller
No ratings yet
The Miracle Worker: Anne Sullivan Helen Keller
1 page
ME150 Non-Conventional Manufacturing
No ratings yet
ME150 Non-Conventional Manufacturing
1 page
Title of The Program Media (SLE's) : "Am I Not Enough?" Goals
No ratings yet
Title of The Program Media (SLE's) : "Am I Not Enough?" Goals
3 pages
MGT 420 (Individual)
100% (3)
MGT 420 (Individual)
6 pages
Chapter 1 PDF
No ratings yet
Chapter 1 PDF
45 pages
Deep Reinforcement Learning: Lecture Notes
No ratings yet
Deep Reinforcement Learning: Lecture Notes
60 pages
Origins of Frame Story
No ratings yet
Origins of Frame Story
1 page
Conservation Status: Near Threatened (IUCN 3.1)
No ratings yet
Conservation Status: Near Threatened (IUCN 3.1)
1 page
RL Frontmatter
No ratings yet
RL Frontmatter
11 pages
Free Imperial City PDF
No ratings yet
Free Imperial City PDF
57 pages
Bionomical Name
No ratings yet
Bionomical Name
1 page
An Overview of Machine Learning
No ratings yet
An Overview of Machine Learning
42 pages
La5 PDF
No ratings yet
La5 PDF
35 pages
Reinforcement Learning and Dynamic Programming For Control
100% (1)
Reinforcement Learning and Dynamic Programming For Control
111 pages
20ai903 - RL - Unit 2
No ratings yet
20ai903 - RL - Unit 2
27 pages
Lesson Plan 6th Grade
No ratings yet
Lesson Plan 6th Grade
4 pages
Benifits of Job Analysis
0% (1)
Benifits of Job Analysis
2 pages
5.4-Reinforcement Learning-Part1-Introduction
No ratings yet
5.4-Reinforcement Learning-Part1-Introduction
15 pages
Approximate Dynamic Programming - II: Algorithms: Warren B. Powell
No ratings yet
Approximate Dynamic Programming - II: Algorithms: Warren B. Powell
22 pages
Deep Reinforcement Learning Handout v2.0
0% (1)
Deep Reinforcement Learning Handout v2.0
6 pages

E1 277 January-April 3:1 Reinforcement Learning: Instructor

Uploaded by

E1 277 January-April 3:1 Reinforcement Learning: Instructor

Uploaded by

E1 277 January-April 3:1

Department: Computer Science and Automation

Brief description of the course

theoretical foundations of these.

difference learning, actor-critic algorithms.

You might also like