SlideShare a Scribd company logo
Reinforcement Learning
Yigit UNALLAR
Machine Learning
Learn without explicitly programmed!
● Supervised Learning
● Unsupervised Learning
● Reinforcement Learning
Reinforcement Learning
● Learning from interaction!
○ Driving a car,
○ Holding a conversation,
● Goal-directed approach
○ Closed-loop,
○ Reward oriented,
Reinforcement vs. Unsupervised Learning
● Hidden structures!
● Unlabeled data!
● No reliance on structures!
● Maximize a reward!
Exploration vs. Exploitation Dilemma
● Exploit to obtain rewards!
● Explore to perform better!
● Either Exploration or Exploitation?
● Closest to the human and animal learning!
Examples
● Mobile Robot
○ More trash to find,
○ Way back to battery station,
● Adaptive Controller for Petrol Refinery
○ Optimize yield/cost/quality,
○ Specified marginal costs,
Agent & Environment
● Policy,
○ Mapping from states to actions,
● Reward,
○ Pain, pleasure,
● Value Function,
○ Farsighted judgement,
● Model,
○ Mimics the environment,
Pick and Place Robot
Action:
Voltages at motors,
States:
Latest joint data,
Reward:
+1 for successful pick-up, computed in the environment!
Goals & Markov Decision Process
Goals:
Markov Decision Process:
Retaining all relevant information, Markov Property!
Markov Decision Process ctd.
MDP if,
● The state and action spaces are finite,
● Satisfies Markov property,
Example: Recycling Robot
● Actively search for a can,
● Remain still and wait for a can,
● Go back to station,
Recycling Robot
Value Functions- Bellman Equations
Solving RL tasks for WHAT?!
● Finding a policy
○ Achieves lots of reward
■ Over the long RUN!
Recycling Robot Revised
Dynamic Programming
● Use value functions,
● Organize and structure a search,
● GOOD POLICIES!
Dynamic Programming
Monte Carlo Methods
● Used in algorithm to mimic policy iteration,
○ Policy Evaluation,
■ (s,a) averages over time ==> Q
○ Policy Iteration,
■ Next policy from Q, (Greedy Policy),
● Given s, new policy returns a that max Q(s, . )
● Works in episodic problems ONLY!
Any Questions?
References
[1] Reinforcement Learning: Introduction, R. Sutton, A. Barto
[2] AIMA, S. Russell, P. Norvig

More Related Content

PDF
A brief overview of Reinforcement Learning applied to games
Thomas da Silva Paula
 
PDF
An introduction to deep reinforcement learning
Big Data Colombia
 
PDF
Human-level Control Through Deep Reinforcement Learning (Presentation)
Muhammed Kocabaş
 
PPTX
Intro to Deep Reinforcement Learning
Khaled Saleh
 
PPTX
Deep Reinforcement Learning
Usman Qayyum
 
PDF
Deep Reinforcement Learning
MeetupDataScienceRoma
 
PDF
Deep Reinforcement Learning: MDP & DQN - Xavier Giro-i-Nieto - UPC Barcelona ...
Universitat Politècnica de Catalunya
 
PPTX
An introduction to reinforcement learning
Subrat Panda, PhD
 
A brief overview of Reinforcement Learning applied to games
Thomas da Silva Paula
 
An introduction to deep reinforcement learning
Big Data Colombia
 
Human-level Control Through Deep Reinforcement Learning (Presentation)
Muhammed Kocabaş
 
Intro to Deep Reinforcement Learning
Khaled Saleh
 
Deep Reinforcement Learning
Usman Qayyum
 
Deep Reinforcement Learning
MeetupDataScienceRoma
 
Deep Reinforcement Learning: MDP & DQN - Xavier Giro-i-Nieto - UPC Barcelona ...
Universitat Politècnica de Catalunya
 
An introduction to reinforcement learning
Subrat Panda, PhD
 

What's hot (20)

PDF
An introduction to reinforcement learning
Jie-Han Chen
 
PDF
Reinforcement Learning (DLAI D7L2 2017 UPC Deep Learning for Artificial Intel...
Universitat Politècnica de Catalunya
 
PDF
Deep Q-Learning
Nikolay Pavlov
 
PPTX
Reinforcement Learning
Salem-Kabbani
 
PDF
Multi armed bandit
Jie-Han Chen
 
PDF
Reinforcement learning
DongHyun Kwak
 
PPTX
Reinforcement Learning
DongHyun Kwak
 
PDF
Introduction of Deep Reinforcement Learning
NAVER Engineering
 
PDF
Frontier in reinforcement learning
Jie-Han Chen
 
PPT
Reinforcement learning
Chandra Meena
 
PDF
Actor critic algorithm
Jie-Han Chen
 
PDF
Reinforcement Learning - DQN
Mohammaderfan Arefimoghaddam
 
PDF
Deep reinforcement learning from scratch
Jie-Han Chen
 
PPTX
An Introduction to Reinforcement Learning - The Doors to AGI
Anirban Santara
 
PDF
Planning and Learning with Tabular Methods
Dongmin Lee
 
PPTX
Reinforcement Learning
butest
 
PPTX
Reinforcement Learning : A Beginners Tutorial
Omar Enayet
 
PDF
Generalized Reinforcement Learning
Po-Hsiang (Barnett) Chiu
 
PDF
Introduction to Deep Reinforcement Learning
IDEAS - Int'l Data Engineering and Science Association
 
PDF
Discrete sequential prediction of continuous actions for deep RL
Jie-Han Chen
 
An introduction to reinforcement learning
Jie-Han Chen
 
Reinforcement Learning (DLAI D7L2 2017 UPC Deep Learning for Artificial Intel...
Universitat Politècnica de Catalunya
 
Deep Q-Learning
Nikolay Pavlov
 
Reinforcement Learning
Salem-Kabbani
 
Multi armed bandit
Jie-Han Chen
 
Reinforcement learning
DongHyun Kwak
 
Reinforcement Learning
DongHyun Kwak
 
Introduction of Deep Reinforcement Learning
NAVER Engineering
 
Frontier in reinforcement learning
Jie-Han Chen
 
Reinforcement learning
Chandra Meena
 
Actor critic algorithm
Jie-Han Chen
 
Reinforcement Learning - DQN
Mohammaderfan Arefimoghaddam
 
Deep reinforcement learning from scratch
Jie-Han Chen
 
An Introduction to Reinforcement Learning - The Doors to AGI
Anirban Santara
 
Planning and Learning with Tabular Methods
Dongmin Lee
 
Reinforcement Learning
butest
 
Reinforcement Learning : A Beginners Tutorial
Omar Enayet
 
Generalized Reinforcement Learning
Po-Hsiang (Barnett) Chiu
 
Introduction to Deep Reinforcement Learning
IDEAS - Int'l Data Engineering and Science Association
 
Discrete sequential prediction of continuous actions for deep RL
Jie-Han Chen
 
Ad

Viewers also liked (14)

PDF
우울증 리서치 개인
주연 박
 
DOC
Aula Jonatas 61: Autoridade
Andre Nascimento
 
PPT
Derecho fundamental al proceso_IAFJSR
Mauri Rojas
 
PPTX
Creating a Customer-Centric Learning Culture
Qualtrics
 
PPTX
Slideshare
Margie Ortiz Rojas
 
PDF
Fraud Detection Class Slides
Max De Marzi
 
PDF
Video Conferencing over WebRTC
Yigit UNALLAR
 
PPTX
Elastic - DASH
Yigit UNALLAR
 
PPTX
LOAD BEARING WALL
wan izzati
 
PDF
Machine Learning
Joshua Robinson
 
PPT
Retaining Walls
Mereia Kali
 
PDF
Machine Learning
gezeitenraum gbr
 
PDF
ISOBAGS-About It
abrahamprice012
 
우울증 리서치 개인
주연 박
 
Aula Jonatas 61: Autoridade
Andre Nascimento
 
Derecho fundamental al proceso_IAFJSR
Mauri Rojas
 
Creating a Customer-Centric Learning Culture
Qualtrics
 
Slideshare
Margie Ortiz Rojas
 
Fraud Detection Class Slides
Max De Marzi
 
Video Conferencing over WebRTC
Yigit UNALLAR
 
Elastic - DASH
Yigit UNALLAR
 
LOAD BEARING WALL
wan izzati
 
Machine Learning
Joshua Robinson
 
Retaining Walls
Mereia Kali
 
Machine Learning
gezeitenraum gbr
 
ISOBAGS-About It
abrahamprice012
 
Ad

Similar to Reinforcement Learning (20)

PPTX
Survey of Modern Reinforcement Learning
Julia Maddalena
 
PDF
anintroductiontoreinforcementlearning-180912151720.pdf
ssuseradaf5f
 
PDF
Reinforcement Learning - Learning from Experience like a Human
Rising Media Ltd.
 
PPTX
reinforcement-learning-141009013546-conversion-gate02.pptx
MohibKhan79
 
PDF
Reinforcement Learning
CloudxLab
 
PDF
reinforcement-learning-141009013546-conversion-gate02.pdf
VaishnavGhadge1
 
PDF
Lecture 1 - introduction.pdf
NamanJain758248
 
PDF
Reinforcement learning Russell and Norvig CMSC
sfsmj710f
 
PDF
Shanghai deep learning meetup 4
Xiaohu ZHU
 
PPTX
Reinforcement Learning, Application and Q-Learning
Abdullah al Mamun
 
PPT
RL.ppt
AzharJamil15
 
PPTX
Introduction to reinforcement learning
Pramod Ramachandra
 
PDF
Reinforcement Learning 1. Introduction
Seung Jae Lee
 
PDF
What is Reinforcement Learning.pdf
Aiblogtech
 
PDF
REINFORCEMENT LEARNING
pradiprahul
 
PPTX
Reinforcement learning
Zahra Khoobi
 
PPTX
reinforcement learning in artificial intelligence
panditadesh123
 
PPTX
Designing an AI that gains experience for absolute beginners
Tanzim Saqib
 
PDF
Real-world Reinforcement Learning
Max Pagels
 
PDF
TensorFlow London 17: Practical Reinforcement Learning with OpenAI
Seldon
 
Survey of Modern Reinforcement Learning
Julia Maddalena
 
anintroductiontoreinforcementlearning-180912151720.pdf
ssuseradaf5f
 
Reinforcement Learning - Learning from Experience like a Human
Rising Media Ltd.
 
reinforcement-learning-141009013546-conversion-gate02.pptx
MohibKhan79
 
Reinforcement Learning
CloudxLab
 
reinforcement-learning-141009013546-conversion-gate02.pdf
VaishnavGhadge1
 
Lecture 1 - introduction.pdf
NamanJain758248
 
Reinforcement learning Russell and Norvig CMSC
sfsmj710f
 
Shanghai deep learning meetup 4
Xiaohu ZHU
 
Reinforcement Learning, Application and Q-Learning
Abdullah al Mamun
 
RL.ppt
AzharJamil15
 
Introduction to reinforcement learning
Pramod Ramachandra
 
Reinforcement Learning 1. Introduction
Seung Jae Lee
 
What is Reinforcement Learning.pdf
Aiblogtech
 
REINFORCEMENT LEARNING
pradiprahul
 
Reinforcement learning
Zahra Khoobi
 
reinforcement learning in artificial intelligence
panditadesh123
 
Designing an AI that gains experience for absolute beginners
Tanzim Saqib
 
Real-world Reinforcement Learning
Max Pagels
 
TensorFlow London 17: Practical Reinforcement Learning with OpenAI
Seldon
 

Recently uploaded (20)

PDF
The Effect of Artifact Removal from EEG Signals on the Detection of Epileptic...
Partho Prosad
 
PPTX
quantum computing transition from classical mechanics.pptx
gvlbcy
 
PDF
Biodegradable Plastics: Innovations and Market Potential (www.kiu.ac.ug)
publication11
 
PDF
Unit I Part II.pdf : Security Fundamentals
Dr. Madhuri Jawale
 
PPTX
22PCOAM21 Session 2 Understanding Data Source.pptx
Guru Nanak Technical Institutions
 
PPTX
business incubation centre aaaaaaaaaaaaaa
hodeeesite4
 
PPTX
Module2 Data Base Design- ER and NF.pptx
gomathisankariv2
 
PDF
Chad Ayach - A Versatile Aerospace Professional
Chad Ayach
 
PDF
20ME702-Mechatronics-UNIT-1,UNIT-2,UNIT-3,UNIT-4,UNIT-5, 2025-2026
Mohanumar S
 
PPTX
Victory Precisions_Supplier Profile.pptx
victoryprecisions199
 
PPTX
MULTI LEVEL DATA TRACKING USING COOJA.pptx
dollysharma12ab
 
PPTX
sunil mishra pptmmmmmmmmmmmmmmmmmmmmmmmmm
singhamit111
 
PDF
2010_Book_EnvironmentalBioengineering (1).pdf
EmilianoRodriguezTll
 
PDF
Natural_Language_processing_Unit_I_notes.pdf
sanguleumeshit
 
PPTX
MSME 4.0 Template idea hackathon pdf to understand
alaudeenaarish
 
PDF
AI-Driven IoT-Enabled UAV Inspection Framework for Predictive Maintenance and...
ijcncjournal019
 
PDF
FLEX-LNG-Company-Presentation-Nov-2017.pdf
jbloggzs
 
PDF
LEAP-1B presedntation xxxxxxxxxxxxxxxxxxxxxxxxxxxxx
hatem173148
 
PPTX
MT Chapter 1.pptx- Magnetic particle testing
ABCAnyBodyCanRelax
 
PDF
Cryptography and Information :Security Fundamentals
Dr. Madhuri Jawale
 
The Effect of Artifact Removal from EEG Signals on the Detection of Epileptic...
Partho Prosad
 
quantum computing transition from classical mechanics.pptx
gvlbcy
 
Biodegradable Plastics: Innovations and Market Potential (www.kiu.ac.ug)
publication11
 
Unit I Part II.pdf : Security Fundamentals
Dr. Madhuri Jawale
 
22PCOAM21 Session 2 Understanding Data Source.pptx
Guru Nanak Technical Institutions
 
business incubation centre aaaaaaaaaaaaaa
hodeeesite4
 
Module2 Data Base Design- ER and NF.pptx
gomathisankariv2
 
Chad Ayach - A Versatile Aerospace Professional
Chad Ayach
 
20ME702-Mechatronics-UNIT-1,UNIT-2,UNIT-3,UNIT-4,UNIT-5, 2025-2026
Mohanumar S
 
Victory Precisions_Supplier Profile.pptx
victoryprecisions199
 
MULTI LEVEL DATA TRACKING USING COOJA.pptx
dollysharma12ab
 
sunil mishra pptmmmmmmmmmmmmmmmmmmmmmmmmm
singhamit111
 
2010_Book_EnvironmentalBioengineering (1).pdf
EmilianoRodriguezTll
 
Natural_Language_processing_Unit_I_notes.pdf
sanguleumeshit
 
MSME 4.0 Template idea hackathon pdf to understand
alaudeenaarish
 
AI-Driven IoT-Enabled UAV Inspection Framework for Predictive Maintenance and...
ijcncjournal019
 
FLEX-LNG-Company-Presentation-Nov-2017.pdf
jbloggzs
 
LEAP-1B presedntation xxxxxxxxxxxxxxxxxxxxxxxxxxxxx
hatem173148
 
MT Chapter 1.pptx- Magnetic particle testing
ABCAnyBodyCanRelax
 
Cryptography and Information :Security Fundamentals
Dr. Madhuri Jawale
 

Reinforcement Learning