0% found this document useful (0 votes)
6 views3 pages

3GP ML Reinforcement Learning

Reinforcement Learning (RL) is a distinct machine learning method focused on decision making through rewarding desired behaviors and punishing undesired ones. It enables agents to learn from experience without labeled data, making it applicable in various fields such as autonomous driving, video games, and resource management. RL can be categorized into positive reinforcement, which encourages behavior by adding rewards, and negative reinforcement, which promotes behavior by avoiding adverse conditions.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views3 pages

3GP ML Reinforcement Learning

Reinforcement Learning (RL) is a distinct machine learning method focused on decision making through rewarding desired behaviors and punishing undesired ones. It enables agents to learn from experience without labeled data, making it applicable in various fields such as autonomous driving, video games, and resource management. RL can be categorized into positive reinforcement, which encourages behavior by adding rewards, and negative reinforcement, which promotes behavior by avoiding adverse conditions.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Reinforcement Learning

“Learn from Mistakes”

Definition:
Reinforcement Learning (RL) is the science of decision making. When compared
to supervised and unsupervised learning, reinforcement learning is quite
distinct. It is based on rewarding desired behaviors and punishing undesired
ones.

Reinforcement Learning is defined as a Machine Learning method concerned


with how software agents should take actions in an environment. Reinforcement
Learning is a part of the deep learning method that helps you maximize some
portion of the cumulative reward.

In Reinforcement Learning, the agent learns automatically using feedback


without any labeled data, unlike supervised learning. Without a training dataset,
it is bound to learn from its experience. In simple words, we can say that the
output depends on the state of the current input and the following input
depends on the output of the previous input.

1
Applications:
A possible application of RL is any real-world situation in which an agent must
interact with an unpredictable environment to achieve a particular objective.

● Autonomous driving
An autonomous driving system must execute many perceptual and
planning tasks in an unpredictable environment. Vehicle route planning
and motion prediction are two particular applications where RL might be
helpful.

● Video games
Learning to play video games is one of the most popular applications of
reinforcement learning. Consider Google's reinforcement learning
applications, AlphaZero and AlphaGo, which learned to play Go.

● Managing resources
Reinforcement learning is effective in navigating complicated
surroundings. It may deal with the necessity to balance various
requirements. Take Google's data centers, for example. They employed
reinforcement learning to balance the need to meet our power demand
while being as efficient as possible, resulting in significant cost savings.

Types of Reinforcement Learning:


There are mainly two types of reinforcement learning, which are:

Positive Reinforcement:
Positive reinforcement learning means adding something to increase the
expected behavior's tendency to occur again. It impacts positively on the agent's
behavior and increases the strength of the behavior.
This type of reinforcement can sustain the changes for a long time, but too much
positive reinforcement may lead to an overload of states that can reduce the
consequences.

Negative Reinforcement:

2
Negative reinforcement learning is the opposite of positive reinforcement as it
increases the tendency that the specific behavior will occur again by avoiding the
adverse condition.
It can be more effective than positive reinforcement depending on the situation
and behavior, but it provides reinforcement only to meet minimum behavior.

You might also like