3GP ML Reinforcement Learning
Definition:
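Reinforcement Learning (RL) is the science of decision making: an agent learns
how to act by interacting with an environment. It is quite distinct from
supervised and unsupervised learning because the agent learns from the
consequences of its own actions rather than from a fixed dataset. It is based on
rewarding desired behaviors and punishing undesired ones.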
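The idea of rewarding desired behaviors and punishing undesired ones can be
sketched in a few lines of Python. This is only an illustration under stated
assumptions: the two-action environment, the reward values (+1 and -1), and the
names train_agent, epsilon, and learning_rate are hypothetical and are not taken
from these notes.

```python
import random


def train_agent(steps=500, epsilon=0.1, learning_rate=0.1, seed=0):
    """Learn by trial and error which of two actions is rewarded."""
    rng = random.Random(seed)
    # Hypothetical environment: action 1 is desired (+1), action 0 is undesired (-1).
    rewards = {0: -1.0, 1: 1.0}
    # The agent's running estimate of the reward each action brings.
    values = {0: 0.0, 1: 0.0}
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.choice([0, 1])            # explore: try a random action
        else:
            action = max(values, key=values.get)   # exploit: pick the best estimate
        reward = rewards[action]
        # Move the estimate a small step toward the reward just observed.
        values[action] += learning_rate * (reward - values[action])
    return values


values = train_agent()
best_action = max(values, key=values.get)
print(values, best_action)
```

After training, the agent's estimate for the rewarded action should be higher
than for the punished one, so it chooses the rewarded action when acting
greedily.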
Applications:
RL can be applied to any real-world situation in which an agent must interact
with an unpredictable environment to achieve a particular objective.
● Autonomous driving
An autonomous driving system must execute many perceptual and
planning tasks in an unpredictable environment. Vehicle route planning
and motion prediction are two particular applications where RL might be
helpful.
● Games
Learning to play games, including video games, is one of the most popular
applications of reinforcement learning. Consider AlphaGo and AlphaZero from
Google DeepMind, which learned to play the board game Go.
● Managing resources
Reinforcement learning is effective in complex environments where competing
requirements must be balanced. Take Google's data centers, for example.
Reinforcement learning was used there to balance the need to meet power demand
against the need to be as efficient as possible, resulting in significant cost
savings.
Positive Reinforcement:
Positive reinforcement means adding something (a reward) after the expected
behavior so that the behavior is more likely to occur again. It has a positive
impact on the agent's behavior and increases the strength of that behavior.
This type of reinforcement can sustain the changes for a long time, but too much
positive reinforcement may lead to an overload of states, which can diminish the
results.
Negative Reinforcement:
Negative reinforcement is the counterpart of positive reinforcement. It also
increases the tendency of a specific behavior to occur again, but it does so by
removing or avoiding an adverse condition rather than by adding a reward.
It can be more effective than positive reinforcement depending on the situation
and behavior, but it provides only enough reinforcement to meet the minimum
required behavior.
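The difference between the two types can be sketched as two reward signals.
This is a minimal illustration, not a standard formulation: modeling positive
reinforcement as an added reward of +1 and negative reinforcement as an avoided
penalty of -1 is an assumption, and the function names below are hypothetical.

```python
def positive_reinforcement(action_is_desired):
    # Something pleasant is added when the desired behavior occurs.
    return 1.0 if action_is_desired else 0.0


def negative_reinforcement(action_is_desired):
    # An adverse condition (modeled as a penalty) is avoided by the desired behavior.
    return 0.0 if action_is_desired else -1.0


def learned_preference(reward_fn, steps=200, learning_rate=0.1):
    """Return how much more the agent values the desired action than the other one."""
    values = {True: 0.0, False: 0.0}
    for step in range(steps):
        action_is_desired = (step % 2 == 0)  # alternate so both actions are tried
        reward = reward_fn(action_is_desired)
        values[action_is_desired] += learning_rate * (reward - values[action_is_desired])
    return values[True] - values[False]


positive_gap = learned_preference(positive_reinforcement)
negative_gap = learned_preference(negative_reinforcement)
print(positive_gap, negative_gap)
```

In both cases the desired action ends up valued above the alternative. Only the
mechanism differs: a reward is added in the first case and a penalty is avoided
in the second.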