IBM-types of MC Learning
IBM-types of MC Learning
4. Reinforcement learning
Reinforcement learning, also called reinforcement learning from human feedback (RLHF), is a
type of dynamic programming that trains algorithms using a system of reward and punishment.
To deploy reinforcement learning, an agent takes actions in a specific environment to reach a
predetermined goal. The agent is rewarded or penalized for its actions based on an established
metric (typically points), encouraging the agent to continue good practices and discard bad ones.
With repetition, the agent learns the best strategies.
Reinforcement learning algorithms are common in video game development and are frequently
used to teach robots how to replicate human tasks.
5. Semi-supervised learning
The fifth type of machine learning technique offers a combination between supervised and
unsupervised learning.
Semi-supervised learning algorithms are trained on a small labeled dataset and a large unlabeled
dataset, with the labeled data guiding the learning process for the larger body of unlabeled data.
A semi-supervised learning model might use unsupervised learning to identify data clusters and
then use supervised learning to label the clusters.
Generative adversarial networks (GANs)—deep learning tool that generates unlabeled data by
training two neural networks—are an example of semi-supervised machine learning.
Regardless of type, ML models can glean data insights from enterprise data, but their
vulnerability to human/data bias make responsible AI practices an organizational imperative.