
1. Generative Models

Definition:

Generative models attempt to model how data is generated by learning the probability distribution of the
input data. After training, these models can generate new data points similar to those in the training set.

Key Concepts:

 Generative vs. Discriminative:
o Generative models learn the joint probability distribution P(x, y), where x is the input (features) and y is the label (or outcome).
o Discriminative models (like logistic regression) learn the conditional distribution P(y|x) to predict labels.

How It Works:

 Generative models estimate the probability P(x) of the data itself. Once trained, they can be used to:
o Generate new data (e.g., create new images or text).
o Simulate what unseen data points might look like.

Examples:

1. Naive Bayes Classifier:
o A generative model that assumes features are conditionally independent given the label. It is simple but works well for text classification.
2. Gaussian Naive Bayes:
o Assumes that the features follow a normal (Gaussian) distribution. It calculates the probability of each class given the input data and predicts the class with the highest likelihood (see the sketch after this list).
3. Generative Adversarial Networks (GANs):
o Two networks: A generator and a discriminator. The generator creates fake data, and the
discriminator tries to distinguish between real and fake data. The two networks "compete"
until the generator becomes skilled at producing realistic data.
o Example: Generating realistic images of people or objects.
4. Variational Autoencoders (VAEs):
o Use neural networks to learn a compressed, latent representation of the input data, then generate new data based on this learned representation.
o Example: Generating handwritten digits after being trained on the MNIST dataset.
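A minimal Gaussian Naive Bayes sketch using scikit-learn; the two-feature data and class labels are synthetic and chosen only to illustrate the fit/predict workflow:

```python
import numpy as np
from sklearn.naive_bayes import GaussianNB

# Two synthetic classes with two features each (illustrative values only).
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 1.0, size=(100, 2)),   # class 0
               rng.normal(3.0, 1.0, size=(100, 2))])  # class 1
y = np.array([0] * 100 + [1] * 100)

# Fit a per-class Gaussian to each feature (the generative part), then
# classify new points by the class with the highest posterior probability.
model = GaussianNB().fit(X, y)
print(model.predict([[0.2, -0.1], [2.8, 3.1]]))   # likely [0 1]
print(model.predict_proba([[1.5, 1.5]]))          # per-class probabilities
```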

Applications:

 Image and video generation (GANs).
 Text generation (VAEs, GPT).
 Data augmentation (creating synthetic data for training).

2. Mixture Models

Definition:
A mixture model is a probabilistic model that represents data as being generated from multiple
underlying distributions. Each data point belongs to one of these distributions, but we don't know in
advance which one.

Key Concepts:

 Latent Variables: These variables indicate which distribution (or "component") a data point
came from.
 Soft Clustering: In contrast to hard clustering (like k-means), where each data point belongs to
exactly one cluster, mixture models assign probabilities to data points belonging to different
clusters.

How It Works:

 Mixture models assume the data is generated by a combination of several probability distributions.
 For a new data point, the model assigns a probability that it belongs to each distribution.
 This probability can be interpreted as a "soft" cluster assignment.
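A minimal soft-clustering sketch with scikit-learn's GaussianMixture; the two clusters of 2-D points are synthetic and purely illustrative:

```python
import numpy as np
from sklearn.mixture import GaussianMixture

# Two synthetic clusters of 2-D points (illustrative only).
rng = np.random.default_rng(2)
X = np.vstack([rng.normal([0.0, 0.0], 0.5, size=(150, 2)),
               rng.normal([4.0, 4.0], 0.8, size=(150, 2))])

# Fit a two-component mixture; scikit-learn runs EM under the hood.
gmm = GaussianMixture(n_components=2, random_state=0).fit(X)

# Soft assignments: each row sums to 1 across the two components.
print(gmm.predict_proba([[0.1, -0.2], [2.0, 2.0], [4.1, 3.9]]))
```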

Examples:

1. Gaussian Mixture Model (GMM):
o A GMM assumes that the data is generated from a mixture of several Gaussian distributions. Each component of the model is a Gaussian distribution with its own mean and variance.
o Mathematically, the GMM represents the probability of the data as:
P(x) = \sum_{k=1}^{K} \pi_k \mathcal{N}(x \mid \mu_k, \Sigma_k)
where K is the number of Gaussian components, \pi_k is the weight of the k-th component, and \mathcal{N}(x \mid \mu_k, \Sigma_k) is the normal distribution with mean \mu_k and covariance \Sigma_k.
2. Expectation-Maximization (EM) Algorithm:
o E-step: Estimate which component each data point likely came from (assign probabilities
to each component).
o M-step: Maximize the likelihood by updating the parameters of each distribution (mean,
variance, and weights).
o EM is often used to fit a GMM to data.
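A hand-rolled sketch of EM for a two-component, one-dimensional GMM, showing the E-step and M-step explicitly; the data and the initial parameter guesses are made up for illustration:

```python
import numpy as np

def normal_pdf(x, mu, var):
    """Density of a 1-D Gaussian with mean mu and variance var."""
    return np.exp(-(x - mu) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)

# Synthetic 1-D data from two overlapping Gaussians (illustrative only).
rng = np.random.default_rng(3)
x = np.concatenate([rng.normal(-2.0, 1.0, 300), rng.normal(3.0, 1.5, 200)])

# Initial guesses for the mixture weights, means and variances.
pi = np.array([0.5, 0.5])
mu = np.array([-1.0, 1.0])
var = np.array([1.0, 1.0])

for _ in range(50):
    # E-step: responsibility of each component for each data point.
    dens = np.stack([pi[k] * normal_pdf(x, mu[k], var[k]) for k in range(2)], axis=1)
    resp = dens / dens.sum(axis=1, keepdims=True)

    # M-step: re-estimate weights, means and variances from the responsibilities.
    nk = resp.sum(axis=0)
    pi = nk / len(x)
    mu = (resp * x[:, None]).sum(axis=0) / nk
    var = (resp * (x[:, None] - mu) ** 2).sum(axis=0) / nk

print(pi, mu, var)  # should end up close to the true mixture parameters
```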

Applications:

 Clustering: GMM can be used to identify clusters in data where each cluster represents a
different Gaussian distribution.
 Anomaly detection: In scenarios like fraud detection, data points that don’t fit well into any
Gaussian component can be flagged as anomalies.
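A minimal anomaly-detection sketch along these lines, assuming a GMM fitted on "normal" data and a log-likelihood threshold chosen from that same data (both the data and the threshold choice are illustrative):

```python
import numpy as np
from sklearn.mixture import GaussianMixture

# Fit a GMM to "normal" behaviour (synthetic 2-D feature vectors).
rng = np.random.default_rng(4)
normal_data = rng.normal([0.0, 0.0], 1.0, size=(500, 2))
gmm = GaussianMixture(n_components=3, random_state=0).fit(normal_data)

# score_samples returns the log-likelihood of each point under the mixture;
# points far below a threshold chosen on the training data are flagged.
threshold = np.percentile(gmm.score_samples(normal_data), 1)
candidates = np.array([[0.1, 0.3], [6.0, -7.0]])
print(gmm.score_samples(candidates) < threshold)  # expect [False  True]
```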

3. Latent Factor Models

Definition:

Latent factor models aim to explain the observed data using hidden variables, called latent factors.
These factors represent the underlying structure or patterns that influence the data, even though they
cannot be directly observed.

Key Concepts:
 Latent Variables: Hidden or unobserved variables that indirectly affect the observed data.
 Dimensionality Reduction: By finding latent factors, these models can reduce the number of
observed features into a smaller, more manageable set.

How It Works:

 Latent factor models decompose observed data into a set of underlying latent factors and their
contributions to each observation.
 These models try to uncover the underlying reasons (latent factors) why the observed data
behaves in a certain way.

Examples:

1. Principal Component Analysis (PCA):
o A method of dimensionality reduction that projects data onto a lower-dimensional space.
o It finds the directions (principal components) along which the data varies the most and uses these components to represent the data in fewer dimensions (see the sketch after this list).
o Example: In image compression, PCA can reduce the number of values needed to represent an image by capturing only the most important variation.
2. Matrix Factorization (used in Recommendation Systems):
o Collaborative Filtering: Used to predict user preferences based on latent factors. In a
movie recommendation system, latent factors could represent things like “action vs.
drama” or “complexity of plot”.
o Factorization: The observed matrix (user ratings) is factorized into two lower-rank
matrices: one representing users and one representing items (movies, books, etc.). These
latent factors explain why certain users prefer certain items.
3. Latent Dirichlet Allocation (LDA):
o A generative probabilistic model used in natural language processing (NLP) for topic
modeling. It assumes that documents are mixtures of topics, and topics are distributions
over words.
o Example: Discovering the main themes (topics) in a collection of news articles.
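A minimal PCA sketch with scikit-learn; the synthetic data below is generated from two hidden factors so that reducing it to two components is meaningful:

```python
import numpy as np
from sklearn.decomposition import PCA

# Synthetic data: 10 observed features driven by 2 hidden factors plus noise.
rng = np.random.default_rng(5)
latent = rng.normal(size=(200, 2))
X = latent @ rng.normal(size=(2, 10)) + 0.1 * rng.normal(size=(200, 10))

# Keep the two directions of greatest variance.
pca = PCA(n_components=2)
X_reduced = pca.fit_transform(X)       # shape (200, 2)
print(pca.explained_variance_ratio_)   # variance captured per component
```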

Applications:

 Recommendation systems: Finding latent factors that influence user preferences (a small factorization sketch appears after this list).
 Natural Language Processing (NLP): Topic modeling with LDA.
 Data Compression: PCA for reducing the size of data while retaining important information.
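A simplified matrix-factorization sketch using a plain truncated SVD on a tiny, made-up rating matrix; it treats unrated cells as zeros, whereas real recommenders fit the factors only on observed ratings:

```python
import numpy as np

# Tiny user x item rating matrix (made-up values; 0 means "unrated").
R = np.array([[5, 4, 0, 1],
              [4, 5, 1, 0],
              [1, 0, 5, 4],
              [0, 1, 4, 5]], dtype=float)

# Factorize into rank-2 user and item factor matrices via truncated SVD.
U, s, Vt = np.linalg.svd(R, full_matrices=False)
k = 2
user_factors = U[:, :k] * np.sqrt(s[:k])     # one row of latent factors per user
item_factors = Vt[:k, :].T * np.sqrt(s[:k])  # one row of latent factors per item

# The low-rank reconstruction suggests scores for the unrated cells.
predicted = user_factors @ item_factors.T
print(np.round(predicted, 1))
```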

Summary Table:

Model Type | Purpose | Examples | Applications
Generative Models | Learn to model the data distribution and generate data | Naive Bayes, GANs, VAEs | Data generation, text/image synthesis
Mixture Models | Model data as coming from a mixture of distributions | Gaussian Mixture Models (GMM), EM Algorithm | Clustering, anomaly detection
Latent Factor Models | Discover hidden factors that explain observed data | PCA, Matrix Factorization, LDA | Dimensionality reduction, recommendation systems
