0% found this document useful (0 votes)

55 views14 pages

DSWFME - Naive Bayes

Uploaded by

olicematostrader

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

55 views14 pages

DSWFME - Naive Bayes

Uploaded by

olicematostrader

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

Data Science with Football Made Easy

The 15-Algorithm Playbook for Beginners

Naïve Bayes

Unpacking Premier League Scoring Chances with Bukayo Saka

@MartinOnData

version 1.1 [26-04-2024]

Copyright © 2024 Antoine Martin

Naïve Bayes
Unpacking Premier League Scoring Chances with Bukayo Saka

Naïve Bayes is a simple yet remarkably powerful algorithm for predictive modelling, particularly
well-suited for classification tasks. It is based on Bayes' Theorem, a foundational principle in
probability theory that describes the probability of an event, based on prior knowledge of
conditions related to the event. The "naïve" aspect comes from the algorithm's assumption that
the features it uses to make predictions are independent of each other, a simplification that might
not always hold true in real-world data but often works surprisingly well in practice.

In this short guide, you will learn everything you need to get started with Naïve Bayes using Bukayo
Saka’s Premier League shots on goal. We begin by unpacking the formal definition of Naïve Bayes
and the underlying Bayes’ Theorem (section 1), after which we translate this into practical terms
– how likely is Bukayo Saka to score a goal from the penalty box with his left foot (section 2). We
proceed with a step-by-step guide on how to compute this probability using R from the ground
up using real world data without installing any software on our computer (section 3). We conclude
with a discussion on the general applications of Naïve Bayes and potential future exercises
involving similar shooting data (section 4). Let’s dive in.

1. Formal Definition
At its core, Naïve Bayes leverages Bayes' Theorem to calculate the probability that a given data
point belongs to a certain class, given the data point's features. Bayes' Theorem is expressed as

𝑃(𝐵| 𝐴) ∗ 𝑃(𝐴)
𝑃(𝐴| 𝐵) =
𝑃(𝐵)

where:

- 𝑃(𝐴|𝐵) is the probability of event A happening given that B is true (posterior probability).
- 𝑃(𝐵|𝐴) is the probability of observing event B given that A is true (likelihood).
- 𝑃(𝐴) is the probability of observing event A (prior probability).
- 𝑃(𝐵) is the probability of observing event B (evidence).

2
Data Science with Football Made Easy: The 15-Algorithm Playbook for Beginners

Now let's repeat this definition only this time using Bukayo Saka's probability of scoring a goal,
given that he shoots with his left foot from the penalty line box (a distance of 16.5 meters from the
goal).

2. Practical Definition
Here is how the Bayes' Theorem is applied to calculate the probability of an event (Bukayo Saka
scoring a goal) based on specific conditions (shot taken with his left foot from 16.5 meters):

𝑃(𝑆ℎ𝑜𝑡 𝐹𝑒𝑎𝑡𝑢𝑟𝑒𝑠| 𝐺𝑜𝑎𝑙) ∗ 𝑃(𝐺𝑜𝑎𝑙)

𝑃(𝐺𝑜𝑎𝑙| 𝑆ℎ𝑜𝑡 𝐹𝑒𝑎𝑡𝑢𝑟𝑒𝑠) =
𝑃(𝑆ℎ𝑜𝑡 𝐹𝑒𝑎𝑡𝑢𝑟𝑒𝑠)

where:

- 𝑃(𝐺𝑜𝑎𝑙| 𝑆ℎ𝑜𝑡 𝐹𝑒𝑎𝑡𝑢𝑟𝑒𝑠) is the probability of scoring given the shot's characteristics.
- 𝑃(𝑆ℎ𝑜𝑡 𝐹𝑒𝑎𝑡𝑢𝑟𝑒𝑠| 𝐺𝑜𝑎𝑙) is the likelihood of observing these specific shot features (left
foot, 16.5 meters) when a goal is scored.
- 𝑃(𝐺𝑜𝑎𝑙) is the prior probability of scoring a goal, before considering the shot's specifics.
- 𝑃(𝑆ℎ𝑜𝑡 𝐹𝑒𝑎𝑡𝑢𝑟𝑒𝑠) is the probability of observing these specific shot characteristics,
irrespective of the scoring outcome.

Feature Independence: Note that Naïve Bayes simplifies the probability calculation by assuming
the shot features (distance to goal and foot used) affect the goal-scoring probability independently.
This assumption allows us to consider how each factor (shooting from 16.5 meters with the left
foot) individually influences Saka's likelihood of scoring, without intertwining their effects. This
assumption, although simplistic, allows for efficient computation and often yields robust results
despite its naivety.

Practical Calculation for Saka’s Shot: Imagine historical data shows:

- 𝑃(𝐺𝑜𝑎𝑙): The overall probability of scoring a goal, let's say Saka’s scores on 15% of his
shots.
- 𝑃(𝑆ℎ𝑜𝑡 𝐹𝑒𝑎𝑡𝑢𝑟𝑒𝑠| 𝐺𝑜𝑎𝑙): This would involve understanding the likelihood of the shot's
specifics given a goal is scored. For our example, if 40% of Saka's goals come from shots
16.5 meters out (𝑃(𝐷𝑖𝑠𝑡𝑎𝑛𝑐𝑒 < 16.5 𝑚| 𝐺𝑜𝑎𝑙)), and 70% of Saka's goals are scored with
the left foot (𝑃(𝐹𝑜𝑜𝑡 = 𝐿𝑒𝑓𝑡| 𝐺𝑜𝑎𝑙)), then to calculate 𝑃(𝑆ℎ𝑜𝑡 𝐹𝑒𝑎𝑡𝑢𝑟𝑒𝑠| 𝐺𝑜𝑎𝑙) under
the assumption of feature independence, we multiply those two likelihoods of the individual
shot features given a goal. This becomes 0.40 × 0.70 = 0.28 or 28%, representing the

3
Data Science with Football Made Easy: The 15-Algorithm Playbook for Beginners

combined likelihood of the specific shot features (16.5 m distance with the left foot) given
that a goal is scored.
- 𝑃(𝑆ℎ𝑜𝑡 𝐹𝑒𝑎𝑡𝑢𝑟𝑒𝑠): The probability of observing these shot features (left foot and 16.5
meters distance) in the general context of all shots taken. This factor is crucial as it helps
normalize our posterior probability, ensuring we are accounting for how common these
features are among all shots, not just the successful ones. Suppose 20% of all shots are from
16.5 meters out with the left foot.

So, based on the Naïve Bayes calculation with the given data, Saka has a 21% probability of scoring
a goal when he takes a shot with his left foot from 16.5 meters out.

0.28 ∗ 0.15 0.042

𝑃(𝐺𝑜𝑎𝑙| 𝑆ℎ𝑜𝑡 𝐹𝑒𝑎𝑡𝑢𝑟𝑒𝑠) = = = 0.21
0.2 0.2

Fantastic, now that you have grasped the concept and mechanics of Naïve Bayes, let's explore how
to compute the actual probability using real-world data in R (the figures used previously were for
illustrative purposes only).

3. Naïve Bayes with R

Before we start programming, here are a couple of important notes.

 The complete version of the code is easily downloadable here.

 To make the code accessible for individuals who do not have or prefer not to install
RStudio, all programming has been conducted using Google Colab. Google Colab is a free,
cloud-based platform that enables you to write and execute R (and Python) code directly
from your browser without any prior software installation. It is an ideal platform for
beginners owing to its simplicity and ease of use. For instructions on how to quickly get
started using it, please refer to Section 5 at the very end of this guide.

Once you have set up your account, we can proceed with some programming. We begin by
importing the tidyverse library for data wrangling, the worldfootballR package for football data
scraping, and e1071 for our Naïve Bayes algorithm functions. Keep in mind that two of these
packages must be installed before they can be loaded, as they are not included by default on the
Google Colab platform.

4
Data Science with Football Made Easy: The 15-Algorithm Playbook for Beginners

Feel free to comment out the install.packages lines (#) after installation (as illustrated below) if
you plan to rerun the entire code. This will prevent multiple installations and save you
approximately 19 seconds each time you run this part.

Before we conduct the analysis, we first need to collect our data. For the purposes of the analysis
in this guide, we will use Fbref.com, which hosts (among tons of other useful information) all
shots taken by each Premier League team. In the example below (the 2022/23 game between
Crystal Palace and Arsenal), we can easily observe that Saka took a total of three shots in the game
– all with his right foot – two of which were off target and one was blocked. In addition to the
body part with which the shot was taken and its outcome, we can also easily get the distance to
the goal in meters. So how do we get this kind of data for all Arsenal games? That is where
worldfootballR comes into play.

5
Data Science with Football Made Easy: The 15-Algorithm Playbook for Beginners

Source : https://fbref.com/en/matches/e62f6e78/Crystal-Palace-Arsenal-August-5-2022-Premier-League

Jason Zivkovic’s worldfootballR package facilitates data collection from FBREF by providing
clean wrappers for scraping various types of data from FBREF’s website. For a comprehensive
overview of the package’s capabilities, you may refer to the documentation here. To gather all of
Bukayo Saka’s shots, it is necessary to download data on all shots taken by Arsenal players. To
accomplish this, we download all links to Arsenal games using the following code snippet. It
utilizes the fb_match_urls function by specifying country = "ENG" for England, gender = "M"
for the male championship, season_end_year = 2023 for the 2022/23 season, and tier = "1st" for
the English Premier League. The second line filters all the links to include only Arsenal games.
Finally, we review the first few observations and the total number of fb_match_urls_arsenal
entries to ensure we have captured all 38 games of Arsenal's 2022/23 season. This code executes
in 4 seconds.

Once we have our links to all of Arsenal’s 2022/23 games, we loop over each one and extract the
all the shots taken by the two teams. We do that using the fb_match_shooting function. This
code executes in 2 minutes.

6
Data Science with Football Made Easy: The 15-Algorithm Playbook for Beginners

Next, we refine our df_raw data to retain only the shooting data from Arsenal players. We rename
the columns to lowercase, convert numeric variables to numeric types (if this hasn't been done
already), and finally, select only a few variables of interest. These include the date of the game, the
squad, the player name, the shot’s distance to the goal, the body part used for the shot, and the
outcome.

Below is what the head of the data should look like.

7
Data Science with Football Made Easy: The 15-Algorithm Playbook for Beginners

Next, we prepare the data for our analysis by focusing on shots taken by Bukayo Saka using either
his left or right foot. We also create a dummy variable to indicate whether each shot resulted in a
goal. Additionally, we convert the body_part variable into a factor, which is necessary for running
our Naïve Bayes function shortly.

Below is what the head of the data should look like.

The next code snippet fits the naïve bayes model using the naiveBayes function from the e1071
package.

8
Data Science with Football Made Easy: The 15-Algorithm Playbook for Beginners

Here is how to interpret the model’s results.

A-priori Probabilities: These are the model's initial guesses or "prior" probabilities about the
likelihood of each outcome (goal = 1, no goal = 0) before considering the specific features
(distance and body part) of the shots.

- 0 (No Goal): 0.8554217 chance. This means, based on the data provided to the model, about
85.54% of Bukayo Saka's shots did not result in a goal.
- 1 (Goal): 0.1445783 chance. Conversely, about 14.46% of his shots resulted in a goal.

These percentages reflect the overall distribution of goals and no-goals in the dataset used to train
the model.

Conditional Probabilities for `distance`: These show how the model views the relationship
between the distance of a shot and the outcome (goal or no goal), expressed in terms of mean
(average distance) and standard deviation (variation in distance).

- For No Goal (0): The average distance is about 16.77 meters, with a standard deviation of
5.58 meters. This means most shots that did not result in a goal were taken from around this
distance, give or take about 5.58 meters.

9
Data Science with Football Made Easy: The 15-Algorithm Playbook for Beginners

- For Goal (1): The average distance for goals is shorter, about 12.08 meters, with a standard
deviation of 5.52 meters. This suggests goals tend to come from closer shots, with similar
variability in distance as no-goal shots.

Conditional Probabilities for `body_part`: These indicate the likelihood of using a specific body
part for shots, given the outcome.

- Left Foot and No Goal: About 72% of Saka's no-goal shots were taken with the left foot.
- Right Foot and No Goal: About 28% of his no-goal shots used the right foot.
- Left Foot and Goal: For shots resulting in goals, approximately 75% were taken with the
left foot.
- Right Foot and Goal: Around 25% of goal shots used the right foot.

These proportions show a slight preference for the left foot in goal-scoring shots compared to
shots that did not result in goals.

Overall, the Naïve Bayes model tells us two main things about Bukayo Saka's shots:

1. General Chances: Before looking at where or how he shoots, he is more likely not to score
(85.54%) than to score (14.46%) based on past data.
2. Impact of Distance and Body Part:
a. Distance: Goals are generally scored from closer (about 12.08 meters on average),
while missed shots tend to come from a bit further away (about 16.77 meters on
average).
b. Body Part: Whether Saka scores or not, he predominantly uses his left foot, but
the model suggests a slightly higher proportion of left-footed shots among
successful goals than unsuccessful attempts.

And finally, here are the final calculations.

10
Data Science with Football Made Easy: The 15-Algorithm Playbook for Beginners

For a shot taken with the left foot from a distance of 16.5 meters, the model predicts there is an
approximately 11.5% probability that it will result in a goal (prob_goal). For a shot taken with the
right foot from the same distance, the probability of scoring is correspondingly lower, at
approximately 9.9%.

4. Conclusion
This tutorial demonstrated how to build a Naïve Bayes model to predict the likelihood of scoring
a goal based on the distance to the goal and the foot used to hit the ball. It is clear that we cannot
cover all aspects of Naïve Bayes in just ten pages. However, this provides a solid introduction to
a simple yet powerful model. If you are interested in further exploration of our football illustration,
you can rerun the analysis with different players, teams, leagues, or seasons. All these modifications
are easily achievable using the worldfootballR package with just a few adjustments in the function
specification parameters. Why not try rerunning the analysis using Jude Bellingham’s data from
the 2023/24 season? To do this, modify the fb_match_url parameters by setting the country to
“ESP” (instead of “ENG”) and the season_end_year to 2024. Next, filter the URLs to include
only those for “Real-Madrid” (instead of Arsenal). When preparing the df database, ensure the
squad is filtered to “Real Madrid.” Lastly, for the df_analysis, retain only the data pertaining to
Jude Bellingham and you will be ready to go .

11
Data Science with Football Made Easy: The 15-Algorithm Playbook for Beginners

Overall, despite its simplicity, Naïve Bayes can be incredibly effective in applications such as spam
detection and document classification. Additionally, it finds utility in financial modelling for credit
scoring and fraud detection, as well as in preliminary image classification tasks like facial
recognition. Despite the simplistic assumption that input features are independent—an
assumption often not held in real-world data—Naïve Bayes can deliver surprisingly accurate
results, particularly when the data is pre-processed appropriately. Its computational efficiency and
the minimal data requirement for reasonable predictions make it an invaluable tool in the data
scientist's toolkit, especially suitable for scenarios requiring quick decision-making and for use as
a baseline in complex predictive modelling.

5. Getting started with Google Colab for R programming

Google Colab is a free, cloud-based service that allows you to write and execute R (and Python)
code through your browser without any setup required. It is an ideal platform for beginners due
to its simplicity and accessibility. While experienced R programmers might use dedicated interfaces
like RStudio, Google Colab offers a straightforward alternative that is perfect for those just starting
their programming journey. Here is how to get set up:

Opening a New R Notebook in Google Colab

1. Navigate to Google Colab: Go to https://colab.research.google.com and sign in with

your Google account.
2. Create a New Notebook: Once you are on the Colab dashboard, click on `File` in the
top menu, then select `New notebook`. By default, this will create a Python notebook.

3. Switch to R: To change the notebook to R, click on the small arrow next to the `Connect`
button in the top-right corner. Then click on ‘Change runtime type’.

12
Data Science with Football Made Easy: The 15-Algorithm Playbook for Beginners

Then select `R` from the dropdown menu under `Runtime type`, and save. The notebook will
refresh, and you can now start writing R code.

4. Write Your First R Code: Start with something simple to ensure everything is set up
correctly. For example, type `print("Hello, R in Colab!")` and press `Shift + Enter` to run
the cell. You should see the output directly below the code.

To add a new piece of code you can either continue writing in the same cell or create a new one
by clicking on the ‘+ Code’ button in the upper left side.

13
Data Science with Football Made Easy: The 15-Algorithm Playbook for Beginners

Soccer Match Prediction Analysis
No ratings yet
Soccer Match Prediction Analysis
25 pages
Football Match Prediction Guide
No ratings yet
Football Match Prediction Guide
17 pages
The Bivariate Poisson Distribution
No ratings yet
The Bivariate Poisson Distribution
45 pages
Key Principles For Data Skills 2024
No ratings yet
Key Principles For Data Skills 2024
11 pages
Tennis Match Outcome Prediction
No ratings yet
Tennis Match Outcome Prediction
6 pages
NNMANUAL
No ratings yet
NNMANUAL
72 pages
Paola Zuccolotto - Marica Manisera - Basketball Data Science - With Applications in R-CRC Press (2020)
No ratings yet
Paola Zuccolotto - Marica Manisera - Basketball Data Science - With Applications in R-CRC Press (2020)
245 pages
TheChampSystem Free
No ratings yet
TheChampSystem Free
41 pages
Simulating A Basketball Match With A Homogeneous Markov Model and Forecasting The Outcome
100% (1)
Simulating A Basketball Match With A Homogeneous Markov Model and Forecasting The Outcome
11 pages
R Is For Racing: Colin Magee January 2019
No ratings yet
R Is For Racing: Colin Magee January 2019
26 pages
Sports Betting Pure: The Educated Bet, by S.K. (Ebook)
No ratings yet
Sports Betting Pure: The Educated Bet, by S.K. (Ebook)
51 pages
Machine Learning Predicts Polish Horse Racing
No ratings yet
Machine Learning Predicts Polish Horse Racing
36 pages
Football Scores The Poisson Distribution and 30 Ye
No ratings yet
Football Scores The Poisson Distribution and 30 Ye
7 pages
EPL Match Prediction Using ML
No ratings yet
EPL Match Prediction Using ML
9 pages
Statistics From Basics To Advanced
No ratings yet
Statistics From Basics To Advanced
25 pages
The Exterminator Sports Betting System Review
No ratings yet
The Exterminator Sports Betting System Review
5 pages
Football Betting 사설토토 Tips - How november 23 Without Losing Your Shirt
No ratings yet
Football Betting 사설토토 Tips - How november 23 Without Losing Your Shirt
2 pages
The NBA and Fatigue
100% (1)
The NBA and Fatigue
42 pages
Calculus in Football 1
0% (1)
Calculus in Football 1
4 pages
Tennis Winner Prediction Based On Time-Series
No ratings yet
Tennis Winner Prediction Based On Time-Series
6 pages
Probability & Expected Value Basics
No ratings yet
Probability & Expected Value Basics
2 pages
Sports Betting Tracker
No ratings yet
Sports Betting Tracker
78 pages
Fantasy Football Revamp
0% (1)
Fantasy Football Revamp
5 pages
Betting Tracker v2 21 Advanced
No ratings yet
Betting Tracker v2 21 Advanced
76 pages
Script
No ratings yet
Script
4 pages
User Manual: One-Click To MAXIMISING The Returns From Your Arbitrage Trade
No ratings yet
User Manual: One-Click To MAXIMISING The Returns From Your Arbitrage Trade
18 pages
Modeling Basketball's Points Per Possession With Application To Predicting The Outcome of College Basketball Games
No ratings yet
Modeling Basketball's Points Per Possession With Application To Predicting The Outcome of College Basketball Games
19 pages
Log-Linear Poisson Models Soccer
No ratings yet
Log-Linear Poisson Models Soccer
78 pages
Jump Racing For Profit
No ratings yet
Jump Racing For Profit
62 pages
A Bradley-Terry Type Model For Forecasting Tennis Match Results
No ratings yet
A Bradley-Terry Type Model For Forecasting Tennis Match Results
12 pages
Football Alchemy 2014 - THE Profitable Sports Betting Investment
0% (1)
Football Alchemy 2014 - THE Profitable Sports Betting Investment
11 pages
Introduction To NFL Analytics With R (Bradley J. Congelio) (Z-Library)
No ratings yet
Introduction To NFL Analytics With R (Bradley J. Congelio) (Z-Library)
383 pages
Strategy PDF
No ratings yet
Strategy PDF
5 pages
2 Odds Football Predictions For From Top Experts
No ratings yet
2 Odds Football Predictions For From Top Experts
1 page
Betting for Statistical Clarity
No ratings yet
Betting for Statistical Clarity
30 pages
Notes On ML
No ratings yet
Notes On ML
42 pages
English Football Clubs' Profit Motives
No ratings yet
English Football Clubs' Profit Motives
34 pages
Sports Intro.
No ratings yet
Sports Intro.
33 pages
Betfair Strategy Testing Guide
No ratings yet
Betfair Strategy Testing Guide
95 pages
How A Moneyline Works in Sports Betting
No ratings yet
How A Moneyline Works in Sports Betting
4 pages
Betting Bankroll Management Guide
No ratings yet
Betting Bankroll Management Guide
4 pages
Sports Betting Odds and Profit PDF - Google Search
No ratings yet
Sports Betting Odds and Profit PDF - Google Search
1 page
Modelling Prices of In-Play Football Betting Markets
No ratings yet
Modelling Prices of In-Play Football Betting Markets
24 pages
Feature Engineering and Selection: A Practical Approach For Predictive Models 1st Edition Max Kuhn Download PDF
No ratings yet
Feature Engineering and Selection: A Practical Approach For Predictive Models 1st Edition Max Kuhn Download PDF
50 pages
Pest Analysis of Soccer Football Manufacturing Company
100% (1)
Pest Analysis of Soccer Football Manufacturing Company
6 pages
Mathematics in Sports Analytics PDF
No ratings yet
Mathematics in Sports Analytics PDF
14 pages
The Application of Machine Learning For Sport Result Prediction A Review
No ratings yet
The Application of Machine Learning For Sport Result Prediction A Review
49 pages
Winning Sports Betting - Masaru Kanemoto
No ratings yet
Winning Sports Betting - Masaru Kanemoto
175 pages
Horse Racing Prediction at The Champ de Mars Using A Weighted Probabilistic Approach
No ratings yet
Horse Racing Prediction at The Champ de Mars Using A Weighted Probabilistic Approach
4 pages
Betting Traker V2
No ratings yet
Betting Traker V2
87 pages
Beating Bookmakers: Football Betting Strategy
No ratings yet
Beating Bookmakers: Football Betting Strategy
31 pages
Betting Odds Bias Analysis
No ratings yet
Betting Odds Bias Analysis
15 pages
Statistical Sports Models in Excel Download Instantly
0% (1)
Statistical Sports Models in Excel Download Instantly
308 pages
Westgate NFL Betting Lines
No ratings yet
Westgate NFL Betting Lines
18 pages
One and Two Logistic
No ratings yet
One and Two Logistic
17 pages
Bayesian xG: Player & Position Impact
100% (1)
Bayesian xG: Player & Position Impact
28 pages
Deriving A Model To Calculate The Probability of Scoring A Goal From Every Shooting Position in The Football Pitch and Applying It To Predict The XG For Different Matches.
No ratings yet
Deriving A Model To Calculate The Probability of Scoring A Goal From Every Shooting Position in The Football Pitch and Applying It To Predict The XG For Different Matches.
28 pages
Module 2 - Metrics & Terminology
No ratings yet
Module 2 - Metrics & Terminology
4 pages
1.8 Scale of Measurements
No ratings yet
1.8 Scale of Measurements
12 pages
IP Project File (Utkarsh)
No ratings yet
IP Project File (Utkarsh)
28 pages
Centrifugal Pumps
100% (2)
Centrifugal Pumps
70 pages
Eatonroadranger RTOF-14909MLL
No ratings yet
Eatonroadranger RTOF-14909MLL
32 pages
Hcr2 Multi Scripts Gemz
No ratings yet
Hcr2 Multi Scripts Gemz
6 pages
DeepSeek-R1: Enhancing LLM Reasoning via RL
No ratings yet
DeepSeek-R1: Enhancing LLM Reasoning via RL
22 pages
8.CNS Assignment
No ratings yet
8.CNS Assignment
2 pages
Solar Power Plant
No ratings yet
Solar Power Plant
20 pages
Purchase Info Record
No ratings yet
Purchase Info Record
14 pages
Ficom Fireman Intercom System: Typical Single Line Diagram
No ratings yet
Ficom Fireman Intercom System: Typical Single Line Diagram
1 page
Understanding Fire Extinguishers - A Guide by Palmer Asia Inc.
No ratings yet
Understanding Fire Extinguishers - A Guide by Palmer Asia Inc.
3 pages
Strings
No ratings yet
Strings
5 pages
25 Best Mathematics Questions For CUET
No ratings yet
25 Best Mathematics Questions For CUET
127 pages
Information Brochure - MBA CET 2024 - Final
No ratings yet
Information Brochure - MBA CET 2024 - Final
22 pages
Media PDF On Vocational Education
No ratings yet
Media PDF On Vocational Education
20 pages
AlterG VIA
No ratings yet
AlterG VIA
2 pages
Thanh Tran: Work Experience
No ratings yet
Thanh Tran: Work Experience
3 pages
Abhibus AY5681641197
No ratings yet
Abhibus AY5681641197
2 pages
All Best Links For All (Ever Green)
No ratings yet
All Best Links For All (Ever Green)
12 pages
K-Means for Customer Segmentation
No ratings yet
K-Means for Customer Segmentation
13 pages
Absolute Value and Integers
No ratings yet
Absolute Value and Integers
2 pages
SARSA Reinforcement Learning Algorithm
No ratings yet
SARSA Reinforcement Learning Algorithm
5 pages
Saep 325
No ratings yet
Saep 325
43 pages
Podar International School Caiet Form
No ratings yet
Podar International School Caiet Form
2 pages
Fp-510 Opacity Pigment: Technical Data Sheet 24 JAN 2012
No ratings yet
Fp-510 Opacity Pigment: Technical Data Sheet 24 JAN 2012
1 page
ZTP Configuration Elements
No ratings yet
ZTP Configuration Elements
3 pages
Learn Python 3 - Modules Cheatsheet - Codecademy
No ratings yet
Learn Python 3 - Modules Cheatsheet - Codecademy
4 pages
Operation Manual (Betriebsanleitung)
No ratings yet
Operation Manual (Betriebsanleitung)
481 pages
Siebel Development & EIM Guide
No ratings yet
Siebel Development & EIM Guide
10 pages
UI Final Project
No ratings yet
UI Final Project
11 pages

DSWFME - Naive Bayes

Uploaded by

DSWFME - Naive Bayes

Uploaded by

Data Science with Football Made Easy

The 15-Algorithm Playbook for Beginners

Unpacking Premier League Scoring Chances with Bukayo Saka

version 1.1 [26-04-2024]

Copyright © 2024 Antoine Martin

𝑃(𝑆ℎ𝑜𝑡 𝐹𝑒𝑎𝑡𝑢𝑟𝑒𝑠| 𝐺𝑜𝑎𝑙) ∗ 𝑃(𝐺𝑜𝑎𝑙)

Practical Calculation for Saka’s Shot: Imagine historical data shows:

0.28 ∗ 0.15 0.042

3. Naïve Bayes with R

 The complete version of the code is easily downloadable here.

Below is what the head of the data should look like.

Below is what the head of the data should look like.

Here is how to interpret the model’s results.

And finally, here are the final calculations.

5. Getting started with Google Colab for R programming

Opening a New R Notebook in Google Colab

1. Navigate to Google Colab: Go to https://colab.research.google.com and sign in with

You might also like