Problem Statement

In this project we are going to build a simple neural network (NN) to perform credit card fraud detection.

The data contains customer features and a binary label, where "1" indicates a positive case (fraud) and "0" indicates a negative case (legitimate).

Before modeling, you will need to perform a train-test split to reserve part of your data for validation. One difficulty with fraud detection data is that you usually have only a few positive data points, a.k.a. the imbalanced dataset problem. You need to think about strategies for splitting and sampling your data that avoid the issues caused by this imbalance.
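One simple strategy for the split itself is stratified sampling, which keeps the fraud ratio the same in both the training and test sets. Below is a minimal sketch; it assumes the data has already been loaded into a pandas DataFrame named df with a binary label column named Class (placeholder names; adjust them to match your actual dataset).

# Minimal sketch of a stratified train-test split.
# Assumes a pandas DataFrame `df` whose label column is named "Class".
from sklearn.model_selection import train_test_split

X = df.drop(columns=["Class"])   # feature columns
y = df["Class"]                  # binary label: 1 = fraud, 0 = legitimate

# stratify=y keeps the fraud/legitimate ratio identical in both splits,
# so the test set is guaranteed to contain some positive cases.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)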

Imbalanced data can cause the following issues. Suppose we have 100 data points and only two of them are positive:

1. You will likely have 0 positive cases in your test data when you split randomly.

2. A model that simply predicts every case to be negative achieves 98% accuracy, but does that mean it is a good model?

Therefore, accuracy should not be the sole evaluation metric. Precision, recall, F1 score, and AUC are better-suited metrics to consider.

A confusion matrix illustrates this problem well by breaking predictions down into true/false positives and true/false negatives.
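As a sketch of how the confusion matrix and these metrics can be computed with scikit-learn, assume y_test holds the true labels, y_pred the predicted classes, and y_score the predicted fraud probabilities (all placeholder names):

# Sketch of the metrics discussed above.
from sklearn.metrics import (confusion_matrix, precision_score,
                             recall_score, f1_score, roc_auc_score)

print(confusion_matrix(y_test, y_pred))               # [[TN FP], [FN TP]]
print("precision:", precision_score(y_test, y_pred))  # TP / (TP + FP)
print("recall:   ", recall_score(y_test, y_pred))     # TP / (TP + FN)
print("F1:       ", f1_score(y_test, y_pred))
print("AUC:      ", roc_auc_score(y_test, y_score))   # uses probabilities, not classes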


We will use Keras, a high-level open-source deep learning library that lets users build and train NN models efficiently without explicitly programming the NN internals.

Keras official documentation: https://keras.io/

To summarize, you will proceed through the project in the following steps:

1. Load the data and briefly explore which columns are suitable to use as features and what the distribution of the label looks like.
2. Preprocess your feature set. If your data has a significant imbalance problem, think of strategies for drawing a representative validation set. Also think about how to re-balance your TRAINING dataset.
3. Once you have clean X_train, X_test, Y_train and Y_test, start building your NN. Using Keras, this is as simple as a few lines of code, and varying your network structure is like playing with LEGO! (See the sketch after this list.)
4. Train your model on your training set. You may play with hyperparameters such as the number of layers, the number of hidden neurons, different activation functions, the learning rate, and the number of epochs to improve your model's performance.
5. Evaluate your model with your reserved validation set and report your performance in accuracy, precision and recall. Discuss briefly what you observe from these metrics.
6. [Bonus] Try to implement another classic ML algorithm and compare it with your NN in terms of performance and computation time.
7. [Bonus 2] Learn more about and apply dropout, batch normalization and weight initialization to see if you can further improve your model.
8. [Bonus 3] Implement cross validation.
9. [Bonus 4] Evaluate using AUC: https://towardsdatascience.com/understanding-auc-roc-curve-68b2303cc9c5
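As referenced in step 3, below is a minimal Keras sketch covering steps 3 to 5. The layer sizes, learning rate, number of epochs and the class-weight re-balancing are illustrative assumptions, not prescribed settings; X_train, y_train, X_test, y_test are the arrays produced by your split (the steps above call the labels Y_train/Y_test).

# Minimal Keras sketch for steps 3-5 (hyperparameters are illustrative; tune them).
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    layers.Input(shape=(X_train.shape[1],)),
    layers.Dense(16, activation="relu"),
    layers.Dense(8, activation="relu"),
    layers.Dense(1, activation="sigmoid"),   # predicted probability of fraud
])

model.compile(
    optimizer=keras.optimizers.Adam(learning_rate=1e-3),
    loss="binary_crossentropy",
    metrics=["accuracy", keras.metrics.Precision(), keras.metrics.Recall()],
)

# One simple way to re-balance the TRAINING data: weight the rare
# positive class more heavily in the loss.
n_neg, n_pos = np.bincount(np.asarray(y_train).astype(int))
class_weight = {0: 1.0, 1: n_neg / n_pos}

model.fit(X_train, y_train, epochs=20, batch_size=256,
          class_weight=class_weight, validation_split=0.1)

# Evaluate on the reserved test set and report the metrics from step 5.
loss, acc, prec, rec = model.evaluate(X_test, y_test, verbose=0)
print(f"accuracy={acc:.3f}  precision={prec:.3f}  recall={rec:.3f}")

Weighting the loss with class_weight is only one possible re-balancing strategy; oversampling the minority class in the training set is another common option.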

A final hint: NNs can be expensive to train. If you do not have powerful hardware, start with a mini NN and test how well your machine handles the computation.
