Credit Card Fraud Detection: Title
Credit Card Fraud Detection: Title
ABSTRACT
Credit card fraud detection is presently the most frequently occurring problem
in the present world. This is due to the rise in both online transactions and e-
commerce platforms. Credit card fraud generally happens when the card was
stolen for any of the unauthorized purposes or even when the fraudster uses
the credit card information for his use. In the present world, we are facing a lot
of credit card problems. To detect the fraudulent activities the credit card
fraud detection system was introduced. This project aims to focus mainly on
machine learning algorithms. The algorithms used are random forest algorithm
and the Adaboost algorithm. The results of the two algorithms are based on
accuracy, precision, recall, and F1-score. The ROC curve is plotted based on the
confusion matrix. The Random Forest and the Adaboost algorithms are
compared and the algorithm that has the greatest accuracy, precision, recall,
and F1-score is considered as the best algorithm that is used to detect the
fraud.
In this work, almost every simple yet best language programming and
applications (Technology Stack) are used, and are listed as follows.
● PYTHON
● MACHINE LEARNING
● ANACONDA
Review of Literature:
Fraud act as the unlawful or criminal deception intended to result in financial
or personal benefit. It is a deliberate act that is against the law, rule or policy
with an aim to attain unauthorized financial benefit.
There have also been efforts to progress from a completely new aspect.
Attempts have been made to improve the alert-feedback interaction in case of
fraudulent transaction.
Artificial Genetic Algorithm, one of the approaches that shed new light in this
domain, countered fraud from a different direction.
The basic rough architecture diagram can be represented with the following
figure:
When looked at in detail on a larger scale along with real life elements, the full
architecture diagram can be represented as follows:
First of all, we obtained our dataset from Kaggle, a data analysis website
which provides datasets.
Inside this dataset, there are 31 columns out of which 28 are named as v1-v28
to protect sensitive data.
The other columns represent Time, Amount and Class. Time shows the time
gap between the first transaction and the following one. Amount is the
amount of money transacted. Class 0 represents a valid transaction and 1
represents a fraudulent one.
Advantage:
With this level of control, fraudsters don't have the chance to make multiple
transactions on a stolen or counterfeit card before the cardholder is aware of
the fraudulent activity. This alone can save a significant amount of money that
would traditionally be lost to fraud.
CONCLUSION:
Credit card fraud is without a doubt an act of criminal dishonesty. This article
has listed out the most common methods of fraud along with their detection
methods and reviewed recent findings in this field. This paper has also
explained in detail, how machine learning can be applied to get better results
in fraud detection along with the algorithm, pseudocode, explanation its
implementation and experimentation results.