Crime Type and Occurrence Prediction Using Machine Learning Algorithm
Crime Type and Occurrence Prediction Using Machine Learning Algorithm
3,4,5
UG Scholar, Kongu Engineering College
1
[email protected]
2
[email protected]
3
[email protected]
4
[email protected]
5
[email protected]
Abstract - In this era of recent times, crime activities. Hence, use of machine learning techniques
has become an evident way of making people and and its records is required to predict the crime type
society under trouble. An increasing crime factor and patterns. It imposes the uses of existing crime
leads to an imbalance in the constituency of a data and predicts the crime type and its occurrence
country. In order to analyse and have a response bases on the location and time. Researchers
ahead this type of criminal activities, it is necessary undergone many studies that helps in analysing the
to understand the crime patterns. This study crime patterns along with their relations in a specific
imposes one such crime pattern analysis by using location. Some of the hotspots analysed has become
crime data obtained from Kaggle open source easier way of classifying the crime patterns. This
which in turn used for the prediction of most leads to assist the officials to resolve them faster. This
recently occurring crimes. The major aspect of this approach uses a dataset obtained from Kaggle open
project is to estimate which type of crime source based on various factors along with the time
contributes the most along with time period and and space where it occurs over a certain period of
location where it has happened. Some machine time. We implied a classification algorithm that helps
learning algorithms such as Naïve Bayes is implied in locating the type of crime and hotspots of the
in this work in order to classify among various criminal actions that takes place on the certain time
crime patterns and the accuracy achieved was and day. In this proposed one to impose a machine
comparatively high when compared to pre- learning algorithms to find the matching criminal
composed works. patterns along with the assist of its category with the
Keywords: Crime, Analyse, Crime patterns, given temporal and spatial data.
Kaggle, Estimate, Naïve Bayes, Accuracy
II. Literature Survey
I. Introduction
Crime are of different type that occurs at
Crime has become a major thread imposed
different locations around the various geographical
which is considered to grow relatively high in
location. Many research scholars have been
intensity. An action stated is said to be a crime, when
suggesting a mechanism to analyse the relationship
it violates the rule, against the government laws and it
between crime and social variables that includes
is highly offensive. The crime pattern analysis
unemployed individuals, earning amount, level of
requires a study in the different aspects of
education and so on.
criminology and also in indicating patterns. The
Suhong Kim and Param Joshi [1] proposed
Government has to spend a lot of time and work to
two different machine learning models which is used
imply technology to govern some of these criminal
for prediction, K nearest neighbour algorithm (KNN )
Authorized licensed use limited to: University of Prince Edward Island. Downloaded on May 15,2021 at 17:55:24 UTC from IEEE Xplore. Restrictions apply.
Proceedings of the International Conference on Artificial Intelligence and Smart Systems (ICAIS-2021)
IEEE Xplore Part Number: CFP21OAB-ART; ISBN: 978-1-7281-9537-7
and decision tree approach. The accuracy obtained over these recent years, system has to handle an
ranges between 39 to 44 percent when predicting enormous amount of data which requires more time to
crime patterns and finding the crime type. Benjamin analyse them manually. Hence, advance machine
Fredrick David. H [2] imposed a data mining learning approaches like K means clustering has been
technique that involves evaluating and inspect large used. A literature survey on Spatial and Temporal
pre-existing datasets in accordance to deliver more Hotspot prediction of crime [6] proposed a study to
information. The extraction of new patterns is cross categorize and evaluate the location and time of the
checked with predefined datasets available. crime hotspot detection techniques by performing
Shraddha S. Kavathekar [3] used association (SLR) Systematic Literature Review. Fuzhan Nasiri,
rule mining in predicting crimes. Some Machine Zakikhani, Kimiya and Tarek Zayed [7] suggested a
learning algorithms including Deep Neural Network failure prediction model that helps in detecting the
(DNN) and Artificial Neural Network (ANN) have corrosion in the pipelines of gas transmission. Most of
been implied. A deep neural network works more the prediction model depend absolutely on the
accurately using the feature level dataset. Using DNN, experimental tests data or involving some of the
entirely connected convolution layers has been used limited historical data records. This helps in ignoring
in building the prediction model, mainly for multi- the corrosion from various geographical
labelled data classification. It was implemented using circumstances. Nikhli Dubey and Setu K. Chaturvedi
Tenserflow that is an API mainly designed for Deep [8] imposed pertinent analysis of data mining
learning technique with the dropout layers. These approaches for the detection of the impeding future
findings suggest that when there is more count of crime. A Computational mechanism to classify the
missing values, there is a need for pre-processing crime using machine learning techniques [9] proposed
because crimes do not occur in the same manner but a malleable computational implementation tool to
focuses on some particular areas. Artificial Neural analyse the crime rate in a country helps in classifying
Network [ANN] is based on the prognosis by trend cybercrimes. Hyeon-Woo Kang and Hang-Bong Kang
analysis in solving problems. It comprises of [10] suggested a fusion method based on Deep Neural
enormous amount of processing constituent that Network in predicting the criminal activities from the
works altogether in building a model. Chandy and feature level data with sufficient parameters.
Abraham [4] proposed a random forest classifier in III. Existing System
extracting the features for data processing using cloud In pre-work, the dataset obtained from the
computing. The extracted features are request number, open source are first pre-processed to remove the
user identification, expiry time, time of arrival nd duplicated values and features. Decision tree has been
memory requirement. After feature extraction, the used in the factor of finding crime patterns and also
prediction of work load is done by using the trained extracting the features from large amount of data is
data that has been perceived from the learning stage inclusive. It provides a primary structure for further
that allows to learn the details of the extracted classification process. The classified crime patterns
features from user’s request. are feature extracted using Deep Neural network.
Rohit Patil, Muzamil Kacchi, Pranali Gavali Based on the prediction, the performance is calculated
and Komal Pimparia [5] suggests an Apriori for both trained and test values. The crime prediction
algorithm for frequent patterns and the result obtained helps in forecasting the future happening of any type
from K-means is used. Due to increase in crime rate
Authorized licensed use limited to: University of Prince Edward Island. Downloaded on May 15,2021 at 17:55:24 UTC from IEEE Xplore. Restrictions apply.
Proceedings of the International Conference on Artificial Intelligence and Smart Systems (ICAIS-2021)
IEEE Xplore Part Number: CFP21OAB-ART; ISBN: 978-1-7281-9537-7
Authorized licensed use limited to: University of Prince Edward Island. Downloaded on May 15,2021 at 17:55:24 UTC from IEEE Xplore. Restrictions apply.
Proceedings of the International Conference on Artificial Intelligence and Smart Systems (ICAIS-2021)
IEEE Xplore Part Number: CFP21OAB-ART; ISBN: 978-1-7281-9537-7
Authorized licensed use limited to: University of Prince Edward Island. Downloaded on May 15,2021 at 17:55:24 UTC from IEEE Xplore. Restrictions apply.
Proceedings of the International Conference on Artificial Intelligence and Smart Systems (ICAIS-2021)
IEEE Xplore Part Number: CFP21OAB-ART; ISBN: 978-1-7281-9537-7
NEIGHBORHOOD_ID IS_CRIME
montbello 1
gateway-green-valley-ranch 2
wellshire 3
belcaro 2
cherry-creek 2
Authorized licensed use limited to: University of Prince Edward Island. Downloaded on May 15,2021 at 17:55:24 UTC from IEEE Xplore. Restrictions apply.
Proceedings of the International Conference on Artificial Intelligence and Smart Systems (ICAIS-2021)
IEEE Xplore Part Number: CFP21OAB-ART; ISBN: 978-1-7281-9537-7
1. m represents Month
Fig 2. Plotting the highest occurrence month
2. t represents Time
3. a represents Area
4. d represents Day
5. y presents Year
6. c represents Type
Authorized licensed use limited to: University of Prince Edward Island. Downloaded on May 15,2021 at 17:55:24 UTC from IEEE Xplore. Restrictions apply.
Proceedings of the International Conference on Artificial Intelligence and Smart Systems (ICAIS-2021)
IEEE Xplore Part Number: CFP21OAB-ART; ISBN: 978-1-7281-9537-7
VIII. Conclusion
Authorized licensed use limited to: University of Prince Edward Island. Downloaded on May 15,2021 at 17:55:24 UTC from IEEE Xplore. Restrictions apply.
Proceedings of the International Conference on Artificial Intelligence and Smart Systems (ICAIS-2021)
IEEE Xplore Part Number: CFP21OAB-ART; ISBN: 978-1-7281-9537-7
Prediction using Data mining techniques”, Crime Offenses using Machine Learning”,
ICTACT Journal on Soft Computing on Sustainability Journals, Volume 12, Issue 10,
April 2012. Published on May 2020.
[3] Shruti S.Gosavi and Shraddha S. [10] Hyeon-Woo Kang and Hang-Bong Kang,
Kavathekar,“A Survey on Crime Occurrence “Prediction of crime occurrence from multi-
Detection and prediction Techniques”, modal data using deep learning”, Peer-
International Journal of Management, reviewed journal, published on April 2017.
Technology And Engineering , Volume 8,
Issue XII, December 2018.
[4] Chandy, Abraham, "Smart resource usage
prediction using cloud computing for
massive data processing systems" Journal of
Information Technology 1, no. 02 (2019):
108-118.
[5] Learning Rohit Patil, Muzamil Kacchi,
Pranali Gavali and Komal Pimparia, “Crime
Pattern Detection, Analysis & Prediction
using Machine”, International Research
Journal of Engineering and Technology,
(IRJET) e-ISSN: 2395-0056, Volume: 07,
Issue: 06, June 2020
[6] Umair Muneer Butt, Sukumar Letchmunan,
Fadratul Hafinaz Hassan, Mubashir Ali,
Anees Baqir and Hafiz Husnain Raza
Sherazi, “Spatio-Temporal Crime Hotspot
Detection and Prediction: A Systematic
Literature Review”, IEEE Transactions on
September 2020.
[7] Nasiri, Zakikhani, Kimiya and Tarek Zayed,
"A failure prediction model for corrosion in
gas transmission pipelines", Proceedings of
the Institution of Mechanical Engineers, Part
O: Journal of Risk and Reliability, (2020).
[8] Nikhil Dubey and Setu K. Chaturvedi, “A
Survey Paper on Crime Prediction Technique
Using Data Mining”, Corpus ID: 7997627,
Published on 2014.
[9] Rupa Ch, Thippa Reddy Gadekallu, Mustufa
Haider Abdi and Abdulrahman Al-Ahmari,
“Computational System to Classify Cyber
Authorized licensed use limited to: University of Prince Edward Island. Downloaded on May 15,2021 at 17:55:24 UTC from IEEE Xplore. Restrictions apply.