Fake Review Detection Using Machine Learning Algorithm On Online Product Selling Platforms Publication Paper
ABSTRACT: Fake review detection attracts many researchers' attention because of its negative impact on society. Most existing fake review detection approaches focus mainly on semantic analysis of the review's contents. We propose a novel fake review detection technique based on logistic regression. The increasing popularity of online review systems motivates malevolent intent in competing sellers and service providers to manipulate consumers by fabricating product/service reviews. Immoral actors use Sybil accounts, bot farms, and purchased authentic accounts to promote products and vilify competitors. Facing the continuous advancement of review spamming techniques, the research community should step back, assess the approaches explored to date to combat fake reviews, and regroup to define new ones. This paper reviews the literature on Fake Review Detection (FRD) on online platforms. It covers both basic research and commercial solutions, and discusses the reasons behind the limited success that current approaches and regulations have had in preventing damage due to deceptive reviews.
KEYWORDS: Fake Review Detection, Web Scraping, Spam Review Detection, Machine Learning, Fraud Detection, Google Reviews.
I. INTRODUCTION
Predictive analytics tools are powered by several different models and algorithms that can be applied to a wide range of use cases. Determining which predictive modeling techniques are best for your company is key to getting the most out of a predictive analytics solution and leveraging data to make insightful decisions in the statistical context. Machine Learning is defined as an application of artificial intelligence where available information is used through algorithms to process or assist the processing of statistical data.
While Machine Learning involves concepts of automation, it requires human guidance. Machine Learning involves a high level of generalization in order to get a system that performs well on yet unseen data instances.
[1]. Machine learning is a relatively new discipline within Computer Science that provides a collection of data analysis techniques. Some of these techniques are based on well-established statistical methods (e.g. logistic regression and principal component analysis) while many others are not.
[2]. Most statistical techniques follow the paradigm of determining a particular probabilistic model that best describes observed data among a class of related models. Similarly, most machine learning techniques are designed to find models that best fit data (i.e. they solve certain optimization problems), except that these machine learning models are no longer restricted to probabilistic ones.
[3]. Therefore, an advantage of machine learning techniques over statistical ones is that the latter require underlying probabilistic models while the former do not. Even though some machine learning techniques use probabilistic models, the classical statistical techniques are most often too stringent for the oncoming Big Data era, because data sources are increasingly complex and multi-faceted. Prescribing probabilistic models relating variables from disparate data sources that are plausible and amenable to statistical analysis might be extremely difficult if not impossible.
[4]. While machine learning and predictive analytics can be a boon for any organization, implementing these solutions haphazardly, without considering how they will fit into everyday operations, will drastically hinder their ability to deliver the insights the organization needs.
[5]. A user review is a review conducted by any person who has access to the internet and publishes their experience to a review site or social media platform following product testing or the evaluation of a service. User reviews are commonly provided by consumers who volunteer to write the review, rather than professionals who are paid to evaluate the product or service.
DOI: 10.35629/5252-45122323 | Impact Factor value 7.429 | ISO 9001: 2008 Certified Journal Page 50
International Journal of Advances in Engineering and Management (IJAEM)
Volume 4, Issue 6, pp: 1243-1248 www.ijaem.net ISSN: 2395-5252
User reviews might be compared to professional nonprofit reviews from a consumer organization, or to promotional reviews from an advertiser or company marketing a product. Growth of social media platforms has enabled the facilitation of interaction between consumers after a review has been placed on online communities such as blogs, internet forums or other popular platforms.
II. SOFTWARE ANALYSIS
EXISTING SYSTEM
In the existing method, a multi-task learning model for fake review detection (FDML) has been presented, which is based on the following observations:
Certain topics have higher percentages of fake reviews. Certain news authors have higher intentions to publish fake news.
The FDML model investigates the impact of topic labels on fake reviews and introduces contextual information of news at the same time to boost detection performance on short fake reviews. The existing methods and regulations have not yet been able to eradicate the damaging effects of fake review activity in practice. We point at the difficulties associated with combating the different types of malignant influencers.
PROPOSED SYSTEM
In the proposed method, we propose a fake review detection technique based on logistic regression. For preprocessing, Natural Language Processing (NLP) steps are performed to extract information from the text data. After that, classification is performed using logistic regression.
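The two stages of the proposed system (NLP preprocessing followed by logistic regression classification) can be illustrated with a minimal sketch. The paper does not specify its preprocessing steps, feature representation, or training procedure, so everything below is an assumption made for illustration: the tokenizer, the stopword list, the bag-of-words features, the gradient-descent settings, and the six toy reviews (which are hypothetical and not drawn from any dataset).

```python
import math
import re
from collections import Counter

# Hypothetical toy data, NOT from the paper: label 1 = fake, 0 = genuine.
TRAIN = [
    ("best product ever buy now amazing amazing deal", 1),
    ("amazing best deal buy now buy now", 1),
    ("best ever amazing must buy", 1),
    ("battery lasts two days but the screen scratches easily", 0),
    ("delivery was late and the case feels cheap", 0),
    ("works as described, the screen is a bit dim", 0),
]

# Assumed stopword list; a real system would use a fuller one.
STOPWORDS = {"the", "a", "is", "and", "but", "was", "as"}


def tokenize(text):
    """Preprocessing: lowercase, keep alphabetic tokens, drop stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def build_vocab(texts):
    """Map every token seen in the training texts to a feature index."""
    tokens = sorted({tok for text in texts for tok in tokenize(text)})
    return {tok: i for i, tok in enumerate(tokens)}


def vectorize(text, vocab):
    """Bag-of-words count vector; tokens outside the vocabulary are ignored."""
    vec = [0.0] * len(vocab)
    for tok, count in Counter(tokenize(text)).items():
        if tok in vocab:
            vec[vocab[tok]] = float(count)
    return vec


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def train(X, y, lr=0.1, epochs=500):
    """Fit weights and bias by batch gradient descent on the log loss."""
    w = [0.0] * len(X[0])
    b = 0.0
    n = len(X)
    for _ in range(epochs):
        grad_w = [0.0] * len(w)
        grad_b = 0.0
        for xi, yi in zip(X, y):
            # Prediction error for one review: p(fake | x) - label.
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict_proba(text, vocab, w, b):
    """Estimated probability that a review is fake."""
    x = vectorize(text, vocab)
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


texts = [t for t, _ in TRAIN]
labels = [label for _, label in TRAIN]
vocab = build_vocab(texts)
w, b = train([vectorize(t, vocab) for t in texts], labels)
print(predict_proba("amazing deal, buy now", vocab, w, b))
```

A review would then be flagged as fake when the predicted probability exceeds a chosen threshold such as 0.5. In practice, a library implementation and a labeled review corpus would replace the hand-written training loop and the toy data.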
V. SYSTEM DESIGN
SYSTEM ARCHITECTURE
A system architecture is the conceptual model that defines the structure, behavior, and more views of a system. An architecture description is a formal description and representation of a system, organized in a way that supports reasoning about the structures and behaviors of the system.
UML DIAGRAMS
UML stands for Unified Modeling Language. UML is a standardized general-purpose modeling language in the field of object-oriented software engineering. The standard is managed, and was created by, the Object Management Group. The goal is for UML to become a common language for creating models of object-oriented computer software. In its current form UML comprises two major components: a meta-model and a notation. In the future, some form of method or process may also be added to, or associated with, UML. The UML represents a collection of best engineering practices that have proven successful in the modeling of large and complex systems.
USE CASE DIAGRAM
A use case diagram in the Unified Modeling Language (UML) is a type of behavioral diagram defined by and created from a use-case analysis. Its purpose is to present a graphical overview of the functionality provided by a system in terms of actors, their goals (represented as use cases), and any dependencies between those use cases.
SYSTEM TESTING