Deep Transfer Learning For IoT Attack Detection

security of smart devices using AI approach

Uploaded by

Amna Safder

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views10 pages

Deep Transfer Learning For IoT Attack Detection

security of smart devices using AI approach

Uploaded by

Amna Safder

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Received April 8, 2020, accepted May 24, 2020, date of publication June 8, 2020, date of current version June

18, 2020.
Digital Object Identifier 10.1109/ACCESS.2020.3000476

Deep Transfer Learning for IoT Attack Detection

LY VU1 , QUANG UY NGUYEN 1 , DIEP N. NGUYEN 2 , (Senior Member, IEEE),
DINH THAI HOANG 2 , (Member, IEEE), AND ERYK DUTKIEWICZ 2 , (Senior Member, IEEE)
1 Faculty of Information Technology, Le Quy Don Technical University, Hanoi 11917, Vietnam
2 School of Electrical and Data Engineering, University of Technology Sydney, Ultimo, NSW 2007, Australia
Corresponding author: Quang Uy Nguyen ([email protected])
This work was supported by the Vietnam National Foundation for Science and Technology Development (NAFOSTED) under Grant
102.05-2019.05.

ABSTRACT The digital revolution has substantially changed our lives in which Internet-of-Things (IoT)
plays a prominent role. The rapid development of IoT to most corners of life, however, leads to various
emerging cybersecurity threats. Therefore, detecting and preventing potential attacks in IoT networks have
recently attracted paramount interest from both academia and industry. Among various attack detection
approaches, machine learning-based methods, especially deep learning, have demonstrated great potential
thanks to their early detecting capability. However, these machine learning techniques only work well when
a huge volume of data from IoT devices with label information can be collected. Nevertheless, the labeling
process is usually time consuming and expensive, thus, it may not be able to adapt with quick evolving IoT
attacks in reality. In this paper, we propose a novel deep transfer learning (DTL) method that allows to learn
from data collected from multiple IoT devices in which not all of them are labeled. Specifically, we develop a
DTL model based on two AutoEncoders (AEs). The first AE (AE1 ) is trained on the source datasets (source
domains) in the supervised mode using the label information and the second AE (AE2 ) is trained on the target
datasets (target domains) in an unsupervised manner without label information. The transfer learning process
attempts to force the latent representation (the bottleneck layer) of AE2 similarly to the latent representation
of AE1 . After that, the latent representation of AE2 is used to detect attacks in the incoming samples in the
target domain. We carry out intensive experiments on nine recent IoT datasets to evaluate the performance
of the proposed model. The experimental results demonstrate that the proposed DTL model significantly
improves the accuracy in detecting IoT attacks compared to the baseline deep learning technique and two
recent DTL approaches.

INDEX TERMS Deep transfer learning, IoT, cyberattack detection, AutoEncoder.

I. INTRODUCTION cyber attacks than computers [2], [3]. Consequently, detect-

The Internet-of-Things (IoT) refers to connected devices, ing attacks to protect IoT devices from malicious behaviors
sensors, an actuators used in vehicles, electronic appliances, is critical to broadening the applications of IoT [4]–[7].
buildings, and structures. As the sensors, data storage, and the IoT attack detection methods can be categorized into
Internet become cheaper, faster, and more integrated together, signature-based and machine learning-based methods
IoT devices will find more and more applications [1] (e.g., [8]–[10]. The signature-based methods [11]–[14] seek to find
in smart buildings, smart city, intelligent transportation sys- the signatures of IoT attacks in the incoming traffic. These
tems, and healthcare). The rapid development of IoT to most methods require a high prior knowledge of known IoT attacks
corners of life, however, leads to various emerging cyberse- to define the signatures. The machine learning-based meth-
curity threats. This is because IoT devices are often limited ods, on the other hand, attempt to learn the features of normal
in computing capability and energy, making them particu- and malicious data in the training/offline phase. In the pre-
larly vulnerable to adversaries. IoT devices are more exposed dicting/online phase, these models are used to detect attacks
to and unfortunately more difficult to be protected from in the incoming traffic. Thanks to the capability to auto-
matically and progressively learn useful information/features
The associate editor coordinating the review of this manuscript and from collected data, machine-learning based methods can
approving it for publication was Omid Kavehei . early detect various IoT attacks [3], [9], [15]–[17].

This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
VOLUME 8, 2020 107335
L. Vu et al.: DTL for IoT Attack Detection

However, the machine learning-based methods only per- provides detailed analysis and discussion related to exper-
form well under an important assumption, i.e., the distri- imental results. Finally, Section VII concludes with future
butions of the training data and the predicting data are work.
similar [18]. Nevertheless, in many practical applications,
this assumption may not be always the case [19], [20]. II. RELATED WORK
Especially, in network security, new types of attacks (e.g., There are two main directions for cyberattack detection,
zero-day attacks) can be found on a daily basis [16]. As such, i.e., signature-based and machine learning-based approaches,
the practical IoT data for machine learning models (in the e.g., [8]–[10], [21]. The signature-based methods maintain a
predicting/online phase) is usually very much different from database of predefined signatures (i.e., patterns) that corre-
the data used during the training/offline phase. To alleviate spond to IoT known attacks and perform the detection task
the above problem, a large volume of training data with by comparing these to the incoming data stream [11]–[13],
label from multiple IoT devices is often required. However, [24]. Zhang and Green II [11] proposed a lightweight and
manually labeling a huge volume of data is very time con- low-complexity algorithm to prevent Distributed Denial of
suming and expensive [21], [22]. It, thus, limits the practical Service (DDoS) attacks in which each IoT working node has
deployment of machine learning-based methods in detecting a deep packet inspection to find attack signatures. If a sender
IoT attacks for various scenarios. repeatedly sends requests with the same content, it will be
Given the above, this work proposes a novel deep trans- flagged as malicious requests. Dietz et al. [12] proposed a
fer learning (DTL) approach based on AutoEncoder (AE) solution to proactively block the spreading of IoT attacks
to enable further applications of machine learning in IoT and isolate vulnerable IoT devices. Each IoT device is ver-
attack detection. The proposed model is referred to as Multi- ified in two steps, i.e., scanning to open ports and services
Maximum Mean Discrepancy AE (MMD-AE). MMD-AE and using predefined list of commonly known credentials to
can be trained on a dataset including both labeled samples check authentication. After that, a list of predefined rules is
(in the source domain) and unlabeled samples (in the target used to isolate the vulnerable IoT devices. Nobakht et al. [13]
domain). After training, MMD-AE is used to predict IoT proposed a solution for IoT attack detection using Software
attacks in the incoming traffic in the target domain. Specif- Defined Network with the OpenFlow protocol to address
ically, MMD-AE consists of two AEs: AE1 and AE2 . AE1 malicious behaviours and block intruders from accessing
in trained with labeled data while AE2 is trained on the the IoT devices. This method incorporates a database of
unlabeled data. The whole model, i.e., MMD-AE, is trained all known in-home IoT devices along with the correspond-
to drive the latent representation of AE2 closely to the latent ing patterns of potential security risks. Then, the detection
representation of AE1 . As a result, the latent representation method simply maps the IoT traffic with the signatures of
of AE2 can be used to classify the unlabeled IoT data in the security risks stored in the database. The advantage of the
target domain. The major contributions of this paper are as signature-based methods is providing a low false positive
follows: rate attack detection system [24]. However, they require
• We propose a novel DTL model based on AEs, a prior human knowledge about the behaviours of known
i.e., MMD-AE, that allows to transfer knowledge, IoT attacks to design the database of attack signatures. Thus,
i.e., labeled information, from the source domain to the the accuracy of these methods depends on the quality of
target domain. This model helps to lessen the problem the signature databases. Moreover, if the size of databases
of ‘‘lack label information’’ in collected traffic datasets is increased, the processing time (i.e., search time) can be
from IoT devices. excessive [24].
• We introduce the Maximum Mean Discrepancy (MMD) The machine learning-based methods first train the detec-
metric to minimize the distance between multiple hidden tion models from collected data samples in IoT networks.
layers of AE1 and multiple hidden layers of AE2 . This Then, the trained models are used to classify the new incom-
metric helps to improve the effectiveness of knowledge ing IoT data samples into normal or attack data. The pop-
transferred from the source to the target domain in IoT ular traditional machine learning algorithms for IoT attack
attack detection systems. detection are Decision tree (C4.5), Support Vector Machine
• We experiment our proposed method using nine IoT (SVM), K-Nearest Neighbour, Bayes Classifier, Neural Net-
attack datasets and compare its performance with the works [8], [24]. Recently, the deep learning approach is
canonical deep learning model and the state-of-the-art widely used and achieved high performance in detect-
TL models [18], [31]. The experimental results demon- ing cyberattacks [3], [9], [15]–[17]. Among, deep learning
strate the advantage of our proposed model against the approaches, AE-based models project the original data to a
other tested methods. new latent representation space to improve the accuracy in
The rest of paper is organized as follows. Section II high- detection tasks [3], [15], [16]. Nevertheless, to train a good
lights recent works on IoT attack detection. In Section III, machine learning model for detecting IoT attacks, it is usually
we define a DTL model and briefly describe the AE archi- required to label a huge volume of training data as normal or
tecture. The proposed model is then presented in Section IV. attack [24]. Moreover, general machine learning models often
Section V discusses the experiment settings and Section VI need to assume that the data distribution of training datasets