
dc.contributor.author: Kubo, Yoshimasa
dc.date.accessioned: 2019-12-02T14:18:31Z
dc.date.available: 2019-12-02T14:18:31Z
dc.date.issued: 2019-12-02T14:18:31Z
dc.identifier.uri: http://hdl.handle.net/10222/76737
dc.description.abstract: Adding small perturbations to test images can drastically change the classification accuracy of machine learning models. These perturbed examples are called adversarial examples (Szegedy et al., 2013). Studying these examples may shed light on the learned structure in the network, as well as on the potential security threat that they pose for practical machine learning applications (Kurakin et al., 2016). Furthermore, since human observers can be fooled by adversarial examples (Elsayed et al., 2018), this study may also aid in preventing the manipulation of human observers' reactions. In this thesis, we first focus on understanding the cause of adversarial examples. Adding to the view of Galloway et al., we argue that overfitting is a contributing factor to adversarial examples, whereas other researchers have concluded that their cause is unrelated to overfitting. To make this argument, we pursue two directions: the first is to evaluate several standard regularization techniques against adversarial attacks, and the second is to evaluate stochastic binarized neural networks on adversarial examples. We report that strong regularization, including stochastic binarized neural networks, not only reduces overfitting but also helps the networks resist adversarial attacks. Furthermore, we introduce a model called the Stochastic-Gated Partially Binarized Network (SGBN), which incorporates binarization and input-dependent stochasticity. In particular, a gate module learns the probability that individual weights in the corresponding convolutional filters should be masked (turned on or off). The gate module itself consists of a shallow convolutional neural network, and its sigmoid outputs are stochastically binarized and pointwise multiplied with the corresponding filters in the convolutional layer of the main network. We test and compare our model with several related approaches under both white- and black-box attacks, and, to gain a better understanding of the model, we visualize the activations of some of the gating network outputs and their corresponding filters. Moreover, we apply a simplified version of SGBN in a toy experiment to examine how variable the activations of the gate modules can be.
dc.language.iso: en
dc.subject: Deep learning
dc.subject: Stochastic binarized network
dc.subject: Adversarial examples
dc.title: Learning Stochastic Weight Masking to Resist Adversarial Attacks
dc.date.defence: 2019-11-18
dc.contributor.department: Faculty of Computer Science
dc.contributor.degree: Doctor of Philosophy
dc.contributor.external-examiner: Dr. Pawan Lingras
dc.contributor.graduate-coordinator: Dr. Michael McAllister
dc.contributor.thesis-reader: Dr. Fernando Paulovich
dc.contributor.thesis-reader: Dr. Malcolm Heywood
dc.contributor.thesis-supervisor: Dr. Thomas Trappenberg
dc.contributor.thesis-supervisor: Dr. Sageev Oore
dc.contributor.ethics-approval: Not Applicable
dc.contributor.manuscripts: Yes
dc.contributor.copyright-release: Yes
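
The abstract describes the SGBN gating mechanism only in prose. The sketch below is a minimal, hypothetical illustration of that idea: a shallow gating network whose sigmoid outputs are stochastically binarized and multiplied pointwise with the filters of a convolutional layer. All module and variable names, layer sizes, the batch-averaging of gate probabilities, and the use of PyTorch are assumptions made here for illustration; this is not the thesis implementation.

import torch
import torch.nn as nn
import torch.nn.functional as F

class StochasticGatedConv2d(nn.Module):
    """Hypothetical sketch: a conv layer whose filters are masked by
    stochastically binarized sigmoid outputs of a shallow gating network."""

    def __init__(self, in_channels, out_channels, kernel_size):
        super().__init__()
        # Main convolution whose filters will be partially masked.
        self.conv = nn.Conv2d(in_channels, out_channels, kernel_size,
                              padding=kernel_size // 2)
        n_weights = out_channels * in_channels * kernel_size * kernel_size
        # Shallow, input-dependent gating network: one logit per weight
        # of the main filter bank (layer sizes are arbitrary choices).
        self.gate_conv = nn.Conv2d(in_channels, 16, 3, padding=1)
        self.gate_fc = nn.Linear(16, n_weights)

    def forward(self, x):
        # Sigmoid probabilities for masking each weight of the main filters.
        g = F.relu(self.gate_conv(x))
        g = F.adaptive_avg_pool2d(g, 1).flatten(1)          # (batch, 16)
        p = torch.sigmoid(self.gate_fc(g)).mean(dim=0)      # averaged over the batch for simplicity
        # Stochastic binarization: sample a 0/1 mask from the probabilities.
        # (Training through this sampling step would need a straight-through
        # estimator or similar; that is omitted in this sketch.)
        mask = torch.bernoulli(p).view_as(self.conv.weight)
        # Pointwise multiply the mask with the filters, then convolve.
        return F.conv2d(x, self.conv.weight * mask, self.conv.bias,
                        padding=self.conv.padding)

# Example usage on a random batch of 32x32 RGB images:
layer = StochasticGatedConv2d(3, 8, 3)
out = layer(torch.randn(4, 3, 32, 32))   # output shape: (4, 8, 32, 32)

Averaging the gate probabilities over the batch is a simplification for clarity; a fully input-dependent version, as described in the abstract, would sample a separate mask per example.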