Exploring data leakage via supervised learning

Khaleghimoghaddam, Amir

Exploring data leakage via supervised learning

Files

Khaleghimoghaddam_Amir_McSc_CSCI_April_2020.pdf (4.6 MB)

Date

2020-04-20T15:03:53Z

Authors

Khaleghimoghaddam, Amir

Abstract

Data security includes but not limited to, data encryption, tokenization, and key management practices that protect data across all applications and platforms. In this thesis, I aim to explore whether any data leakage takes place in data encryption when encrypted data is analyzed using supervised machine learning techniques. In the literature, researchers studied reverse engineering the encrypted data or brute forcing the attacks against encryption algorithms in order to study data leakage. However, in this research, my goal is not to reverse engineer or brute force the ciphertext, but to explore whether a supervised learning algorithm could identify a pattern that could potentially leak data in ciphertext. To this end, I analyze four encryption algorithms using five supervised learning techniques on four different datasets. The results show that as the encryption algorithms get stronger, the data leakage decreases, even though the data leakage is never zero percent.

Keywords

machine learning, data mining, text classification, cyber security

URI

http://hdl.handle.net/10222/78640

Collections

Faculty of Graduate Studies Online Theses

Full item page

Exploring data leakage via supervised learning

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections