Show simple item record

dc.contributor.authorArndt, Daniel Joseph
dc.date.accessioned2012-08-20T12:38:19Z
dc.date.available2012-08-20T12:38:19Z
dc.date.issued2012-08-20
dc.identifier.urihttp://hdl.handle.net/10222/15279
dc.description.abstractEncrypted protocols, such as Secure Socket Layer (SSL), are becoming more prevalent because of the growing use of e-commerce, anonymity services, gaming and Peer-to-Peer (P2P) applications such as Skype and Gtalk. The objective of this work is two-fold. First, an investigation is provided into the identification of web browsing behaviour in SSL tunnels. To this end, C5.0, naive Bayesian, AdaBoost and Genetic Programming learning models are evaluated under training and test conditions from a network traffic capture. In these experiments flow based features are employed without using Internet Protocol (IP) addresses, source/destination ports or payload information. Results indicate that it is possible to identify web browsing behaviour in SSL encrypted tunnels. Test performance of ~95% detection rate and ~2% false positive rate is achieved with a C5.0 model for identifying SSL. ~98% detection rate and ~3% false positive rate is achieved with an AdaBoost model for identifying web browsing within these tunnels. Second, the identifying characteristics of SSL traffic are investigated, whereby a new tool is introduced to generate new flow statistics that focus on presenting the features in a unique way, using bins to represent distributions of measurements. These new features are tested using the best performers from previous experiments, C5.0 and AdaBoost, and increase detection rates by up to 32.40%, and lower false positive rates by as much as 54.73% on data sets that contain traffic from a different network than the training set was captured on. Furthermore, the new feature set out-preforms the old feature set in every case.en_US
dc.language.isoenen_US
dc.subjectencryptionen_US
dc.subjectnetwork securityen_US
dc.subjectmachine learningen_US
dc.subjectSSLen_US
dc.subjectsecure socket layeren_US
dc.titleAn Investigation of Using Machine Learning with Distribution Based Flow Features for Classifying SSL Encrypted Network Trafficen_US
dc.date.defence2012-08-13
dc.contributor.departmentFaculty of Computer Scienceen_US
dc.contributor.degreeMaster of Computer Scienceen_US
dc.contributor.external-examinerN/Aen_US
dc.contributor.graduate-coordinatorDr. Qigang Gaoen_US
dc.contributor.thesis-readerDr. Malcolm Heywooden_US
dc.contributor.thesis-readerDr. Srinivas Sampallien_US
dc.contributor.thesis-supervisorDr. Nur Zincir-Heywooden_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.manuscriptsNot Applicableen_US
dc.contributor.copyright-releaseNot Applicableen_US
 Find Full text

Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record