ADDRESSING SUBJECTIVITY IN CODE SMELL DETECTION BY LEVERAGING HUMAN FEEDBACK IN A DEEP-LEARNING-BASED SOLUTION

Nandani, Himesh

dc.contributor.author	Nandani, Himesh
dc.date.accessioned	2023-12-18T14:09:37Z
dc.date.available	2023-12-18T14:09:37Z
dc.date.issued	2023-12-15
dc.identifier.uri	http://hdl.handle.net/10222/83298
dc.description.abstract	Code smells, being inherently subjective, can vary based on individual software developers' opinions and perspectives. Though many deep-learning-based models have been proposed for code smell detection, they often overlook the importance of considering each developer's subjective context, undermining the effectiveness of these methods. In this thesis, we address this limitation by (a) constructing a manually annotated dataset for three subjective code smells incorporating multiple annotations for each sample, (b) developing three state-of-the-art deep learning models with different architectures and comparing their performance on the dataset, and (c) conducting an extensive experiment using the best-performing deep-learning model to evaluate the impact of human feedback. We start by building a dataset containing 10,267 annotations for 5,192 code snippets, targeting three code smells at different granularity levels: multifaceted abstraction, complex method, and long parameter list. Additionally, we create a supplementary dataset comprising ‘definitely clean’ and ‘definitely smelly’ samples, identified using the thresholds during dataset construction. To facilitate developers' involvement, we designed a plugin for IntelliJ IDEA and developed a container-based web server to offer services of our baseline deep-learning model within the IDE. This setup lets developers see code smells within the IDE and provide feedback (i.e., whether a snippet has the identified smell and whether a snippet has a smell not identified by our approach). In a controlled experiment, we collected feedback on code smells from 14 participants in two rounds: one with our baseline model and the second with a fine-tuned model. In the second round, we fine-tuned the model based on the participants' feedback and reevaluated the smell detection performance before and after adjustment. The results demonstrate that such calibration improves the smell detection model's performance by 15.49% in the F1 score on average across the experimental group participants. Notably, this improvement is achieved while maintaining a low Krippendorff score, indicating that the smell detection model's ability is enhanced while considering the subjectivity of the feedback.	en_US
dc.language.iso	en	en_US
dc.subject	Code smells	en_US
dc.subject	software engineering	en_US
dc.title	ADDRESSING SUBJECTIVITY IN CODE SMELL DETECTION BY LEVERAGING HUMAN FEEDBACK IN A DEEP-LEARNING-BASED SOLUTION	en_US
dc.type	Thesis	en_US
dc.date.defence	2023-12-11
dc.contributor.department	Faculty of Computer Science	en_US
dc.contributor.degree	Master of Science	en_US
dc.contributor.external-examiner	n/a	en_US
dc.contributor.thesis-reader	Masud Rahman	en_US
dc.contributor.thesis-reader	Srini Sampalli	en_US
dc.contributor.thesis-supervisor	Tushar Sharma	en_US
dc.contributor.ethics-approval	Received	en_US
dc.contributor.manuscripts	Not Applicable	en_US
dc.contributor.copyright-release	No	en_US

Find Full text

Files in this item

Name:: HimeshNandani2023.pdf
Size:: 2.188Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Faculty of Graduate Studies Online Theses

Show simple item record