Addressing Reproducibility and Energy-efficiency in AI Deployments

Sobhani, Ghazal

View/Open

GhazalSobhani2024.pdf (3.786Mb)

Date

2024-08-30

Author

Sobhani, Ghazal

Metadata

Show full item record

Abstract

Deploying AI models on edge devices presents challenges in ensuring reliable and energy-efficient operations. Edge AI processes data directly on devices like IoT sensors and industrial machinery, enabling real-time decision-making and reducing latency, which is crucial for applications such as autonomous driving and robotics. However, deploying these models often involves custom Infrastructure as Code (IaC) scripts, and a lack of reproducibility in these scripts can cause inconsistencies and affect system reliability. Additionally, while advancements in hardware like SoCs, FPGAs, and AI accelerators have improved Edge AI capabilities, these deployments can lead to high energy consumption. Our research addresses these challenges through two main contributions. First, we identify and categorize reproducibility smells in IaC scripts, particularly focusing on an automation platform, Ansible, that allows imperative infrastructure configuration. We developed a tool, Reduse, to detect these reproducibility smells, in the pursuit to ensure that IaC scripts are reliable and consistent. Our empirical study reveals the occurrence of these smells in open-source projects, with significant correlations and co-occurrence patterns among them. For instance, the broken dependency chain smell was found in approximately 71% of Ansible tasks analyzed, highlighting common reproducibility issues. Second, we comprehensively evaluate the selection of AI models on edge devices, including the Raspberry Pi, NVIDIA Jetson Nano, and Intel Neural Compute Stick. By measuring inference power consumption, accuracy, inference time, and memory utilization, we offer insights into the performance and energy efficiency trade-offs of these models. For instance, Jetson Nano provides the best accuracy at the cost of a high energy budget. Thus, our work advances the field of edge AI with the best practices in IaC, contributing to more reliable and effective AI deployments in real- world scenarios.

URI

http://hdl.handle.net/10222/84544

Subject

Collections

Faculty of Graduate Studies Online Theses

Find Full text