Neural network splitting for energy-efficient Edge-AI
Daniel May

Funding year 2024 / Scholarship Call #19 / Scholarship ID: 7170

My thesis focuses on developing DynaSplit, a framework designed to optimize energy efficiency and latency for deploying deep neural networks in edge-cloud environments. The growing demand for intelligent applications on edge devices, such as IoT sensors and smartphones, poses significant challenges due to limited computational resources, energy constraints, and strict latency requirements.

DynaSplit addresses these challenges by jointly optimizing two things: the split point of the neural network (which layers run on the edge device and which run in the cloud) and hardware parameters such as CPU frequency and accelerator utilization. This joint optimization uses resources efficiently while keeping performance within Quality of Service (QoS) requirements.
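To make the idea of a joint configuration concrete, the following is a minimal sketch rather than DynaSplit's actual code. The `Config` fields, the layer names, and the frequency values are all hypothetical. It shows how a split point partitions a layer list and how split points combine with hardware settings into a search space.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Config:
    """One candidate configuration (hypothetical fields, for illustration only)."""
    split_point: int       # number of leading layers executed on the edge device
    cpu_freq_mhz: int      # edge CPU frequency setting
    use_accelerator: bool  # whether the edge accelerator is enabled


def split_layers(layers, split_point):
    """Return (edge_layers, cloud_layers) for the given split point."""
    if not 0 <= split_point <= len(layers):
        raise ValueError("split_point out of range")
    return layers[:split_point], layers[split_point:]


def search_space(num_layers, cpu_freqs, accel_options=(False, True)):
    """Enumerate every combination of split point and hardware setting."""
    return [
        Config(s, f, a)
        for s, f, a in product(range(num_layers + 1), cpu_freqs, accel_options)
    ]


# Hypothetical five-layer model and two assumed CPU frequencies.
layers = ["conv1", "conv2", "pool", "fc1", "fc2"]
edge, cloud = split_layers(layers, 2)
space = search_space(len(layers), cpu_freqs=(600, 1500))
```

A split point of 0 corresponds to cloud-only execution and a split point equal to the number of layers to edge-only execution, so both extremes are part of the same search space.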

The framework employs a two-phase methodology: an Offline Phase that uses multi-objective optimization to identify Pareto-optimal configurations, and an Online Phase that dynamically selects the best configuration for incoming inference requests. Experimental results on real-world AI models show that DynaSplit can reduce energy consumption by up to 72% compared to cloud-only solutions, while meeting 90% of user-defined latency constraints.
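The two phases can be illustrated with a small sketch under stated assumptions. The measurements below are invented, the brute-force Pareto filter stands in for whatever multi-objective optimizer the thesis actually uses, and the fallback to the fastest configuration when no candidate meets the latency budget is an assumption of this example, not a documented DynaSplit behaviour.

```python
from typing import NamedTuple


class Measured(NamedTuple):
    """A configuration with measured latency and energy (made-up numbers)."""
    name: str
    latency_ms: float
    energy_j: float


def pareto_front(configs):
    """Offline phase: keep configurations that no other configuration dominates."""
    front = []
    for c in configs:
        dominated = any(
            o.latency_ms <= c.latency_ms
            and o.energy_j <= c.energy_j
            and (o.latency_ms < c.latency_ms or o.energy_j < c.energy_j)
            for o in configs
        )
        if not dominated:
            front.append(c)
    return front


def select(front, latency_budget_ms):
    """Online phase: lowest-energy configuration that meets the latency budget."""
    feasible = [c for c in front if c.latency_ms <= latency_budget_ms]
    if feasible:
        return min(feasible, key=lambda c: c.energy_j)
    # Assumed fallback: no configuration fits, so pick the fastest one.
    return min(front, key=lambda c: c.latency_ms)


measurements = [
    Measured("cloud_only", 40.0, 10.0),
    Measured("split_a", 60.0, 5.0),
    Measured("split_c", 70.0, 6.0),    # dominated by split_a
    Measured("split_b", 90.0, 3.0),
    Measured("edge_only", 120.0, 4.0), # dominated by split_b
]

front = pareto_front(measurements)
choice = select(front, latency_budget_ms=65.0)
```

Because the expensive search happens offline, the online step reduces to a lookup over a short list, which keeps per-request overhead low.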

By enabling efficient and sustainable deployment of AI models in resource-constrained environments, DynaSplit demonstrates the potential of combining advanced optimization techniques with the flexibility of split computing.

University | University of Applied Sciences [University]

Technische Universität Wien

Subject area

Artificial Intelligence, Distributed Systems, Energy management, IoT, Artificial Intelligence / AI / Machine Learning, Mobile Devices

Target group

Public sector, System integrators, Technicians

Overall classification

Diploma thesis

Technology

AI, Client-server application, Cloud service, Python, Raspberry Pi, Sensor technology

Open-source software used

TensorFlow, Optuna, gRPC, scikit-learn, pandas, psutil, Paramiko

License

CC-BY-SA