Edge/cloud slice resource allocation for beyond 5G networks with distributed LSTM

Ehsanian, Ali; Spyropoulos, Thrasyvoulos
PIMRC 2024, 35th IEEE International Symposium on Personal, Indoor and Mobile Radio Communications, 2-5 September 2024, Valencia, Spain

Efficient resource allocation among slices/users with different Service Level Agreements (SLAs) is a critical task in 5G+ networks, which has prompted recent research into Deep Neural Networks (DNNs). However, challenges arise when dealing with edge resources, including the ability to rapidly scale resources (in the order of milliseconds), and the cost of transmitting large data volumes to a cloud for centralized DNN-based processing. Addressing these issues, we introduce a novel architecture based on Distributed Deep Neural Networks (DDNN). This architecture features a compact set of DNN layers located at the network’s edge, designed to function as an autonomous resource allocation unit. Complementing this, there is an intelligent offloading mechanism that delegates a fraction of hard decisions to additional DNN layers situated in a remote cloud (when needed). To implement offloading, we propose a theoretically informed method that learns to mimic an oracle that knows which sample will benefit from additional processing in the cloud. We compare this to a previously proposed heuristic, based on a Bayesian-confidence mechanism. We investigate the interplay of (offline) joint training of the DDNN exits and the ML-based offloading mechanism, and demonstrate that our architecture resolves more than 50% of decisions at the edge with no additional penalty compared to centralized models as well as consistently outperforms previous methods.


Type:
Conférence
City:
Valencia
Date:
2024-09-02
Department:
Systèmes de Communication
Eurecom Ref:
7762
Copyright:
© 2024 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

PERMALINK : https://www.eurecom.fr/publication/7762