Tactics Overview for Implementing High-Performance Computing on Embedded Platforms |
Paper ID : 1161-ASAT19 |
Authors: |
Ahmed Waleed Elshazly *1, Ahmed Elliethy2, Mohamed Abdelmoneim Taha Elshafey3 1Computer engineering dept.,military technical college ,cairo,egypt 2Department of Computers, MTC 3Department of Computer Engineering, Military Technical College, Cairo, Egypt. |
Abstract: |
Future space missions will rely on novel high-performance computing to support advanced intelligent on-board algorithms with substantial workloads that mandates firm real-time and power constraints requirements. Consequently, these advanced algorithms require significantly faster processing beyond the conventional space-grade central processing unit capabilities. Moreover, they require careful selection of the target embedded platform from a diverse set of available architectures along with several implementation tactics to map the algorithms to the target architecture to fully unlock its capabilities. Moreover, we present a comprehensive overview of recent implementation tactics such as source code mapping and transformations. Additionally, we highlight some optimization techniques such as partitioning and co-designing using hardware accelerators. Finally, we discuss several implementation analysis methodologies to derive optimized code implementations. The top ranked YOLO-v3, as a deep learning based object detection algorithm, is selected as a case study model to be optimized using OpenVINO toolkit. The experimental results show an improvement ratios up to 73%, 41%, and 34% in terms of frames per second, CPU utilization, and cache memory, respectively. The study presented in this paper aims to guide the researchers in the field of high performance embedded computing in terms of different hardware architectures along with several implementation tactics. |
Keywords: |
High-Performance Computing, OBC Architectures, Code Optimizations, Code Transformations, Hardware Accelerators, OpenVINO, YOLO-v3 |
Status : Conditional Accept (Oral Presentation) |