A layer‐wise frequency scaling for a neural processing unit
نویسندگان
چکیده
Dynamic voltage frequency scaling (DVFS) has been widely adopted for run-time power management of various processing units. In the case neural units (NPUs), network applications is required to adjust and every layer consider behavior performance each layer. Unfortunately, DVFS inappropriate layer-wise NPUs due long latency compared with execution time. Because fast enough keep up layer, we propose a dynamic (DFS) technique an NPU. Our proposed DFS exploits highest under limit NPU To determine allowable frequency, build model predict consumption based on real measurement fabricated evaluation results show that our improves frame per second (FPS) by 33% saves energy 14% average, DVFS.
منابع مشابه
Statestream: a Toolbox to Explore Layerwise- Parallel Deep Neural Networks
Building deep neural networks to control autonomous agents which have to interact in real-time with the physical world, such as robots or automotive vehicles, requires a seamless integration of time into a network’s architecture. The central question of this work is, how the temporal nature of reality should be reflected in the execution of a deep neural network and its components. Most artific...
متن کاملA Novel Multiply-Accumulator Unit Bus Encoding Architecture for Image Processing Applications
In the CMOS circuit power dissipation is a major concern for VLSI functional units. With shrinking feature size, increased frequency and power dissipation on the data bus have become the most important factor compared to other parts of the functional units. One of the most important functional units in any processor is the Multiply-Accumulator unit (MAC). The current work focuses on the develop...
متن کاملpassivity in waiting for godot and endgame: a psychoanalytic reading
this study intends to investigate samuel beckett’s waiting for godot and endgame under the lacanian psychoanalysis. it begins by explaining the most important concepts of lacanian psychoanalysis. the beckettian characters are studied regarding their state of unconscious, and not the state of consciousness as is common in most beckett studies. according to lacan, language plays the sole role in ...
Precision Scaling of Neural Networks for Efficient Audio Processing
While deep neural networks have shown powerful performance in many audio applications, their large computation and memory demand has been a challenge for real-time processing. In this paper, we study the impact of scaling the precision of neural networks on the performance of two common audio processing tasks, namely, voice-activity detection and single-channel speech enhancement. We determine ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Etri Journal
سال: 2022
ISSN: ['1225-6463', '2233-7326']
DOI: https://doi.org/10.4218/etrij.2022-0094