6th Iranian International Conference on Microelectronics
Comparison between Hardware/Software Co-design of RiscV Vector and Scalar Implementation of Deep Neural Networks
Authors:
Seyed Kian Mousavikia 1
Morteza Mousazadeh 2
1- Urmia University
2- Urmia University
Keywords:
Deep Neural Networks, Field Programmable Gate Array, Hardware/Software Co-Design, Parallel Processing, RiscV, Vector Co-processor
Abstract:
This paper compares a hardware/software co-design of a RiscV vector implementation of a deep neural network (DNN) with a RiscV scalar implementation. For the vector implementation, all building blocks of the DNN are vectorized and written with vector intrinsics. Because the convolution function is the main source of latency, it is written at the vector-intrinsic level in a form that favors parallel processing, to boost execution speed. For the comparison, a sample scalar RiscV core is selected and paired with a vector-based RiscV co-processor. The same sample DNN is also implemented on the scalar processor alone, to show the speedup more clearly. The system was implemented and tested on a field-programmable gate array (FPGA). The vector implementation outperformed the scalar version by a factor of 3 in latency, with only a negligible increase in the FPGA resources used.
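The abstract does not show how the convolution is restructured for the vector co-processor. As a rough illustration only, and not the authors' code, the sketch below contrasts a scalar 1-D convolution (in the sliding dot-product form used in DNN layers) with a strip-mined version that computes a group of adjacent outputs together, which is the loop shape a vector unit can execute as one multiply-accumulate per kernel tap. It is plain C with no RiscV vector intrinsics. The lane count `VL`, the function names, the `int` element type, and the 1-D simplification are all assumptions made for this example.

```c
#include <assert.h>
#include <stddef.h>

#define VL 8 /* hypothetical lane count; real hardware reports its own vector length */

/* Scalar reference: one output element at a time. */
void conv1d_scalar(const int *in, size_t n, const int *k, size_t klen, int *out)
{
    assert(klen > 0 && n >= klen);
    for (size_t i = 0; i + klen <= n; i++) {
        int acc = 0;
        for (size_t j = 0; j < klen; j++)
            acc += in[i + j] * k[j];
        out[i] = acc;
    }
}

/* Strip-mined form: up to VL adjacent outputs are accumulated together.
 * The innermost lane loops stand in for single vector instructions. */
void conv1d_stripmined(const int *in, size_t n, const int *k, size_t klen, int *out)
{
    assert(klen > 0 && n >= klen);
    size_t n_out = n - klen + 1;
    for (size_t base = 0; base < n_out; base += VL) {
        /* Clamp the group size for the tail, as a vector-length setting would. */
        size_t vl = (n_out - base < VL) ? (n_out - base) : VL;
        int acc[VL] = {0};
        for (size_t j = 0; j < klen; j++) {
            int w = k[j]; /* one kernel weight applied to every lane */
            for (size_t lane = 0; lane < vl; lane++)
                acc[lane] += in[base + lane + j] * w; /* models a vector-scalar multiply-accumulate */
        }
        for (size_t lane = 0; lane < vl; lane++)
            out[base + lane] = acc[lane]; /* models a vector store */
    }
}
```

Both functions write `n - klen + 1` outputs and produce identical results. The second one only reorders the work, so that each kernel weight is applied across a whole group of outputs before moving to the next weight.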