0% Complete
صفحه اصلی
/
ششمین کنفرانس بین المللی میکروالکترونیک ایران
Comparison between Hardware/Software Co-design of RiscV Vector and Scalar Implementation of Deep Neural Networks
نویسندگان :
Seyed Kian Mousavikia
1
Morteza Mousazadeh
2
1- دانشگاه ارومیه
2- دانشگاه ارومیه
کلمات کلیدی :
Deep Neural Networks،Field Programmable Gate Array،Hardware/Software Co-Design،Parallel Processing،RiscV،Vector Co-processor
چکیده :
This paper compares a hardware/software co-design of a RiscV vector with a RiscV scalar implementation of a deep neural network (DNN). For the vector implementation, all building blocks of a DNN are vectorized and written in vector intrinsic coding format. Focusing more on the convolution function as the main source of the latency, this function is written in a special parallel processing-favor method in the vector intrinsic level to boost execution speed. For the comparison, a sample scalar RiscV core is selected and paired with a vector-based RiscV co-processor. Also, the same sample DNN is implemented only on the scalar processor to demonstrate the speedup better. The system was implemented and tested on a field-programmable gate array (FPGA). As a result, the vector implementation outperformed the scalar version by a factor of 3 in terms of latency by only negligibly increasing the utilized sources on the FPGA.
لیست مقالات
لیست مقالات بایگانی شده
طراحی و شبیه سازی سوییچ و مدولاتور الکترواپتیک مبتنی بر گرافن در طیف مادون قرمز میا نی با بهره گیری از ساختار JFET
توفیق نورمحمدی
Beyond Silicon: A Roadmap for Graphene-Based Field-Effect Transistors in RF Amplifiers, Biosensing, and Integrated Flexible Electronics
Sina Pourhajiagha Golestani - Esmaeil Najafiaghdam - Kiyumars Jalili
Design and Implementation of novel Microgrid Inverter
Amirhasan Sobhi - Alireza Zabihi - Sina Chartabi - Mina Salim
Few-layered phosphorene synthesis by CVD approach as an anode for sodium-ion battery
Pooya Dehghan - Soraya Hoornam - Zeinab Sanaee - Shams Mohajerzadeh
Design and Simulation of a 2.4 GHz Class E Power Amplifier With High PAE and Linearity Improvement in 0.13μm CMOS Technology
Hamidreza Taghavi gharaghaji - Morteza Mojarad
Optimum Design of GaAs/AlGaAs Surface-Relief VCSELs with Single-Mode Operation at 808 nm
Hassan Hooshdar Rostami - Vahid Ahmadi - Saeed Pahlavan
OptiCore: A Novel Approach for Designing a Low Power Multi-Core Processor
Abolfazl Rajaiyan - Yas Hosseini Tehrani - Seyed Mojtaba Atarodi
Dual-Mode Gain Control in a Two-Stage Programmable Gain Amplifier
Javad Mostofisharq
High-Speed AQAR Circuit Using Schmitt Trigger in 65nm CMOS for High Photon Rate Detection
Kiana Pasha - Sayed Masoud Sayedi
Current Source Impedance Effect on Static Performance of Hybrid R-2R DAC
Mohsen Azimikia - Arash Esmaili - Hadiseh Babazadeh
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.6.0