0% Complete
صفحه اصلی
/
ششمین کنفرانس بین المللی میکروالکترونیک ایران
Comparison between Hardware/Software Co-design of RiscV Vector and Scalar Implementation of Deep Neural Networks
نویسندگان :
Seyed Kian Mousavikia
1
Morteza Mousazadeh
2
1- دانشگاه ارومیه
2- دانشگاه ارومیه
کلمات کلیدی :
Deep Neural Networks،Field Programmable Gate Array،Hardware/Software Co-Design،Parallel Processing،RiscV،Vector Co-processor
چکیده :
This paper compares a hardware/software co-design of a RiscV vector with a RiscV scalar implementation of a deep neural network (DNN). For the vector implementation, all building blocks of a DNN are vectorized and written in vector intrinsic coding format. Focusing more on the convolution function as the main source of the latency, this function is written in a special parallel processing-favor method in the vector intrinsic level to boost execution speed. For the comparison, a sample scalar RiscV core is selected and paired with a vector-based RiscV co-processor. Also, the same sample DNN is implemented only on the scalar processor to demonstrate the speedup better. The system was implemented and tested on a field-programmable gate array (FPGA). As a result, the vector implementation outperformed the scalar version by a factor of 3 in terms of latency by only negligibly increasing the utilized sources on the FPGA.
لیست مقالات
لیست مقالات بایگانی شده
Design and Implementation of novel Microgrid Inverter
Amirhasan Sobhi - Alireza Zabihi - Sina Chartabi - Mina Salim
Base Transit Time Investigation of InP/InGaAs HBT Optoelectronic Mixer Using Different Base Doping Profiles
Hassan Kaatuzian - Mehrdad Ghasemi - Mahdi NoroozOliaei
Using GDI Structure in Hardware Implementation of Convolution Operation in Deep Neural Networks
Maedeh Kadkhodaie - Sayed Masoud Sayedi
A High-Precision Low-Dropout Regulator With High Current Efficiency and Slew-Rate Enhancement
Yeganeh Moradzadeh Rezaei - Mortaza Mojarad
Evaluation of Run-Time Energy Efficiency using Controlled Approximation in a RISC-V Core
Arvin Delavari - Faraz Ghoreishy - Hadi Shahriar Shahhoseini - Sattar Mirzakuchaki
Plasmonic CH4 Sensor Using an MIM Waveguide with a Hexagonal Cavity and Silver Square Island
Mohammad Ghanavati - Mohammad Azim Karami
Design and Simulation of a 2.4 GHz Class E Power Amplifier With High PAE and Linearity Improvement in 0.13μm CMOS Technology
Hamidreza Taghavi gharaghaji - Morteza Mojarad
Role of Doping Concentration of n- and p-Strip Regions on Optoelectronical Characterization in IBC-SHJ Solar Cell
Pegah Paknazar - Maryam Shakiba
Neural networks & logistic regression for FPGA hardware trojan detection
Milad Pazira - Yasser Baleghi - Mohammad-Ali Mahmoodpour
Design of long signal path Ternary computational blocks using Dynamic and Pass Transistor Logic based on Carbon Nanotube Field Effect Transistors
Farzin Mahboob Sardroudi - Mehdi Habibi - Mohammad Hossein Moaiyeri
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 41.1.2