0% Complete
صفحه اصلی
/
ششمین کنفرانس بین المللی میکروالکترونیک ایران
Comparison between Hardware/Software Co-design of RiscV Vector and Scalar Implementation of Deep Neural Networks
نویسندگان :
Seyed Kian Mousavikia
1
Morteza Mousazadeh
2
1- دانشگاه ارومیه
2- دانشگاه ارومیه
کلمات کلیدی :
Deep Neural Networks،Field Programmable Gate Array،Hardware/Software Co-Design،Parallel Processing،RiscV،Vector Co-processor
چکیده :
This paper compares a hardware/software co-design of a RiscV vector with a RiscV scalar implementation of a deep neural network (DNN). For the vector implementation, all building blocks of a DNN are vectorized and written in vector intrinsic coding format. Focusing more on the convolution function as the main source of the latency, this function is written in a special parallel processing-favor method in the vector intrinsic level to boost execution speed. For the comparison, a sample scalar RiscV core is selected and paired with a vector-based RiscV co-processor. Also, the same sample DNN is implemented only on the scalar processor to demonstrate the speedup better. The system was implemented and tested on a field-programmable gate array (FPGA). As a result, the vector implementation outperformed the scalar version by a factor of 3 in terms of latency by only negligibly increasing the utilized sources on the FPGA.
لیست مقالات
لیست مقالات بایگانی شده
Optimizing High Dynamic Range Current Measurement Circuit for IoT Applications
Yas Hosseini Tehrani - ُSeyed Mojtaba Atarodi
Ultra Low Power SRAM-PUF for IoT Devices Based on CNTFETs
Alireza Shafiei - Mehrnaz Monajati
Evaluation of Run-Time Energy Efficiency using Controlled Approximation in a RISC-V Core
Arvin Delavari - Faraz Ghoreishy - Hadi Shahriar Shahhoseini - Sattar Mirzakuchaki
Synthesis of TiNb2O7 by mechanical alloying and subsequent heat treatment as an anode material for Li-ion batteries
Shiva Rashidi Kia - Mehdi Khodae
A Curvature Compensated CMOS Bandgap Voltage Reference With 6.8 ppm/°C Temperature Coefficient and Low Quiescent Current
Elaheh Pakravan - Mortaza Mojarad - Behboud Mashoufi
Design of long signal path Ternary computational blocks using Dynamic and Pass Transistor Logic based on Carbon Nanotube Field Effect Transistors
Farzin Mahboob Sardroudi - Mehdi Habibi - Mohammad Hossein Moaiyeri
Influence of piezoelectric actuator on the stability of micromechanical device via Casimir force
Fatemeh Mahdi Maleki - Fatemeh Tajik
Current-Mode Wideband Frontends With Linearity Enhancement for 5G Receivers
Adibeh Rahmani - Mortaza Mojarad - Seyed Sadra Kashef
A MEMS Resonant Pressure Sensore Based on 2D Graphene Material
Amir Noroolahi - Farshad Babazadeh
Design and Simulation of a 2.4 GHz Class E Power Amplifier With High PAE and Linearity Improvement in 0.13μm CMOS Technology
Hamidreza Taghavi gharaghaji - Morteza Mojarad
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 41.5.5