0% Complete
صفحه اصلی
/
ششمین کنفرانس بین المللی میکروالکترونیک ایران
Comparison between Hardware/Software Co-design of RiscV Vector and Scalar Implementation of Deep Neural Networks
نویسندگان :
Seyed Kian Mousavikia
1
Morteza Mousazadeh
2
1- دانشگاه ارومیه
2- دانشگاه ارومیه
کلمات کلیدی :
Deep Neural Networks،Field Programmable Gate Array،Hardware/Software Co-Design،Parallel Processing،RiscV،Vector Co-processor
چکیده :
This paper compares a hardware/software co-design of a RiscV vector with a RiscV scalar implementation of a deep neural network (DNN). For the vector implementation, all building blocks of a DNN are vectorized and written in vector intrinsic coding format. Focusing more on the convolution function as the main source of the latency, this function is written in a special parallel processing-favor method in the vector intrinsic level to boost execution speed. For the comparison, a sample scalar RiscV core is selected and paired with a vector-based RiscV co-processor. Also, the same sample DNN is implemented only on the scalar processor to demonstrate the speedup better. The system was implemented and tested on a field-programmable gate array (FPGA). As a result, the vector implementation outperformed the scalar version by a factor of 3 in terms of latency by only negligibly increasing the utilized sources on the FPGA.
لیست مقالات
لیست مقالات بایگانی شده
Design and Numerical Assessment of a Novel Dielectrophoretic Microfluidic Chip to Separate CTCs
Fatemeh Ghaffari - Hadi Veladi
Numerical Investigation of Electroosmotic Micromixing: Performance Comparison Between Models With Different Frequencies
Ali Adnan Banna - Sajad Rezazadeh - Haleh Sadeghi
A High-Speed, Tunable Dead-Zone Phase-Frequency Detector
Zaher Kakehbra - Khayrollah Hadidi
A Low-Power Fully Differential LC Oscillator with Phase Noise Reduction for LTE Applications
Yeganeh Moradzadeh Rezaei - SIROUS TOOFAN - Jafar Sobhi
Possible Teleportation of Quantum States using Squeezed Sources and Photonic Integrated Circuits
Mobin Motaharifar - Hassan Kaatuzian - Mahmood Hasani
Adaptive Oversampling-based CDR with Phase Correction for Low-Cost FPGAs
Amin Khalilzadegan - Asal Malekara - Amir Fathi - Mir Majid Ghasemi
Design and simulation of a sensor for detecting tuberculosis bacteria by Regarding the self weight
Mohadeseh Ebrahimian Pirbazari - Mir Majid Ghasemi - Saeed Afrang
Passive Component Area Optimization of Fully Integrated Hybrid Switched Capacitor Converter
Hossein Bemanalizadeh - Nasrin Rezaei-Hosseinabadi - S.Ali Khajehoddin
A Novel Approach for Offline and Online Application-Dependent testing of FPGA interconnects
Ahmad Menbari - Hemn Rahimi - Hadi Jahanirad
Modeling GaN-HEMT Electrostatic Band Diagram under full depletion approximation
Behnam Jafari Touchaei - Majid Shalchian
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.9.1