0% Complete
صفحه اصلی
/
ششمین کنفرانس بین المللی میکروالکترونیک ایران
Comparison between Hardware/Software Co-design of RiscV Vector and Scalar Implementation of Deep Neural Networks
نویسندگان :
Seyed Kian Mousavikia
1
Morteza Mousazadeh
2
1- دانشگاه ارومیه
2- دانشگاه ارومیه
کلمات کلیدی :
Deep Neural Networks،Field Programmable Gate Array،Hardware/Software Co-Design،Parallel Processing،RiscV،Vector Co-processor
چکیده :
This paper compares a hardware/software co-design of a RiscV vector with a RiscV scalar implementation of a deep neural network (DNN). For the vector implementation, all building blocks of a DNN are vectorized and written in vector intrinsic coding format. Focusing more on the convolution function as the main source of the latency, this function is written in a special parallel processing-favor method in the vector intrinsic level to boost execution speed. For the comparison, a sample scalar RiscV core is selected and paired with a vector-based RiscV co-processor. Also, the same sample DNN is implemented only on the scalar processor to demonstrate the speedup better. The system was implemented and tested on a field-programmable gate array (FPGA). As a result, the vector implementation outperformed the scalar version by a factor of 3 in terms of latency by only negligibly increasing the utilized sources on the FPGA.
لیست مقالات
لیست مقالات بایگانی شده
Plasmonic CH4 Sensor Using an MIM Waveguide with a Hexagonal Cavity and Silver Square Island
Mohammad Ghanavati - Mohammad Azim Karami
Study of Nanoscale Material NEMS Resonant Pressure Sensors: Simulation and Comparison
Amir Noroolahi - Abolfazl Hosseini
Frequency Response and Design Based on gm/ID of Amplifier in CNFET Technology
S. Mohammadali Zanjani - Mehdi Dolatshahi - Massoud Dousti - Zahra Alaie - Ata Jahangir Moshayedi - Arash Mehrabi
Analysis of electrostatic interaction between a charge trap and a quantum dot based single electron transistor
Fatemeh Hamedvasighi - Majid Shalchian
A Nanowatt Low Voltage Subthreshold CMOS Voltage Reference Based On 2-T
Nima Dehghan - Mohammad Yavari
A Low-Power Fully Differential LC Oscillator with Phase Noise Reduction for LTE Applications
Yeganeh Moradzadeh Rezaei - SIROUS TOOFAN - Jafar Sobhi
Broadened graded asymmetric waveguide structure for low divergence 915nm diode laser
Seyed Peyman Abbasi - Arash Hodaei
Design of Floating-Point Multiplier Architecture with Adaptive Data Timing Channels
Hoda Ghabeli
A Low-Power Inductor-Less Linear Wideband CMOS Balun-LNA Using Current Reuse And Linearity Techniques
Soroush Hashemi Bani - Mohammad Yavari
طراحی و شبیهسازی جمع کننده 64 بیتی سریع با استفاده از ترانزیستورهای نانو لوله کربنی (CNTFET)
علیرضا جعفری تازه کند - جواد جاویدان
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 41.5.5