6th Iranian International Conference on Microelectronics
Comparison between Hardware/Software Co-design of RiscV Vector and Scalar Implementation of Deep Neural Networks
Authors:
Seyed Kian Mousavikia (1), Morteza Mousazadeh (2)
1- Urmia University
2- Urmia University
Keywords:
Deep Neural Networks, Field Programmable Gate Array, Hardware/Software Co-Design, Parallel Processing, RiscV, Vector Co-processor
Abstract:
This paper compares a hardware/software co-design of a deep neural network (DNN) on a RiscV vector processor with a RiscV scalar implementation of the same network. For the vector implementation, all building blocks of the DNN are vectorized and written using vector intrinsics. Because the convolution function is the main source of latency, it is written at the vector-intrinsic level in a form that favors parallel processing, in order to boost execution speed. For the comparison, a sample scalar RiscV core is selected and paired with a vector-based RiscV co-processor. The same sample DNN is also implemented on the scalar processor alone, so that the speedup can be measured against a scalar baseline. The system was implemented and tested on a field-programmable gate array (FPGA). The vector implementation outperformed the scalar version by a factor of 3 in latency, with only a negligible increase in the FPGA resources used.
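To illustrate the kind of restructuring the abstract describes, the following is a minimal sketch in portable C, not the authors' code. It contrasts a scalar 1-D convolution with a strip-mined version in which each outer step produces a group of outputs, which is the loop shape that vector intrinsics typically take. The names `conv1d_scalar`, `conv1d_stripmined`, and the lane count `LANES` are hypothetical; on real vector hardware the inner lane loop would be replaced by a single vector load and a vector multiply-accumulate, and `vl` would come from the hardware rather than from a constant.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical number of 32-bit lanes handled per "vector" step. */
#define LANES 8

/* Scalar baseline: valid 1-D convolution in the cross-correlation form
   commonly used in DNNs. Writes n - k + 1 outputs. */
void conv1d_scalar(const float *in, size_t n,
                   const float *w, size_t k, float *out)
{
    assert(k >= 1 && k <= n);
    for (size_t i = 0; i + k <= n; i++) {
        float acc = 0.0f;
        for (size_t j = 0; j < k; j++)
            acc += in[i + j] * w[j];
        out[i] = acc;
    }
}

/* Strip-mined version: each outer iteration computes up to LANES outputs.
   The loops over l emulate what one vector instruction would do. */
void conv1d_stripmined(const float *in, size_t n,
                       const float *w, size_t k, float *out)
{
    assert(k >= 1 && k <= n);
    size_t n_out = n - k + 1;
    for (size_t i = 0; i < n_out; ) {
        /* Active vector length for this strip (shorter on the tail). */
        size_t vl = (n_out - i < LANES) ? (n_out - i) : LANES;
        float acc[LANES] = {0};
        for (size_t j = 0; j < k; j++) {
            /* Emulated: load vl inputs starting at in[i + j], then
               multiply by the scalar weight w[j] and accumulate. */
            for (size_t l = 0; l < vl; l++)
                acc[l] += in[i + j + l] * w[j];
        }
        /* Emulated vector store of vl results. */
        for (size_t l = 0; l < vl; l++)
            out[i + l] = acc[l];
        i += vl;
    }
}
```

Both functions accumulate the kernel taps in the same order for every output, so they are intended to produce the same results; only the loop structure differs. The reported factor-of-3 speedup comes from the paper's FPGA measurements and is not something this sketch reproduces.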
List of papers
List of archived papers
A Verilog-A Based Dynamic Model for MEMS Accelerometer Sensors
Kamyab Karimi Sarableh - Farshad Gozalpour - Sepehr Zare Teimoori - Rasoul Fathipour
Magnetic Properties of Permalloy (Co-Ni-Fe) Electroplated Film on Graphene-Oxide (GO) Thin Film Based on Copper Substrate
Ali Rezaei
Properties of Co-Ni-Fe Electroplated Thin Film on Indium Tin Oxide Coated Transparent Polymer Substrate
Ali Rezaei
Design and Simulation of a Quaternary Up/Down Counter Using 32nm-CNTFET Technology
Javad Javidan
Design of a Refractive-Index-Based Terahertz Sensor for Material Characterization
Soheil Hadipour - Pejman Rezaei
A Bootstrapped Switch Based Efficient CMOS Full-Wave Active Rectifier for Biomedical Implants
Mahmood Alibakhshi - Farshad Gozalpour - Yarallah Koolivand
An Analog Computing Engine for Convolutional Neural Networks Using the Gm-Scaling Technique
Erfan Bostanchi - Morteza Mousazadeh
Adaptive Oversampling-based CDR with Phase Correction for Low-Cost FPGAs
Amin Khalilzadegan - Asal Malekara - Amir Fathi - Mir Majid Ghasemi
Analysis of electrostatic interaction between a charge trap and a quantum dot based single electron transistor
Fatemeh Hamedvasighi - Majid Shalchian
Systematic Design of a Silicon-Nitride-Based Photonic Waveguide in the Visible Light Range
Afshin Ahmadpour - Amir Habibzadeh-Sharif - Faezeh Bahrami Chenaghlou