The Sixth Iranian International Conference on Microelectronics
Comparison between Hardware/Software Co-design of RiscV Vector and Scalar Implementation of Deep Neural Networks
Authors:
Seyed Kian Mousavikia (1)
Morteza Mousazadeh (2)
1- Urmia University
2- Urmia University
Keywords:
Deep Neural Networks, Field Programmable Gate Array, Hardware/Software Co-Design, Parallel Processing, RiscV, Vector Co-processor
Abstract:
This paper compares a hardware/software co-design of a RiscV vector implementation of a deep neural network (DNN) with a RiscV scalar implementation. For the vector implementation, all building blocks of the DNN are vectorized and written with vector intrinsics. Because convolution is the main source of latency, this function is written at the vector intrinsic level in a form that favors parallel processing, to boost execution speed. For the comparison, a sample scalar RiscV core is selected and paired with a vector-based RiscV co-processor. The same sample DNN is also implemented on the scalar processor alone, so that the speedup can be measured directly. The system was implemented and tested on a field-programmable gate array (FPGA). The vector implementation outperformed the scalar version by a factor of 3 in latency, with only a negligible increase in the FPGA resources used.
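The abstract does not give the convolution code itself, so the following is only a minimal sketch of the general idea it describes: restructuring a convolution so that the innermost work is a lane-wise multiply-accumulate over a strip of outputs rather than one output at a time. It is a plain Python emulation, not RiscV intrinsic code; the lane count `VLEN_LANES` and the function names `conv1d_scalar` and `conv1d_strip_mined` are hypothetical and chosen for illustration, and a 1D convolution stands in for the DNN layers used in the paper.

```python
# Illustrative sketch only: VLEN_LANES and both function names are
# hypothetical and do not come from the paper.

VLEN_LANES = 8  # assumed number of elements processed per vector operation


def conv1d_scalar(signal, kernel):
    """Reference scalar version: computes one output element at a time."""
    n_out = len(signal) - len(kernel) + 1
    out = [0.0] * n_out
    for i in range(n_out):
        acc = 0.0
        for k, w in enumerate(kernel):
            acc += w * signal[i + k]
        out[i] = acc
    return out


def conv1d_strip_mined(signal, kernel, lanes=VLEN_LANES):
    """Same result, restructured as strips of up to `lanes` outputs."""
    n_out = len(signal) - len(kernel) + 1
    out = [0.0] * n_out
    start = 0
    while start < n_out:
        # Active vector length for this strip (the last strip may be shorter).
        vl = min(lanes, n_out - start)
        # One accumulator "register" holding vl partial sums.
        acc = [0.0] * vl
        for k, w in enumerate(kernel):
            # Contiguous "vector load" of vl inputs, then a
            # scalar-times-vector multiply-accumulate across all lanes.
            window = signal[start + k:start + k + vl]
            acc = [a + w * x for a, x in zip(acc, window)]
        # "Vector store" of the finished strip.
        out[start:start + vl] = acc
        start += vl
    return out
```

On vector hardware, the lane-wise update inside the kernel loop is the part that would typically map to a single vector multiply-accumulate instruction, which is where a latency gain of the kind reported in the abstract could come from. This emulation shows only the loop structure and makes no claim about the measured speedup.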