0% Complete
صفحه اصلی
/
ششمین کنفرانس بین المللی میکروالکترونیک ایران
Comparison between Hardware/Software Co-design of RiscV Vector and Scalar Implementation of Deep Neural Networks
نویسندگان :
Seyed Kian Mousavikia
1
Morteza Mousazadeh
2
1- دانشگاه ارومیه
2- دانشگاه ارومیه
کلمات کلیدی :
Deep Neural Networks،Field Programmable Gate Array،Hardware/Software Co-Design،Parallel Processing،RiscV،Vector Co-processor
چکیده :
This paper compares a hardware/software co-design of a RiscV vector with a RiscV scalar implementation of a deep neural network (DNN). For the vector implementation, all building blocks of a DNN are vectorized and written in vector intrinsic coding format. Focusing more on the convolution function as the main source of the latency, this function is written in a special parallel processing-favor method in the vector intrinsic level to boost execution speed. For the comparison, a sample scalar RiscV core is selected and paired with a vector-based RiscV co-processor. Also, the same sample DNN is implemented only on the scalar processor to demonstrate the speedup better. The system was implemented and tested on a field-programmable gate array (FPGA). As a result, the vector implementation outperformed the scalar version by a factor of 3 in terms of latency by only negligibly increasing the utilized sources on the FPGA.
لیست مقالات
لیست مقالات بایگانی شده
Frequency Response and Design Based on gm/ID of Amplifier in CNFET Technology
S. Mohammadali Zanjani - Mehdi Dolatshahi - Massoud Dousti - Zahra Alaie - Ata Jahangir Moshayedi - Arash Mehrabi
یک مبدل آنالوگ به دیجیتال مبتنی بر مدولاتور سیگما دلتا برای کاربردهای مهندسی - پزشکی با ENOB = 13. 2 bits ، پهنای باند 10 kHz و توان مصرفی 16.9 µ W
علی صداقت - حسین پاک نیت - نوید یثربی
Hybrid ECG Signal Denoising Using Wavelet Transform and Adaptive Notch Filtering
Hossein Kodoori - Mehrnaz Monajati
A Low-Power Inductor-Less Linear Wideband CMOS Balun-LNA Using Current Reuse And Linearity Techniques
Soroush Hashemi Bani - Mohammad Yavari
Quantitative study of Temperature-Dependent BandGap Energy and Its Influence on Threshold Potential Characteristics of MOS Devices
Hanieh Khakvatan - Sarang Kazeminia
Low-Overhead Behavioral Locking for Security of Analog and AMS Integrated Circuits
Paria Farajzadeh - Samad Sheikhaei
طراحی شمارنده بالا پایین شمار سنکرون 8 بیتی بسیار سریع مبتنی بر شمارش در دولبه پالس ساعت با استفاده از ترانزیستورهای نانو لوله کربنی32 نانومتر
جواد جاویدان
طراحی ضرب کننده تقریبی کم مصرف و سریع برای کاربردهای پردازش تصویر
شیوا محمدعلیپوری - وحید جمشیدی
Design and Implementation of novel Microgrid Inverter
Amirhasan Sobhi - Alireza Zabihi - Sina Chartabi - Mina Salim
Graphene-Based Ring Resonator Ammonia Sensor: Design Optimization for Maximum Sensitivity and Q-Factor
Raheleh Masoumi - Manouchehr Bahrami
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.6.0