0% Complete
صفحه اصلی
/
هفتمین کنفرانس بین المللی میکروالکترونیک ایران
FPGA-Based CNN Accelerator with High Computing Resource Utilization
نویسندگان :
Raziyeh Foroumandi
1
Behbood Mashoufi
2
Amir Fathi
3
1- دانشگاه ارومیه
2- دانشگاه ارومیه
3- دانشگاه ارومیه
کلمات کلیدی :
Convolutional neural networks (CNNs)،FPGA-based accelerator،parallel computing
چکیده :
The rapid advancement of Convolutional Neural Networks (CNNs) has created a growing demand for hardware accelerators capable of performing CNN inference efficiently. FPGA-based CNN accelerators are particularly attractive due to their high performance, low power consumption, and inherent reconfigurability. This work presents an FPGA-based CNN accelerator employing a multi-computing engine architecture for convolution operations to enhance computational efficiency and achieve high throughput. The design exploits multiple levels of parallelism with optimized parallelism parameters, a data reordering unit to ensure continuous data delivery to the Processing Element (PE) array without idle cycles, and an optimized buffer structure to maximize computing resource utilization. The proposed accelerator was evaluated on the Xilinx XC7VX690T FPGA using VGG16 benchmark. Results show computing efficiency of 98.92%, outperforming existing FPGA-based CNN accelerators.
لیست مقالات
لیست مقالات بایگانی شده
Graphene-Based Ring Resonator Ammonia Sensor: Design Optimization for Maximum Sensitivity and Q-Factor
Raheleh Masoumi - Manouchehr Bahrami
Efficiency enhancement of tin-based perovskite solar cell with carbon back-contact using cubic and pyramid metallic nano-particles: numerical investigation
Amir Hossein Mohammadian Fard - Samiye Matloub
A 2x1 Bit Multiplier Based on Vibrating Microelectromechanical Resonators
ALI DELVAR - Farshad Babazadeh
Design of long signal path Ternary computational blocks using Dynamic and Pass Transistor Logic based on Carbon Nanotube Field Effect Transistors
Farzin Mahboob Sardroudi - Mehdi Habibi - Mohammad Hossein Moaiyeri
یک موتور محاسباتی آنالوگ برای شبکههای عصبی کانولوشنال با بهرهگیری از تکنیک Gm-Scaling
عرفان بستانچی - مرتضی موسی زاده
طراحی سیستماتیک موجبر فوتونی مبتنی بر سیلیکون نیترید در محدوده نور مرئی
افشین احمدپور - امیر حبیب زاده شریف - فائزه بهرامی چناقلو
An Accurate 10-bit 1 kS/s Charge-Redistribution SAR ADC for Sensor Readout Applications
Farshad Gozalpour - Sepehr Zare Teimoori - Kamyab Karimi Sarableh - Rasoul Fathipour - Mohsen Tamaddon
A 0.9-8 GHz Highly Linear SAW-Less Direct-Conversion Receiver Front-End for 5G Communication Standard
Erfan Salighe - Mortaza Mojarad
Broadband All-Dielectric Metasurface Absorber For VLC Applications
Ershad Sharifi - Mohammad Razaghi - Keyhan Hosseini
An Asynchronous Strategy for Efficient Audio Processing for Better Perception in Cochlear Implants Based on Peak and Trough Detection
Amin Armin - Mohammad Yavari
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.5.4