0% Complete
صفحه اصلی
/
هفتمین کنفرانس بین المللی میکروالکترونیک ایران
FPGA-Based CNN Accelerator with High Computing Resource Utilization
نویسندگان :
Raziyeh Foroumandi
1
Behbood Mashoufi
2
Amir Fathi
3
1- دانشگاه ارومیه
2- دانشگاه ارومیه
3- دانشگاه ارومیه
کلمات کلیدی :
Convolutional neural networks (CNNs)،FPGA-based accelerator،parallel computing
چکیده :
The rapid advancement of Convolutional Neural Networks (CNNs) has created a growing demand for hardware accelerators capable of performing CNN inference efficiently. FPGA-based CNN accelerators are particularly attractive due to their high performance, low power consumption, and inherent reconfigurability. This work presents an FPGA-based CNN accelerator employing a multi-computing engine architecture for convolution operations to enhance computational efficiency and achieve high throughput. The design exploits multiple levels of parallelism with optimized parallelism parameters, a data reordering unit to ensure continuous data delivery to the Processing Element (PE) array without idle cycles, and an optimized buffer structure to maximize computing resource utilization. The proposed accelerator was evaluated on the Xilinx XC7VX690T FPGA using VGG16 benchmark. Results show computing efficiency of 98.92%, outperforming existing FPGA-based CNN accelerators.
لیست مقالات
لیست مقالات بایگانی شده
Electro-Thermal Analysis of VCSELs with Multi- Mesa Structures Using 3D Self-Consistent Simulations
Hassan Hooshdar Rostami - Vahid Ahmadi - Saeed Pahlavan
Role of Doping Concentration of n- and p-Strip Regions on Optoelectronical Characterization in IBC-SHJ Solar Cell
Pegah Paknazar - Maryam Shakiba
An Integrated Wearable Bio-Impedance Spectroscopy System for Remote Monitoring Heart Failure in 65nm CMOS Technology
Arman Ghouchani - Mohammad Sharifkhani
بررسی عددی انتقال حرارت و جریان سیال در مبدل حرارتی مبتنی بر ریز کانالهای سامانه میکرو الکترومکانیکی
رسول عدلی بیله سوار - فرهاد صادق مغانلو - محمد وجدی حکم آباد
Clusters of Cubic Plasmonic Nanoparticles for Improved Efficiency in Bifacial Perovskite Solar Cells
Amir Hossein Mohammadian Fard - ُSamiye Matloub
Six-Band Frequency Full Absorber Based on the Heterogeneous Structure of Graphene Metamaterial
Yousef Rafighirani - Javad Javidan - Hamid Heidarzadeh
Unified Modeling Framework for Dynamic Analysis of a MEMS Resonant Biosensor
Ali Selk Ghafari
Impact of Geometrical and Process Design Parameters on the Performance of Schottky Barrier Reconfigurable Field Effect Transistor
Hamid Reza Heydari - Zahra Ahangari - Hamed Nematian - Kian Ebrahim Kafoori
طراحی آنتن پچ تراهرتز قابل تنظیم با استفاده از سوییچهای گرافنی برای کاربردهای گسترده فرکانسی
امیر امینی - موسی عبداله وند یاجلو - مهدی نوشیار
A runtime reconfigurable exact-approximate full-adder design
Keihan Naseri - Hadi Jahanirad
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.4.0