Energy-Efficient Convolutional Neural Network (CNN) Implementation in Verilog

This repository contains a Verilog implementation of an energy-efficient Convolutional Neural Network (CNN) designed for FPGA deployment. The design focuses on minimizing power consumption while maintaining functionality for image classification tasks.

Introduction

Convolutional Neural Networks (CNNs) are a class of deep learning models commonly used for image recognition and classification tasks. While powerful, CNNs can be computationally intensive and energy-consuming, especially when deployed on edge devices or FPGAs (Field-Programmable Gate Arrays).

This project demonstrates an energy-efficient implementation of a CNN in Verilog, suitable for FPGA deployment. The design incorporates various techniques to reduce power consumption without significantly compromising performance.

Project Structure

The repository contains the following key files:

energy_efficient_cnn.v: Main Verilog module implementing the CNN
energy_efficient_cnn_tb.v: Testbench for simulating and verifying the CNN implementation
README.md: This file, providing detailed explanation of the project

CNN Architecture

The implemented CNN has a simplified architecture consisting of the following layers:

Input Layer: Accepts an 8x8 binary image (64 bits)
Quantization Layer: Converts binary input to 8-bit quantized values
Convolutional Layer: Applies 4 filters of size 3x3
Pooling Layer: Performs max pooling with a 2x2 window
Fully Connected Layer: Produces the final classification output

This architecture is a simplified version of a typical CNN and is designed for educational purposes and to demonstrate energy-efficient techniques.

Energy Efficiency Techniques

The implementation incorporates several techniques to reduce energy consumption:

Clock Gating: Selectively disables clock signals to unused modules, reducing dynamic power consumption.
Power Gating: Completely shuts off power to inactive portions of the circuit, minimizing static power consumption.
Quantization: Uses 8-bit fixed-point representation instead of floating-point, reducing computational complexity and memory requirements.
Simplified Depthwise Separable Convolution: A more efficient convolution technique that requires fewer parameters and computations compared to standard convolutions.

Verilog Implementation Details

The main module energy_efficient_cnn is implemented with the following key components:

State Machine: Controls the flow of data through different stages of the CNN (IDLE, QUANTIZE, CONV1, POOL, FC, DONE).
Clock and Power Gating Signals: Separate enable signals for clock and power gating of each major component.
Parameterized Design: Uses Verilog parameters for easy configuration of network size and structure.
Quantization Logic: Converts binary input to 8-bit representation.
Convolution and Pooling Operations: Implemented as simplified versions for demonstration purposes.
Fully Connected Layer: A basic implementation that produces the final classification output.

Simulation and Testing

The testbench energy_efficient_cnn_tb.v provides a simulation environment for the CNN:

Generates a 100MHz clock signal
Provides two test cases: a simple image and an alternating pattern
Initiates the CNN processing and waits for completion
Displays the classification results

To run the simulation:

Ensure you have Icarus Verilog installed
Compile the design: iverilog -o cnn_sim energy_efficient_cnn.v energy_efficient_cnn_tb.v
Run the simulation: vvp cnn_sim
(Optional) Generate and view waveforms: Uncomment relevant lines in the testbench and use GTKWave

Waveform Analysis

The waveform output from the simulation provides insights into the CNN's operation:

Clock Signal: The topmost regular signal driving the circuit
State Transitions: Visible changes in state signals corresponding to different CNN stages
Enable Signals: Toggling of clock and power enable signals demonstrating gating techniques
Data Signals: Horizontal lines representing data flow through different layers
Classification Output: Final output signal indicating the classified result
Done Signal: Indicates completion of processing for each input image

Analyzing these waveforms helps in understanding the timing and behavior of different components in the CNN.

Future Improvements

While this implementation demonstrates key concepts, several improvements could enhance its functionality and efficiency:

Implement actual convolution and pooling operations instead of simplified versions
Add support for larger input images and more complex network architectures
Implement more sophisticated quantization techniques
Explore dynamic voltage and frequency scaling (DVFS) for additional power savings
Optimize memory access patterns for improved efficiency
Implement pruning techniques to reduce network size and complexity

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Energy-Efficient Convolutional Neural Network (CNN) Implementation in Verilog

Table of Contents

Introduction

Project Structure

CNN Architecture

Energy Efficiency Techniques

Verilog Implementation Details

Simulation and Testing

Waveform Analysis

Future Improvements

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
images		images
README.md		README.md
cnn_sim		cnn_sim
energy_efficient_cnn.v		energy_efficient_cnn.v
energy_efficient_cnn_tb.v		energy_efficient_cnn_tb.v
moreinfo.md		moreinfo.md
waveform.vcd		waveform.vcd

heysouravv/energy-efficient-cnn-fpga

Folders and files

Latest commit

History

Repository files navigation

Energy-Efficient Convolutional Neural Network (CNN) Implementation in Verilog

Table of Contents

Introduction

Project Structure

CNN Architecture

Energy Efficiency Techniques

Verilog Implementation Details

Simulation and Testing

Waveform Analysis

Future Improvements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages