
FPGA Acceleration of Convolutional Neural Networks (CNNs)
White Paper FPGA Acceleration of Convolutional Neural Networks Overview Convolutional Neural Networks (CNNs) have been shown to be extremely effective at complex image recognition problems.
FPGAs offer high performance, workload flexibility and energy-efficient operation for a range of HPC applications.
The FPGA value proposition for HPC has strengthened significantly in recent years.
These are key advantages emerge as demonstrated in our BWNN white paper:
Working alongside CPUs, FPGAs provide part of a heterogeneous approach to computing. For certain workloads, FPGAs provide significant speedup versus CPU—in this case 50x faster for machine learning inference.
FPGAs have a range of tools to best tailor to the application. The hardware fabric adapts to use only what’s needed, including hardened floating-point blocks when required. For BWNN’s weights, we used only a single bit, plus mean scaling factor, and still achieved acceptable accuracy but saving significant resources.
Power per watt is not only important at the edge, it’s in the power budget of datacenters in both space and cost of power. FPGAs can uniquely deliver the latest efficient libraries yet at far lower power per watt than CPUs.
With BittWare’s exclusive optimized OpenCL BSP, you’re able to both tap into software-orientated developers and the latest software libraries. This allowed us to quickly adapt the YOLOv3 framework, which has improved performance over older ML libraries.
We target applications when demand to process storage outpaces traditional architectures featuring CPUs.
FPGAs allow customers to create application-specific hardware implementations that exhibit the following properties:
Get answers to your HPC questions from our technical staff.
"*" indicates required fields
White Paper FPGA Acceleration of Convolutional Neural Networks Overview Convolutional Neural Networks (CNNs) have been shown to be extremely effective at complex image recognition problems.
IA-440i 400G + PCIe Gen5 Single-Width Card Compact 400G Card with the Power of Agilex The Intel Agilex 7 I-Series FPGAs are optimized for applications
Go Back to IP & Solutions RDMA Low-Latency RoCE v2 at 100Gbps The GROVF RDMA IP core and host drivers provide RDMA over Converged Ethernet
PCIe FPGA Card 520R-MX Stratix 10 FPGA Board with HBM2 and 480Gbps Optical Input Optimized for sensor processing applications with massive real-time data ingest requirements