Oct. 28, 2023, 6:17 a.m. | /u/ExcitingInternet6083

Machine Learning www.reddit.com

(I posted this in several subreddits.)
I'm going to design an NN accelerator on an FPGA. For an NN, the basic operation is matrix-vector or matrix-matrix multiplication. `GEMM+im2col` is easy to implement, and many kinds of NN can be mapped easily onto an accelerator built around `GEMM+im2col`. Its disadvantage is that it needs somewhat more bandwidth. I think it is a little tricky to design the address-generation unit for regular convolution when reading input …
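To make the `GEMM+im2col` idea concrete, here is a minimal software sketch in plain Python. It is only an illustration, not the accelerator design: it assumes stride 1, no padding, and a C x H x W input stored as nested lists, and the names `im2col`, `gemm`, `conv_im2col`, and `conv_direct` are made up for this example.

```python
def im2col(x, kh, kw):
    """Unfold a C x H x W input into a (C*kh*kw) x (out_h*out_w) matrix.

    Assumes stride 1 and no padding (assumptions for this sketch).
    """
    c, h, w = len(x), len(x[0]), len(x[0][0])
    out_h, out_w = h - kh + 1, w - kw + 1
    cols = []
    for ch in range(c):
        for i in range(kh):
            for j in range(kw):
                # One matrix row per (channel, kernel row, kernel column) tap.
                cols.append([x[ch][oy + i][ox + j]
                             for oy in range(out_h)
                             for ox in range(out_w)])
    return cols, out_h, out_w


def gemm(a, b):
    """Plain matrix multiply: (M x K) times (K x N)."""
    return [[sum(row[k] * b[k][n] for k in range(len(b)))
             for n in range(len(b[0]))]
            for row in a]


def conv_im2col(x, weights, kh, kw):
    """Convolution as one GEMM. weights is a list of M filters, each C x kh x kw."""
    cols, out_h, out_w = im2col(x, kh, kw)
    # Flatten each filter in the same (channel, row, column) order as im2col.
    w_mat = [[v for ch in f for row in ch for v in row] for f in weights]
    y = gemm(w_mat, cols)
    # Reshape each output row back into out_h x out_w.
    return [[r[oy * out_w:(oy + 1) * out_w] for oy in range(out_h)] for r in y]


def conv_direct(x, weights, kh, kw):
    """Reference: regular sliding-window convolution, same assumptions."""
    c, h, w = len(x), len(x[0]), len(x[0][0])
    out_h, out_w = h - kh + 1, w - kw + 1
    return [[[sum(f[ch][i][j] * x[ch][oy + i][ox + j]
                  for ch in range(c) for i in range(kh) for j in range(kw))
              for ox in range(out_w)]
             for oy in range(out_h)]
            for f in weights]


# Tiny example: one 3x3 input channel, one 2x2 filter.
x = [[[1, 2, 3],
      [4, 5, 6],
      [7, 8, 9]]]
w = [[[[1, 0],
       [0, 1]]]]

y = conv_im2col(x, w, 2, 2)
cols, _, _ = im2col(x, 2, 2)
unfolded_elems = len(cols) * len(cols[0])  # elements the GEMM reads
original_elems = 9                         # elements in the raw input
```

In this toy case the unfolded matrix holds 16 elements against 9 in the raw input, because overlapping windows are duplicated. That duplication is the extra traffic the post refers to, and the ratio grows with kernel size. In exchange, the read pattern for the GEMM is a simple linear walk over `cols`, whereas `conv_direct` needs the nested window addressing that makes the address-generation unit harder.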

