multi-operator AI acceleration

EMO-CIM: An Input/Stationary-Data Similarity-Aware Computing-In-Memory Design for Variable Vector-Wise Computation in Edge Multi-Operator AI Acceleration

EMO-CIM: An Input/Stationary-Data Similarity-Aware Computing-In-Memory Design for Variable Vector-Wise Computation in Edge Multi-Operator AI Acceleration 150 150

Abstract:

We propose an edge multi-operator computing-in-memory design (EMO-CIM) that supports variable vector-wise multiply-and-accumulate (MAC) in CNN, Depthwise-Convolution (DW), and Attention operators. It features: (1) A single EMO-CIM bank excels in variable vector-wise MAC for multi-operators; (2) Merging local input-shared compute units with a decode-unit & adder-tree facilitates input/stationary-data similarity-aware computing to improve …

View on IEEE Xplore