Abstract:
We propose an edge multi-operator computing-in-memory design (EMO-CIM) that supports variable vector-wise multiply-and-accumulate (MAC) in CNN, Depthwise-Convolution (DW), and Attention operators. It features: (1) A single EMO-CIM bank excels in variable vector-wise MAC for multi-operators; (2) Merging local input-shared compute units with a decode-unit & adder-tree facilitates input/stationary-data similarity-aware computing to improve …