Simba: Scaling Deep-Learning Inference with Chiplet-Based Architecture
This work investigates and quantifies the costs and benefits of using multi-chip-modules with fine-grained chiplets for deep learning inference, an application domain with large compute and on-chip storage requirements.