Publications
Conferences
The 47th IEEE/ACM International Symposium on Computer Architecture (ISCA-47)
2020
Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations
The 25th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-25)
NeuMMU: Architectural Support for Efficient Address Translations in NPUs
The 26th IEEE International Symposium on High-Performance Computer Architecture (HPCA-26)
PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible Neural Processing Units
The 52nd IEEE/ACM International Symposium on Microarchitecture (MICRO-52)
2019
TensorDIMM: A Practical Near-Memory Processing Architecture for Embeddings and Tensor Operations in Deep Learning
The 51st IEEE/ACM International Symposium on Microarchitecture (MICRO-51)
2018
Beyond the Memory Wall: A Case for Memory-centric HPC System for Deep Learning