Publications
Conference
25th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-25)
2019
NeuMMU: Architectural Support for Efficient Address Translations in NPUs
26th IEEE International Symposium on High-Performance Computer Architecture (HPCA-26)
PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible Neural Processing Units
52nd IEEE/ACM International Symposium on Microarchitecture (MICRO-52)
TensorDIMM: A Practical Near-Memory Processing Architecture for Embeddings and Tensor Operations in Deep Learning
51st IEEE/ACM International Symposium on Microarchitecture (MICRO-51)
2018
Beyond the Memory Wall: A Case for Memory-centric HPC System Architecture for Training Deep Neural Networks
24th IEEE International Symposium on High-Performance Computer Architecture (HPCA-24)
2017
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks