Publications

All Papers

70

11

Mamba-X: An End-to-End Vision Mamba Accelerator for Edge Computing Devices
Dongho Yoon, Gungyu Lee, Jaewon Chang, Yunjae Lee, Dongjae Lee, Minsoo Rhu
2025

12

Debunking the CUDA Myth Towards GPU-based AI Systems
Yunjae Lee*, Juntaek Lim*, Jehyeon Bang, Eunyeong Cho, Huijong Jeong, Taesu Kim, Hyungjun Kim, Joonhyung Lee, Jinseop Im, Ranggi Hwang, Se Jung Kwon, Dongsoo Lee, Minsoo Rhu
2025

13

Uncovering Real GPU NoC Characteristics: Implications on Interconnect Architecture
Zhixian Jin, Christopher Rocca, Jiho Kim, Hans Kasan, Minsoo Rhu, Ali Bakhoda, Tor Aamodt, John Kim
2024

14

PIM-MMU: A Memory Management Unit for Accelerating Data Transfers in Commercial PIM Systems
Dongjae Lee, Bongjoon Hyun, Taehun Kim, Minsoo Rhu
2024

15

vTrain: A Simulation Framework for Evaluating Cost-effective and Compute-optimal Large Language Model Training
Jehyeon Bang, Yujeong Choi, Myeongwoo Kim, Yongdeok Kim, Minsoo Rhu
2024

16

Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference
Ranggi Hwang*, Jianyu Wei*, Shijie Cao, Changho Hwang, Xiaohu Tang, Ting Cao, Mao Yang
2024

17

PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models
Yunjae Lee*, Hyeseong Kim*, Minsoo Rhu
2024

18

ElasticRec: A Microservice-based Model Serving Architecture Enabling Elastic Resource Scaling for Recommendation Models
Yujeong Choi, Jiin Kim, Minsoo Rhu
2024

19

LazyDP: Co-Designing Algorithm-Software for Scalable Training of Differentially Private Recommendation Models
Juntaek Lim, Youngeun Kwon, Ranggi Hwang, Kiwan Maeng, Edward Suh, Minsoo Rhu
2024

20

GPU-based Private Information Retrieval for On-Device Machine Learning Inference
Maximilian Lam, Jeff Johnson, Wenjie Xiong, Kiwan Maeng, Udit Gupta, Yang Li, Liangzhen Lai, Minsoo Rhu, Hsien-Hsin S. Lee, Vijay Janapa Reddi, Gu-Yeon Wei, David Brooks, Edward Suh
2024