발행물

전체 논문

21

1

LATPC: Accelerating GPU Address Translation Using Locality-Aware TLB Prefetching and MSHR Compression
Y. Ha et al.
1970

2

Achieving Fast and Accurate GPU Performance Analyses Using Fine-Grained Stall Cycle Accounting and Interval Analysis
H. Cha et al.
1970

3

IntervalSim++: Enhanced Interval Simulation for Unbalanced Processor Designs
H. Bong et al.
1970

4

GCStack: A GPU Cycle Accounting Mechanism for Providing Accurate Insight into GPU Performance
H. Cha et al.
1970

5

A Fast and Flexible FPGA-based Accelerator for Natural Language Processing Neural Networks
Suyeon Hur, Seongmin Na, Dongup Kwon, Joonsung Kim, Eriko Nurvitadhi, Andrew Boutros, Jangwoo Kim
ACM Transactions on Architecture and Code Optimization (TACO), 2023.02

6

3D-FPIM: An Extreme Energy-Efficient DNN Acceleration System Using 3D NAND Flash-Based In-Situ PIM Unit
Hunjun Lee, Minseop Kim, Dongmoon Min, Joonsung Kim, Jongwon Back, Honam Yoo, Jongho Lee, Jangwoo Kim
Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture (MICRO), 2022.10

7

LSim: Fine-Grained Simulation Framework for Large-Scale Performance Evaluation
Hamin Jang, Taehun Kang, Joonsung Kim, Jaeyong Cho, Jae-Eon Jo, Seungwook Lee, Wooseok Chang, Jangwoo Kim, Hanhwi Jang
IEEE Computer Architecture Letters (CAL), 2022.04

8

UC-Check: Characterizing Micro-operation Caches in x86 Processors and Implications in Security and Performance
*Joonsung Kim, *Hamin Jang, Hunjun Lee, Seungho Lee, Jangwoo Kim
Proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture (MICRO), 2021.10

9

NLP-Fast: A Fast, Scalable, and Flexible System to Accelerate Large-Scale Heterogeneous NLP Models
Joonsung Kim, Suyeon Hur, Eunbok Lee, Seungho Lee, Jangwoo Kim
Proceedings of the 30th International Conference on Parallel Architectures and Compilation Techniques (PACT), 2021.09

10

Performance Modeling and Practical Use Cases for Black-Box SSDs
Joonsung Kim, Kanghyun Choi, Wonsik Lee, Jangwoo Kim
ACM Transactions on Storage (TOS), 2021.06