발행물

전체 논문

43

21

A Distributed Signature Detection Method for Detecting Intrusions in Sensor Systems
Ilkyu Kim, Doohwan Oh, Myung Kuk Yoon, Kyueun Yi, Won Woo Ro
Sensors, 2013.03

22

TLP Balancer: Predictive Thread Allocation for Multi-Tenant Inference in Embedded GPUs
윤명국, Minseong Gil, 오윤호, 구건재, Sangun Choi, Junsu Kim, Jaebeom Jeon
IEEE EMBEDDED SYSTEMS LETTERS, 202506

23

Beyond VABlock: Improving Transformer workloads through aggressive prefetching
Rhee Jane, Choi Ikyoung, Koo Gunjae, Oh Yunho, Yoon Myung Kuk
JOURNAL OF SYSTEMS ARCHITECTURE, 202505

24

Marching Page Walks: Batching and Concurrent Page Table Walks for Enhancing GPU Throughput
윤명국, Won Woo Ro, Yunho Oh, Ipoom Jeong, Gun Ko, Jiwon Lee
The 31st International IEEE Symposium on High Performance Computer Architecture (HPCA 2025), 202503

25

Warped-Compaction: Maximizing GPU Register File Bandwidth Utilization via Operand Compaction
윤명국, 김남승, 정이품, 정은비
The 31st International IEEE Symposium on High Performance Computer Architecture (HPCA 2025), 202503

26

DEPrune: Depth-wise Separable Convolution Pruning for Maximizing GPU Parallelism
윤명국, Cheonjun Park, Mincheol Park, Hyunchan Moon, Seokjin Go, Suhyun Kim, Won Woo Ro
The 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024), 202412

27

Adaptive Kernel Merge and Fusion for Multi-Tenant Inference in Embedded GPUs
윤명국, 전재범, 구건재, 오윤호
IEEE EMBEDDED SYSTEMS LETTERS, 202412

28

VitBit: Enhancing Embedded GPU Performance for AI Workloads through Register Operand Packing
윤명국, 오윤호, 구건재, Jaeyong Park, Junsu Kim, Minseong Gil, Jaebeom Jeon
The 53rd International Conference on Parallel Processing (ICPP 2024), 202408

29

Triple-A: Early Operand Collector Allocation for Maximizing GPU Register Bank Utilization
윤명국, 정은비, 김남승, 정이품
IEEE EMBEDDED SYSTEMS LETTERS, 202406

30

Conflict-aware compiler for hierarchical register file on GPUs
윤명국, 구건재, 오윤호, 박은성, 정은비
JOURNAL OF SYSTEMS ARCHITECTURE, 202404