발행물

전체 논문

48

31

GPU-Friendly Parallel Genome Matching with Tiled Access and Reduced State Transition Table
Yunho Oh, Doohwan Oh, Won W. Ro
International Journal of Parallel Programming, 2013.08

32

TLP Balancer: Predictive Thread Allocation for Multi-Tenant Inference in Embedded GPUs
오윤호
IEEE EMBEDDED SYSTEMS LETTERS, 202506

33

Beyond VABlock: Improving Transformer workloads through aggressive prefetching
오윤호
JOURNAL OF SYSTEMS ARCHITECTURE, 202505

34

TM-Training: An Energy-Efficient Tiered Memory System for Deep Learning Training in NPUs
오윤호
ACM TRANSACTIONS ON STORAGE, 202503

35

Adaptive Kernel Merge and Fusion for Multi-Tenant Inference in Embedded GPUs
오윤호
IEEE EMBEDDED SYSTEMS LETTERS, 202412

36

Conflict-aware compiler for hierarchical register file on GPUs
오윤호
JOURNAL OF SYSTEMS ARCHITECTURE, 202404

37

SAVector: Vectored Systolic Arrays
오윤호
IEEE ACCESS, 202403

38

Scale-out Systolic Arrays
오윤호
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 202306

39

GhostLeg: Selective Memory Coalescing for Secure GPU Architecture
오윤호
IEEE ACCESS, 202210

40

Analyzing GCN Aggregation on GPU
오윤호
IEEE ACCESS, 202210