The 43rd IEEE International Conference on Computer Design (ICCD 2025)
Throughput-Oriented LLM Inference via KV-Activation Hybrid Caching with a Single GPU
2025
International Conference on ICT Convergence 2025
Improving Multi-tenant NPU Efficiency via Decoupled Tiling and Adaptive Memory Allocation
2025
52nd Annual International Symposium on Computer Architecture, ISCA 2025
Unified Memory Protection with Multi-granular MAC and Integrity Tree for Heterogeneous Processors
2025
The 42nd IEEE International Conference on Computer Design (ICCD 2024)
Interference-Aware DNN Serving on Heterogeneous Processors in Edge Systems
2024
51st ACM/IEEE Annual International Symposium on Computer Architecture, ISCA 2024
pSyncPIM: Partially Synchronous Execution of Sparse Matrix Operations for All-Bank PIM Architectures
2024