For resource-constrained on-device AI environments, system efficiency is the key factor for successful deployment (Fig. 1): area efficiency (TOPS/mm²), to run the user-demanded AI inference task while keeping the device at palm scale, and power efficiency (TOPS/W), to maximize application usage on a limited battery. To save resources, network precision is quantized, and sparsity-exploitable ternary inputs/weights show the best outcome: the hardware demonstration in [1] achieves 82% lower energy per inference than binary. In terms of system architecture, analog designs generally achieve high efficiency, since custom bitcells are compactly integrated and perform multiply-and-accumulate (MAC) operations in parallel with low energy consumption. However, two challenges degrade this efficiency gain. 1) A bitcell requires an additional latch to store a ternary weight, which reduces macro area efficiency. 2) In analog compute-in-memory (CIM), increasing the number of MACs worsens the energy and signal margin, which degrades operation accuracy and efficiency. Furthermore, analog-to-digital converters (ADCs) occupy 41% of macro area [2] and 72% of latency [3], so minimal ADC overhead is crucial for analog CIM system efficiency. In this paper, we present a highly resource-efficient ternary-latch-based CIM macro featuring: 1) a refined single-latch ternary input/weight cell enabled by 28 nm ternary-CMOS (T-CMOS) technology; and 2) analog CIM performance enhancement through energy-efficient, signal-margin-improved MAC operation with a double-sampling ternary ADC.
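To make the sparsity argument concrete, the following is a minimal behavioral sketch (illustrative only, not the macro's circuit implementation) of ternary quantization to {−1, 0, +1} and a zero-skipping MAC; the `threshold` parameter and function names are assumptions for illustration. Every zero input or weight yields a zero product, which a ternary CIM macro can skip without spending switching energy:

```python
import numpy as np

def ternarize(x, threshold=0.5):
    """Quantize real values to {-1, 0, +1}; zeros create exploitable sparsity.
    `threshold` is a hypothetical tuning parameter, not from the paper."""
    q = np.zeros_like(x, dtype=np.int8)
    q[x > threshold] = 1
    q[x < -threshold] = -1
    return q

def sparse_ternary_mac(inputs, weights):
    """Accumulate products of ternary operands, skipping zero products
    (the operations an energy-efficient CIM macro need not perform).
    Returns (accumulated sum, number of nonzero products)."""
    acc, ops = 0, 0
    for a, w in zip(inputs, weights):
        if a == 0 or w == 0:
            continue  # zero product: no MAC energy spent
        acc += int(a) * int(w)
        ops += 1
    return acc, ops

# Example: half of the products vanish and are skipped.
x = ternarize(np.array([0.9, -0.1, -0.8, 0.2]))   # -> [ 1,  0, -1,  0]
w = ternarize(np.array([-0.7, 0.6, 0.9, 0.0]))    # -> [-1,  1,  1,  0]
result, nonzero_ops = sparse_ternary_mac(x, w)    # -> (-2, 2)
```

Here only 2 of 4 products are nonzero, so half of the MAC work is avoided; this per-product skipping is the behavioral analogue of the energy saving ternary operands offer over binary ones.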