Importance Analysis for Dynamic Control of Balancing Parameter in a Simple Knowledge Distillation Setting | Prof. Kwanho Kim's Lab | Dongguk University Main Campus (Campus 1), Department of Industrial and Systems Engineering

preprint | green · Citations: 0 · 2025

Importance Analysis for Dynamic Control of Balancing Parameter in a Simple Knowledge Distillation Setting

Seongmin Kim, Kwanho Kim, Mingi Kim, Kang-Hyun Jo

ArXiv.org

Abstract

Although deep learning models owe their remarkable success to deep and complex architectures, this very complexity typically comes at the expense of real-time performance. To address this issue, a variety of model compression techniques have been proposed, among which knowledge distillation (KD) stands out for its strong empirical performance. KD comprises two concurrent processes: (i) matching the outputs of a large, pre-trained teacher network and a lightweight student network, and (ii) training the student to solve its designated downstream task. The associated loss functions are termed the distillation loss and the downstream-task loss, respectively. Numerous prior studies report that KD is most effective when the influence of the distillation loss outweighs that of the downstream-task loss. This influence (or importance) is typically regulated by a balancing parameter. This paper provides a mathematical rationale showing that, in a simple KD setting, the balancing parameter should be dynamically adjusted when the loss is decreasing.
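
The abstract does not give the loss formula, so the following is only a minimal sketch of how a balancing parameter commonly weights the two losses in KD. It assumes a convex combination, total = alpha * distillation + (1 - alpha) * task, with KL divergence as the distillation loss and cross-entropy as the downstream-task loss. The names kd_loss, alpha, and temperature are illustrative and are not taken from the paper, and the paper's own rule for adjusting the parameter is not reproduced here.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, label):
    """Downstream-task loss: negative log-probability of the true class."""
    return -math.log(probs[label])


def kl_divergence(p, q):
    """KL(p || q) between two discrete distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def kd_loss(student_logits, teacher_logits, label, alpha, temperature=1.0):
    """Assumed form: alpha * distillation + (1 - alpha) * task.

    alpha is the balancing parameter (hypothetical name). It is passed
    per call, so a training loop could change it from step to step.
    """
    task = cross_entropy(softmax(student_logits), label)
    distill = kl_divergence(
        softmax(teacher_logits, temperature),
        softmax(student_logits, temperature),
    )
    total = alpha * distill + (1.0 - alpha) * task
    return total, distill, task


if __name__ == "__main__":
    student = [1.0, 2.0, 0.5]
    teacher = [1.5, 2.5, 0.0]
    for alpha in (0.1, 0.5, 0.9):
        total, distill, task = kd_loss(student, teacher, label=1, alpha=alpha)
        print(f"alpha={alpha}: total={total:.4f} distill={distill:.4f} task={task:.4f}")
```

Because alpha is an argument rather than a constant, a fixed setting and a dynamically adjusted one differ only in what the training loop passes in at each step.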

Keywords

Distillation, Variety (cybernetics), Matching (statistics), Simple (philosophy), Control (management)

Type

preprint

IF / Citations

- / 0

Source

http://arxiv.org/abs/2505.06270

Publication year

2025
