Vision Transformers have been applied across a wide range of domains, but their lack of inductive bias limits training stability and performance on small-scale datasets. While previous studies have addressed this issue through structural modifications, this study proposes a normalization method that reduces the self-similarity values in self-attention, thereby strengthening token-to-token interactions. The approach introduces an inductive bias without altering the architecture or increasing computational complexity. Experiments on the CIFAR-10 dataset show that the proposed method improves final validation accuracy (raw) by 2.62% and lowers the loss, thereby enhancing training stability.
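The abstract does not specify the exact normalization, so the following is only a minimal NumPy sketch of one plausible instantiation: penalizing the diagonal (self-similarity) logits of the attention matrix before the softmax, which shifts attention mass from each token onto the other tokens. The function name `attention` and the parameter `self_penalty` are hypothetical illustrations, not the paper's method.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, k, v, self_penalty=0.0):
    """Scaled dot-product attention with an optional penalty on the
    diagonal (self-similarity) logits; self_penalty > 0 reduces each
    token's attention to itself, boosting token-to-token interaction."""
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)                     # (n, n) similarity logits
    scores -= self_penalty * np.eye(scores.shape[0])  # damp self-similarity
    w = softmax(scores, axis=-1)                      # rows sum to 1
    return w @ v, w

rng = np.random.default_rng(0)
n, d = 4, 8
q = k = v = rng.normal(size=(n, d))

_, w_base = attention(q, k, v, self_penalty=0.0)
_, w_norm = attention(q, k, v, self_penalty=2.0)

# Mean diagonal mass (self-attention) drops under the penalty.
print(np.trace(w_base) / n, np.trace(w_norm) / n)
```

In this sketch the mean of the attention matrix's diagonal strictly decreases for any positive penalty, while each row still sums to one, so the removed self-attention mass is redistributed to other tokens.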