This study proposes an efficient framework for improving the classification performance of Distillation with No Labels (DINO)-based Vision Transformer models in small labeled-dataset settings. The proposed method employs strong multi-crop data augmentation together with visual transformations such as AutoAugment and Gaussian blur to maximize the diversity of receptive fields and local features in the input image. In addition, a gradual unfreezing strategy that keeps the backbone network frozen for the first 1 to 3 epochs and trains only the Multi-Layer Perceptron (MLP) classification head mitigates overfitting during fine-tuning while preserving the stable, generalized visual representations learned during pre-training. A lightweight MLP head, the AdamW optimizer, and cosine annealing learning-rate scheduling are also applied to improve both training stability and convergence speed. In extensive experiments on the STL-10 dataset, the proposed framework achieved significant improvements in Top-1 accuracy, K-Nearest Neighbors (KNN) accuracy, and linear-probe results over existing DINO models, together with more than 50% faster inference. Although the MLP head increases the parameter count, overall execution efficiency was nonetheless improved thanks to parallel-processing optimization and the adjusted training schedule. This work advances the transfer-learning efficiency and versatility of the self-supervised DINO Transformer model, substantially broadening its potential for field applications. Future research aims to improve generalization and practical applicability by optimizing learning rates, diversifying augmentation policies, and automating backbone unfreezing.
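The two scheduling ideas in the abstract, cosine annealing of the learning rate and epoch-based gradual unfreezing, can be sketched in framework-agnostic form. This is a minimal illustration, not the authors' implementation: the function names, the 3-epoch freeze window, and the learning-rate bounds are assumptions for demonstration.

```python
import math

def cosine_annealing_lr(step, total_steps, lr_max=1e-3, lr_min=1e-6):
    """Cosine annealing: decay the learning rate from lr_max to lr_min
    over total_steps following half a cosine period (assumed bounds)."""
    cos_term = math.cos(math.pi * step / total_steps)
    return lr_min + 0.5 * (lr_max - lr_min) * (1.0 + cos_term)

def backbone_trainable(epoch, freeze_epochs=3):
    """Gradual unfreezing: keep the backbone frozen (not trainable) for
    the first `freeze_epochs` epochs, training only the MLP head, then
    unfreeze it for full fine-tuning. The 3-epoch default follows the
    1-to-3-epoch window mentioned in the text."""
    return epoch >= freeze_epochs
```

In a training loop, `backbone_trainable(epoch)` would gate the `requires_grad` flag of the backbone parameters, while `cosine_annealing_lr` would set the optimizer's learning rate each step; deep-learning frameworks provide equivalent built-in schedulers.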