Key Publications (5)
*Impact Factors are shown only for papers published within the last 6 years, as of 2026.
1. article | Citations: 0 · 2025
Sensor Fusion‐Based Autoencoder Feature Distillation for 3D Object Detection
Junmin Lee, Wonjun Hwang
IF 0.7 (2025) · Electronics Letters
Abstract: Knowledge distillation is a widely adopted model compression method aimed at narrowing the performance gap between a high‐capacity teacher network and a lightweight student network. However, in the context of sensor fusion‐based 3D object detection, existing distillation methods predominantly emphasize accuracy enhancement through the introduction of multiple loss functions, which often leads to overly complex training procedures. To address this limitation, we propose a sensor fusion‐based feature distillation framework tailored for camera and radar modalities. Our proposed method utilizes an autoencoder to facilitate efficient knowledge transfer from the teacher to the student model. Additionally, we introduce image‐context and radar‐context knowledge distillation strategies to capture and transfer modality‐specific features effectively. We demonstrate the effectiveness of the proposed method on the nuScenes dataset using a ResNet‐based architecture.
https://doi.org/10.1049/ell2.70295
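The paper does not include code here; as a rough illustration only, the autoencoder-mediated knowledge transfer described in the abstract might be sketched as below. All shapes, the linear encoder/decoder, and the plain MSE losses are assumptions for illustration, not the authors' implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

def encode(x, W_enc):
    # Linear encoder: project the teacher feature into a compact latent space.
    return x @ W_enc

def decode(z, W_dec):
    # Linear decoder: reconstruct the teacher feature from the latent code.
    return z @ W_dec

# Hypothetical dimensions: teacher channels 64, student channels 16, latent 16.
C_t, C_s, C_z = 64, 16, 16
W_enc = rng.normal(size=(C_t, C_z)) * 0.1
W_dec = rng.normal(size=(C_z, C_t)) * 0.1

teacher_feat = rng.normal(size=(8, C_t))   # batch of teacher features
student_feat = rng.normal(size=(8, C_s))   # batch of student features

# Reconstruction loss trains the autoencoder to summarize teacher knowledge.
z = encode(teacher_feat, W_enc)
recon_loss = np.mean((decode(z, W_dec) - teacher_feat) ** 2)

# Distillation loss: the student mimics the compact teacher latent
# (here the student and latent dimensions are chosen to match).
distill_loss = np.mean((student_feat - z) ** 2)

total_loss = recon_loss + distill_loss
```

The point of the sketch is the structure: the student matches a compressed latent rather than the full teacher feature, which is what keeps the training objective simple.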
2. article | Citations: 1 · 2025
Knowledge tailoring: Bridging the teacher-student gap in semantic segmentation
Seokhwa Cheung, Seung-Beom Woo, Taehoon Kim, Wonjun Hwang
IF 7.6 (2025) · Pattern Recognition
https://doi.org/10.1016/j.patcog.2025.112399
3. article | Citations: 7 · 2025
Bridging domain spaces for unsupervised domain adaptation
Jaemin Na, Heechul Jung, Hyung Jin Chang, Wonjun Hwang
IF 7.6 (2025) · Pattern Recognition
https://doi.org/10.1016/j.patcog.2025.111537
4. article | Citations: 0 · 2024
Channel and Spatial Enhancement Network for human parsing
Kunliang Liu, Rize Jin, Yuelong Li, Jianming Wang, Wonjun Hwang
IF 4.2 (2024) · Image and Vision Computing
The dominant backbones of neural networks for scene parsing consist of multiple stages, where feature maps at different stages contain varying levels of spatial and semantic information. High-level features convey more semantics and fewer spatial details, while low-level features carry fewer semantics and more spatial details. Consequently, semantic-spatial gaps exist among features at different levels, particularly in human parsing tasks. Many existing approaches directly upsample multi-stage features and aggregate them through addition or concatenation without addressing these gaps. This inevitably leads to spatial misalignment, semantic mismatch, and ultimately misclassification, especially in human parsing, which demands richer semantics and finer feature-map detail owing to intricate textures, diverse clothing styles, and heavy scale variability across body parts. In this paper, we alleviate the long-standing challenge of semantic-spatial gaps between features from different stages by using subtraction and addition operations to recognize the semantic and spatial differences and compensate for them. Based on these principles, we propose the Channel and Spatial Enhancement Network (CSENet) for parsing, a straightforward and intuitive solution that injects high-semantic information into lower-stage features and, conversely, introduces fine details into higher-stage features. Extensive experiments on three dense prediction tasks demonstrate the efficacy of our method: it achieves the best performance on the LIP and CIHP datasets, and we also verify its generality on the ADE20K dataset.
• We propose CSENet, which effectively addresses the challenge of semantic and spatial gaps between feature maps from different stages in human parsing. By using subtraction and addition to compute and compensate for feature differences, CSENet reduces the semantic gaps and introduces high-semantic information into low-level features and fine details into high-level features, benefiting the recognition of large objects and inconspicuous parts, especially in human parsing.
• We introduce CEM and SEM as the main components of CSENet. CEM employs average pooling, subtraction, and addition to compute and compensate for semantic differences, while SEM uses similar operations to compute and compensate for spatial differences. These modules enhance the discriminative ability of feature representations, improving the recognition of fine details, inner patterns, and accurate spatial locations of human parts.
• CSENet is effective and efficient at improving existing backbones. Our modules are general and can be easily integrated into existing architectures, enabling the effective assembly of feature maps from deep to shallow layers. Our method achieves state-of-the-art performance on the LIP and CIHP datasets without using pose information or the class hierarchy of the scene, and we validate its generality with a transformer backbone on the ADE20K scene-parsing dataset.
https://doi.org/10.1016/j.imavis.2024.105332
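The subtract-then-add compensation that the abstract attributes to CEM can be sketched in a few lines of NumPy. This is a loose illustration under stated assumptions (nearest-neighbour upsampling, global average pooling as the channel descriptor, a single 2x stage gap), not the published CSENet module:

```python
import numpy as np

def upsample2x(x):
    # Nearest-neighbour upsampling along the spatial axes (H, W).
    return x.repeat(2, axis=-2).repeat(2, axis=-1)

def global_avg_pool(x):
    # Channel descriptor: average over spatial dims, keep dims for broadcasting.
    return x.mean(axis=(-2, -1), keepdims=True)

def semantic_compensate(low, high):
    """CEM-style idea: measure the channel-wise semantic difference between
    stages by subtracting pooled descriptors, then add the difference back
    to the low-level feature so it gains high-stage semantics."""
    diff = global_avg_pool(upsample2x(high)) - global_avg_pool(low)
    return low + diff  # broadcasts the per-channel correction over all pixels

rng = np.random.default_rng(0)
low = rng.normal(size=(1, 8, 8, 8))    # low-stage feature map: (N, C, H, W)
high = rng.normal(size=(1, 8, 4, 4))   # high-stage feature map, half resolution

enhanced = semantic_compensate(low, high)
```

After compensation, each channel of `enhanced` carries the high-stage channel statistics while keeping the low-stage spatial detail, which is the gap-bridging effect the abstract describes.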
5. article | Citations: 2 · 2023
Class relationship‐based knowledge distillation for efficient human parsing
Yuqi Lang, Kunliang Liu, Jianming Wang, Wonjun Hwang
IF 0.7 (2023) · Electronics Letters
Abstract: In computer vision, human parsing is challenging because it demands accurate human-region localization and semantic partitioning. This dense prediction task requires powerful computation and high-precision models. To enable real-time parsing on resource-limited devices, the authors introduce a lightweight model using ResNet18 as the core network. They simplify the pyramid module, improving context clarity and reducing complexity, and integrate a spatial attention fusion strategy to counter the precision loss incurred by light-weighting. Traditional models, despite their segmentation precision, are limited by computational complexity and extensive parameters, so the authors apply knowledge distillation (KD) to enhance the lightweight network's accuracy. Because conventional distillation can fail to transfer useful knowledge when teacher and student networks differ significantly, the authors use a novel distillation approach based on inter‐class and intra‐class relations in prediction outcomes, noticeably improving parsing accuracy. Experiments on the Look into Person (LIP) dataset show that the lightweight model significantly reduces parameters while maintaining parsing precision and improving inference speed.
https://doi.org/10.1049/ell2.12900
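One common way to realize the inter-class relation idea the abstract mentions is to distill a class-similarity matrix computed from prediction outputs. The following NumPy sketch is an assumed formulation for illustration (cosine similarity between per-class probability maps, matched with MSE); the paper's exact relation terms may differ:

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def class_relation(logits):
    """Inter-class relation matrix: cosine similarity between per-class
    probability maps, flattened over pixels. Shape: (C, C)."""
    p = softmax(logits, axis=0)                            # (C, H*W) probabilities
    p = p / (np.linalg.norm(p, axis=1, keepdims=True) + 1e-8)
    return p @ p.T

rng = np.random.default_rng(0)
C, HW = 5, 64                                  # 5 classes, 8x8 pixels flattened
teacher_logits = rng.normal(size=(C, HW))
student_logits = teacher_logits + 0.1 * rng.normal(size=(C, HW))

# Relation-distillation loss: the student mimics how the teacher's classes
# relate to one another, rather than matching raw logits pixel by pixel.
loss = np.mean((class_relation(student_logits) - class_relation(teacher_logits)) ** 2)
```

Matching relations instead of raw outputs is what makes this kind of distillation tolerant of large teacher-student capacity gaps, which is the failure mode the abstract calls out.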