논문 | 배호 교수 연구실 | 이화여자대학교 사이버보안학과

|배호 교수 연구실

홈

연구 영역

기본 정보

논문·특허

과제

구성원

논문

연구 성과 추이

표시된 성과는 수집된 데이터 기준으로 산출되며, 일부 차이가 있을 수 있습니다.

주요 논문

*2026년 기준 최근 6년 이내 논문에 한해 Impact Factor가 표기됩니다.

Article

인용수 7

2024

Evaluation of Malware Classification Models for Heterogeneous Data

Ho Bae

IF 3.5 (2024)

Sensors

머신러닝(ML)은 다양한 분야에서 널리 활용되고 있다. 또한 ML 기반 기법은 기술 분야의 보안 문제를 해결하기 위해 사용되어 왔으며, 다수의 연구는 보안 문제를 해결하는 데 있어 그 잠재력과 효과를 보여주고 있다. 수년 동안 악성 소프트웨어를 식별하기 위한 ML 방법은 여러 보안 영역에 걸쳐 개발되어 왔다. 그러나 최근 연구에서는 ML 모델이 작은 입력 교란에 취약하다는 점이 강조되었으며, 이를 적대적 예(adversarial examples)라 한다. 이러한 적대적 예는 모델의 예측을 크게 변화시킬 수 있다. 기존의 적대적 예에 관한 연구는 주로 영상 처리용 ML 모델에 초점을 두었으나, 점차 보안을 포함한 다른 응용 분야로 확장되었다. 흥미롭게도 적대적 공격은 악성코드 분류(malware classification)의 영역에서 특히 효과적인 것으로 입증되었다. 본 연구는 악성코드 분류의 투명성을 탐구하고 악성코드 분류기(malware classifiers)를 위한 설명 방법을 개발하고자 한다. 현재의 과제는 전통적인 영상 데이터셋과 비교해 악성코드가 지닌 복잡한 데이터 구조로 인해, 동질적 데이터에 대한 설명가능 AI(explainable AI)에서의 과제보다 더 복잡하다. 연구 결과 기존의 설명들이 이질적 데이터(heterogeneous data)를 해석하는 데에는 한계가 있음을 확인하였다. 본 연구에서 사용한 방법은, 분류 정확도(classification accuracy)가 높더라도 현재의 악성코드 탐지기가 오히려 잘못된 형태의 보안감을 제공할 수 있으며, 분류 정확도의 측정만으로 탐지기를 검증하기에는 충분하지 않다는 점을 보여주었다.

https://doi.org/10.3390/s24010288

Malware

Computer science

Adversarial system

Machine learning

Artificial intelligence

Software

Adversarial machine learning

Data mining

Computer security

Article

인용수 0

2024

Star-Generative Adversarial Network Advancements for De-Identification with Fixing Target Attributes

Yerin Yoon, Ho Bae

KIISE Transactions on Computing Practices

비식별화는 데이터셋에서 개인을 식별할 수 있는 요소들을 제거하여 데이터로부터 개인정보가 노출되지 않도록 하는 보안 방법이다. 제 4차 산업혁명 이후 데이터에 대한 수요와 공급이 기하급수적으로 증가하면서 데이터로 인한 개인 정보 노출 가능성이 현저히 높아졌는데, 이에 따라 데이터 활용을 제한하지 않으면서 보안 문제를 해결하기 위한 데이터 비식별화가 중요해 졌다. 특히 이미지 데이터는 원본 형태로 노출되었을 때 데이터 도용 및 악용 가능성이 높고, 사용자의 초상권을 침해할 수 있다. 그래서 이미지 데이터 비식별화는 지속적으로 연구되어 왔으며, 최근에는 데이터의 분포를 유지하면서 비식별화를 진행하는 생성 모델을 이용한 비식별화가 주목받고 있다. 본 논문에서는 이미지의 특정한 속성은 유지하면서 비식별화를 진행하는 생성 모델의 고도화를 목표로 하며, 비식별화 과정 중 모델 학습 과정에서 사용할 수 있는 두 가지의 선택 방법론을 제안한다. 그리고 두 가지의 선택 방법론을 적용함으로써 유지하고자 하는 target 속성이 기존 모델보다 더 잘 유지되며 생성 모델의 성능 또한 더 좋아지는 것을 실험으로 보인다.

https://doi.org/10.5626/ktcp.2024.30.4.199

Adversarial system

Identification (biology)

Generative grammar

Star (game theory)

Generative adversarial network

Computer science

Artificial intelligence

Deep learning

Astrophysics

Physics

Book chapter

인용수 3

2024

FLGuard: Byzantine-Robust Federated Learning via Ensemble of Contrastive Models

Younghan Lee, Yungi Cho, Woorim Han, Ho Bae, Yunheung Paek

Lecture notes in computer science

https://doi.org/10.1007/978-3-031-51482-1_4

Computer science

Federated learning

Byzantine fault tolerance

Deep learning

Artificial intelligence

Ensemble learning

Machine learning

Deep neural networks

Scheme (mathematics)

Distributed computing

Article

인용수 1

2023

PUCA: Patch-Unshuffle and Channel Attention for Enhanced Self-Supervised Image Denoising

Ho Bae, Hyemi Jang, Dahuin Jung, Jaihyun Lew, Junsung Park, Sungroh Yoon

https://doi.org/10.52202/075280-0843

Channel (broadcasting)

Noise reduction

Noise (video)

Pattern recognition (psychology)

Image (mathematics)

Article

인용수 0

2023

Privacy-Preserving Publishing of Individual-Level Medical Data for Cloud Services

Ho Bae, Heonseok Ha, Siwon Kim

딥러닝(DL)은 질병 예측을 포함한 많은 응용 분야에서 광범위하게 채택되어 왔다. 대부분의 DL 기반 응용은 DL 모델이 너무 크고 복잡하여 클라이언트 측에서 실행하기 어렵기 때문에 클라우드 서버에서 수행된다. 실질적으로 클라우드에 호스팅된 추론은 개인 의료 데이터를 사용하는 서비스와 관련하여 개인정보 보호에 대한 우려를 초래한다. 그럼에도 불구하고 최근 건강 진단 서비스용 DL 기반 응용의 발전을 고려하면, 이러한 응용은 일상생활에서 의료 지원을 제공하는 지배적인 수단이 되었다. 개인 의료 데이터의 오용을 방지하기 위해 민감한 정보를 보존하는 여러 기법이 개발되었으며, 이는 개인정보 보호와 유용성 간의 상충관계(trade-off)를 수반한다. 개인정보 보존과 양호한 예측 성능을 모두 제공하는 간단한 방법은 진단 방법을 클라이언트 측에 배치하는 것이다. 그러나 이를 수행하면 DL 모델이 공격자에 대해 더 취약해진다. 이를 위해 본 연구에서는 사용자 데이터의 프라이버시를 보장하면서 원래의 클래스 정보를 유지하고 모델을 역공학(reverse engineering)으로부터 보호하는 딥 프라이빗 생성 프레임워크를 제안한다. 벤치마크 질병 데이터셋에 대한 실제 딥 신경망을 대상으로 한 실험 결과, 제안된 방법은 원래 데이터와 합성 데이터 간의 상호정보량(mutual information)을 거의 80% 감소시키면서도 원래 예측 정확도에 거의 95%에 해당하는 예측 정확도를 보존함을 보여주었다.

http://dx.doi.org/10.1109/bibm58861.2023.10385371

Computer science

Cloud computing

Benchmark (surveying)

Software deployment

Information privacy

Server

Deep learning

Machine learning

Computer security

Private information retrieval

전체 논문

Article

인용수 7

2024

Evaluation of Malware Classification Models for Heterogeneous Data

Ho Bae

IF 3.5 (2024)

Sensors

https://doi.org/10.3390/s24010288

Malware

Computer science

Adversarial system

Machine learning

Artificial intelligence

Software

Adversarial machine learning

Data mining

Computer security

Article

인용수 0

2024

Star-Generative Adversarial Network Advancements for De-Identification with Fixing Target Attributes

Yerin Yoon, Ho Bae

KIISE Transactions on Computing Practices

https://doi.org/10.5626/ktcp.2024.30.4.199

Adversarial system

Identification (biology)

Generative grammar

Star (game theory)

Generative adversarial network

Computer science

Artificial intelligence

Deep learning

Astrophysics

Physics

Book chapter

인용수 3

2024

FLGuard: Byzantine-Robust Federated Learning via Ensemble of Contrastive Models

Younghan Lee, Yungi Cho, Woorim Han, Ho Bae, Yunheung Paek

Lecture notes in computer science

https://doi.org/10.1007/978-3-031-51482-1_4

Computer science

Federated learning

Byzantine fault tolerance

Deep learning

Artificial intelligence

Ensemble learning

Machine learning

Deep neural networks

Scheme (mathematics)

Distributed computing

Article

인용수 1

2023

PUCA: Patch-Unshuffle and Channel Attention for Enhanced Self-Supervised Image Denoising

Ho Bae, Hyemi Jang, Dahuin Jung, Jaihyun Lew, Junsung Park, Sungroh Yoon

https://doi.org/10.52202/075280-0843

Channel (broadcasting)

Noise reduction

Noise (video)

Pattern recognition (psychology)

Image (mathematics)

Article

인용수 0

2023

Privacy-Preserving Publishing of Individual-Level Medical Data for Cloud Services

Ho Bae, Heonseok Ha, Siwon Kim

http://dx.doi.org/10.1109/bibm58861.2023.10385371

Computer science

Cloud computing

Benchmark (surveying)

Software deployment

Information privacy

Server

Deep learning

Machine learning

Computer security

Private information retrieval

Article

인용수 0

2025

Dependable Code Repair with LLMs: AI-Driven Vulnerability Detection and Automated Patching

Sungmin Han, Hyoungshick Kim, Hojoon Lee, Hyungon Moon, Yuseok Jeon, Ho Bae, D Yeo, Gail‐Joon Ahn, Sangkyun Lee

소프트웨어 취약점의 급속한 확산은 규모에 맞춰 보안 결함을 지능적으로 자동 탐지하고 완화하기 위한 긴급한 필요를 야기했다. 전통적인 취약점 분석은 주로 수작업 점검과 도메인 특화 전문지식에 의존해 왔으나, 생성형 AI 기반 코드 개발의 시대에 들어 이러한 방식은 점차 부적절해지고 있다. 본 연구는 소스 코드와 바이너리를 포함한 다중 모달 데이터셋을 활용하여 취약점 수명주기 전 과정—탐지, 패치 생성, 검증—에서 종단 간 자동화를 달성하는 AI 기반 자동 취약점 탐지 및 안전한 코드 생성 프레임워크를 제안한다. 이 시스템은 설명가능 AI(XAI) 기반 취약점 원인 분석, 생성형 패치 합성, 시스템 수준의 방어적 코드 생성, Rust 기반 메모리 안전성 변환, 그리고 모델 기밀성을 위한 차등 프라이버시 메커니즘을 통합한다. 한국-미국 공동 연구 이니셔티브를 통해 개발된 본 프로젝트는 신뢰할 수 있고 프라이버시를 보존하는 AI 기반 소프트웨어 보안을 국제적으로 배치 가능한 플랫폼으로 구축하는 것을 목표로 한다. 제안된 연구는 자기 치유적이며 설명가능하고 설계 단계에서부터 안전을 내재한 소프트웨어 생태계를 위한 기반 방법과 실행 도구 모두에 기여한다.

https://doi.org/10.1109/prdc67299.2025.00032

Vulnerability (computing)

Secure coding

Software

Code (set theory)

Vulnerability assessment

Source code

Vulnerability management

Static program analysis

Book chapter

인용수 6

2024

VFLIP: A Backdoor Defense for Vertical Federated Learning via Identification and Purification

Yungi Cho, Woorim Han, Miseon Yu, Younghan Lee, Ho Bae, Yunheung Paek

Lecture notes in computer science

https://doi.org/10.1007/978-3-031-70903-6_15

Backdoor

Computer science

Identification (biology)

Computer security

Artificial intelligence

Preprint

인용수 1

2024

VFLIP: A Backdoor Defense for Vertical Federated Learning via Identification and Purification

Yungi Cho, Woorim Han, Miseon Yu, Younghan Lee, Ho Bae, Yunheung Paek

arXiv (Cornell University)

수직 연합 학습(Vertical Federated Learning, VFL)은 FL 참여자들에 걸쳐 수직으로 분할된 데이터를 처리하는 데 초점을 둔다. 최근 연구에서는 VFL이 VFL의 고유한 특성을 표적으로 하는 백도어 공격에 대해 상당한 취약성을 지닌다는 점이 밝혀졌다. 따라서 이러한 공격은 주로 수평 연합 학습(Horizontal Federated Learning, HFL) 및 딥 신경망을 위해 설계된 기존 방어 기법을 무력화할 수 있다. 본 논문에서는 VFL에 특화된 최초의 백도어 방어 기법인 VFLIP를 제시한다. VFLIP는 추론 단계에서 동작하는 식별 및 정제(purification) 기법을 활용함으로써 백도어 공격에 대한 견고성을 크게 향상시킨다. VFLIP는 참여자별 이상 탐지(participant-wise anomaly detection) 접근법을 채택하여 먼저 백도어-트리거가 유발한 임베딩을 식별한다. 이후 VFLIP는 악성으로 식별된 임베딩을 제거하는 정제를 수행하고, 나머지 임베딩을 기반으로 모든 임베딩을 재구성한다. CIFAR10, CINIC10, Imagenette, NUS-WIDE, BankMarketing에 대해 광범위한 실험을 수행하여 VFL에서의 백도어 공격을 VFLIP가 효과적으로 완화할 수 있음을 입증한다. https://github.com/blingcho/VFLIP-esorics24

http://arxiv.org/abs/2408.15591

Backdoor

Identification (biology)

Computer science

Computer security

Business

Biology

Preprint

인용수 0

2024

DAFA: Distance-Aware Fair Adversarial Training

Hyungyu Lee, Saehyung Lee, Hyemi Jang, Junsung Park, Ho Bae, Sungroh Yoon

arXiv (Cornell University)

표준 학습에서 클래스 간 정확도 격차는 적대적(adversarial) 학습 중에 증폭되며, 이 현상을 강건 공정성(robust fairness) 문제라고 한다. 기존의 방법들은 강건 공정성을 향상시키기 위해 쉬운 클래스에서의 모델 성능을 희생하여 더 어려운 클래스에 대한 성능을 개선하는 방식으로 접근해 왔다. 그러나 우리는 적대적 공격 하에서 최악 클래스(worst class)에 속한 표본에 대한 모델의 예측이 쉬운 클래스 쪽으로 치우치기보다는 최악 클래스와 유사한 클래스 쪽으로 편향됨을 관찰하였다. 이론적 및 경험적 분석을 통해, 클래스 간 거리가 감소할수록 강건 공정성이 악화됨을 입증한다. 이러한 통찰에 동기부여되어, 클래스 간 유사성을 고려하는 거리 인지형 공정 적대적 학습(Distance-Aware Fair Adversarial training, DAFA) 방법론을 제안한다. 구체적으로, 우리의 방법은 각 클래스에 대해 서로 다른 손실 가중치와 적대적 마진을 부여하고, 유사한 클래스 간에서 강건성의 상충(trade-off)이 이루어지도록 이를 조정한다. 다양한 데이터셋에 걸친 실험 결과는 본 방법이 평균 강건 정확도(robust accuracy)를 유지할 뿐 아니라 최악의 강건 정확도를 유의하게 개선하여, 기존 방법에 비해 강건 공정성이 뚜렷하게 향상됨을 보여준다.

http://arxiv.org/abs/2401.12532

Adversarial system

Robustness (evolution)

Computer science

Class (philosophy)

Artificial intelligence

Machine learning

Article

인용수 2

2023

Exploring Clustered Federated Learning’s Vulnerability against Property Inference Attack

Hyunjun Kim, Yungi Cho, Younghan Lee, Ho Bae, Yunheung Paek

군집화 연합 학습(Clustered federated learning, CFL)은 비독립적이며 동일한 분포(non-independent and identically distributed, non-IID)인 데이터셋으로 인해 발생하는 치명적 망각(catastrophic forgetting)의 문제를 해결하는 연합 학습(Federated learning, FL) 분야의 고급 기법이다. CFL은 클라이언트의 데이터셋이 유사한 정도에 따라 이를 군집화하고, 각 군집에 대해 전역 모델을 학습함으로써 이를 달성한다. 비록 CFL이 non-IID 데이터셋으로부터 야기되는 성능 저하를 완화하는 데 효과적이긴 하나, CFL에서의 잠재적 개인정보 유출 위험은 충분히 연구되지 않았다. 선행 연구에서는 연합 학습(FL)에서의 개인정보 유출 위험을 속성 추론 공격(property inference attack, PIA)을 이용해 평가했는데, 이는 전역 모델의 주요 과제에서 목표 속성과 다른 의도치 않은 속성(즉, 속성)을 추출한다. 본 논문에서는 CFL에 대해 수동 및 능동 PIAs를 모두 적용함으로써 의도치 않은 속성 누출의 잠재적 위험을 탐구한다. 실증 분석 결과, CFL에서의 수동 PIA 성능은 공격 AUC 점수 측면에서 FL에 비해 상당히 우수한 것으로 나타났다. 또한 CFL에 맞춤화된 향상된 능동 PIA 방법을 제안하여 공격 성능을 개선한다. 우리의 방법은 악의적인 로컬 업데이트의 영향을 증폭하는 스케일업 파라미터를 도입하며, 그 결과 이전 기법보다 더 나은 성능을 보인다. 더 나아가, 클라이언트 수준에서 차등 프라이버시(differential privacy, DP) 메커니즘을 적용함으로써 CFL의 취약성이 완화될 수 있음을 보여준다. FL에 DP를 적용하면 높은 유틸리티 손실이 유발될 수 있음을 보여준 선행 연구와 달리, 본 실증 결과는 DP가 CFL에서 방어 메커니즘으로 활용될 수 있으며, 프라이버시와 유틸리티 간 더 나은 절충(trade-off)을 이끌 수 있음을 시사한다.

https://doi.org/10.1145/3607199.3607218

Computer science

Property (philosophy)

Inference

Vulnerability (computing)

Vulnerability assessment

Artificial intelligence

Computer security

주요 논문

*2026년 기준 최근 6년 이내 논문에 한해 Impact Factor가 표기됩니다.

Article

인용수 7

2024

Evaluation of Malware Classification Models for Heterogeneous Data

Ho Bae

IF 3.5 (2024)

Sensors

https://doi.org/10.3390/s24010288

Malware

Computer science

Adversarial system

Machine learning

Artificial intelligence

Software

Adversarial machine learning

Data mining

Computer security

Article

인용수 0

2024

Star-Generative Adversarial Network Advancements for De-Identification with Fixing Target Attributes

Yerin Yoon, Ho Bae

KIISE Transactions on Computing Practices

https://doi.org/10.5626/ktcp.2024.30.4.199

Adversarial system

Identification (biology)

Generative grammar

Star (game theory)

Generative adversarial network

Computer science

Artificial intelligence

Deep learning

Astrophysics

Physics

Book chapter

인용수 3

2024

FLGuard: Byzantine-Robust Federated Learning via Ensemble of Contrastive Models

Younghan Lee, Yungi Cho, Woorim Han, Ho Bae, Yunheung Paek

Lecture notes in computer science

https://doi.org/10.1007/978-3-031-51482-1_4

Computer science

Federated learning

Byzantine fault tolerance

Deep learning

Artificial intelligence

Ensemble learning

Machine learning

Deep neural networks

Scheme (mathematics)

Distributed computing

Article

인용수 1

2023

PUCA: Patch-Unshuffle and Channel Attention for Enhanced Self-Supervised Image Denoising

Ho Bae, Hyemi Jang, Dahuin Jung, Jaihyun Lew, Junsung Park, Sungroh Yoon

https://doi.org/10.52202/075280-0843

Channel (broadcasting)

Noise reduction

Noise (video)

Pattern recognition (psychology)

Image (mathematics)

Article

인용수 0

2023

Privacy-Preserving Publishing of Individual-Level Medical Data for Cloud Services

Ho Bae, Heonseok Ha, Siwon Kim

http://dx.doi.org/10.1109/bibm58861.2023.10385371

Computer science

Cloud computing

Benchmark (surveying)

Software deployment

Information privacy

Server

Deep learning

Machine learning

Computer security

Private information retrieval

전체 논문

Article

인용수 7

2024

Evaluation of Malware Classification Models for Heterogeneous Data

Ho Bae

IF 3.5 (2024)

Sensors

https://doi.org/10.3390/s24010288

Malware

Computer science

Adversarial system

Machine learning

Artificial intelligence

Software

Adversarial machine learning

Data mining

Computer security

Article

인용수 0

2024

Star-Generative Adversarial Network Advancements for De-Identification with Fixing Target Attributes

Yerin Yoon, Ho Bae

KIISE Transactions on Computing Practices

https://doi.org/10.5626/ktcp.2024.30.4.199

Adversarial system

Identification (biology)

Generative grammar

Star (game theory)

Generative adversarial network

Computer science

Artificial intelligence

Deep learning

Astrophysics

Physics

Book chapter

인용수 3

2024

FLGuard: Byzantine-Robust Federated Learning via Ensemble of Contrastive Models

Younghan Lee, Yungi Cho, Woorim Han, Ho Bae, Yunheung Paek

Lecture notes in computer science

https://doi.org/10.1007/978-3-031-51482-1_4

Computer science

Federated learning

Byzantine fault tolerance

Deep learning

Artificial intelligence

Ensemble learning

Machine learning

Deep neural networks

Scheme (mathematics)

Distributed computing

Article

인용수 1

2023

PUCA: Patch-Unshuffle and Channel Attention for Enhanced Self-Supervised Image Denoising

Ho Bae, Hyemi Jang, Dahuin Jung, Jaihyun Lew, Junsung Park, Sungroh Yoon

https://doi.org/10.52202/075280-0843

Channel (broadcasting)

Noise reduction

Noise (video)

Pattern recognition (psychology)

Image (mathematics)

Article

인용수 0

2023

Privacy-Preserving Publishing of Individual-Level Medical Data for Cloud Services

Ho Bae, Heonseok Ha, Siwon Kim

http://dx.doi.org/10.1109/bibm58861.2023.10385371

Computer science

Cloud computing

Benchmark (surveying)

Software deployment

Information privacy

Server

Deep learning

Machine learning

Computer security

Private information retrieval

Article

인용수 0

2025

Dependable Code Repair with LLMs: AI-Driven Vulnerability Detection and Automated Patching

Sungmin Han, Hyoungshick Kim, Hojoon Lee, Hyungon Moon, Yuseok Jeon, Ho Bae, D Yeo, Gail‐Joon Ahn, Sangkyun Lee

https://doi.org/10.1109/prdc67299.2025.00032

Vulnerability (computing)

Secure coding

Software

Code (set theory)

Vulnerability assessment

Source code

Vulnerability management

Static program analysis

Book chapter

인용수 6

2024

VFLIP: A Backdoor Defense for Vertical Federated Learning via Identification and Purification

Yungi Cho, Woorim Han, Miseon Yu, Younghan Lee, Ho Bae, Yunheung Paek

Lecture notes in computer science

https://doi.org/10.1007/978-3-031-70903-6_15

Backdoor

Computer science

Identification (biology)

Computer security

Artificial intelligence

Preprint

인용수 1

2024

VFLIP: A Backdoor Defense for Vertical Federated Learning via Identification and Purification

Yungi Cho, Woorim Han, Miseon Yu, Younghan Lee, Ho Bae, Yunheung Paek

arXiv (Cornell University)

http://arxiv.org/abs/2408.15591

Backdoor

Identification (biology)

Computer science

Computer security

Business

Biology

Preprint

인용수 0

2024

DAFA: Distance-Aware Fair Adversarial Training

Hyungyu Lee, Saehyung Lee, Hyemi Jang, Junsung Park, Ho Bae, Sungroh Yoon

arXiv (Cornell University)

http://arxiv.org/abs/2401.12532

Adversarial system

Robustness (evolution)

Computer science

Class (philosophy)

Artificial intelligence

Machine learning

Article

인용수 2

2023

Exploring Clustered Federated Learning’s Vulnerability against Property Inference Attack

Hyunjun Kim, Yungi Cho, Younghan Lee, Ho Bae, Yunheung Paek

https://doi.org/10.1145/3607199.3607218

Computer science

Property (philosophy)

Inference

Vulnerability (computing)

Vulnerability assessment

Artificial intelligence

Computer security