논문 | 박재휘 교수 연구실 | 서울시립대학교 통계학과

|박재휘 교수 연구실

홈

연구 영역

기본 정보

논문·특허

과제

구성원

논문

연구 성과 추이

표시된 성과는 수집된 데이터 기준으로 산출되며, 일부 차이가 있을 수 있습니다.

주요 논문

*2026년 기준 최근 6년 이내 논문에 한해 Impact Factor가 표기됩니다.

Preprint

인용수 9

2024

Proxy-based Item Representation for Attribute and Context-aware Recommendation

Jinseok Seol, Minseok Gang, Sang‐goo Lee, Jaehui Park

추천 시스템에서의 신경망 기반 접근은 대규모 항목 집합을 학습 가능한 벡터 임베딩 테이블로 표현함으로써 놀라운 성과를 보였다. 그러나 드문 항목은 충분한 학습 기회를 갖지 못해 의미 있는 표현을 학습하기 어려울 수 있다. 본 연구에서는 속성 및 맥락을 고려하는 설정에서 드문 항목의 충분히 학습되지 않은 임베딩이 추천 정확도를 저하시킨다는 점을 확인한다. 이러한 문제를 해결하기 위해, 각 항목을 학습 가능한 프록시 임베딩들의 가중 합으로 표현할 수 있게 하는 프록시 기반 항목 표현을 제안한다. 여기서 프록시의 가중치는 각 항목의 속성과 맥락에 의해 결정되며, 빈번한 항목의 경우 협업 신호를 보다 잘 반영하기 위해 편향 항(bias term)을 포함할 수 있다. 프록시 기반 방법은 항목 표현을 조합적으로(compositionally) 계산하여 각 표현이 잘 학습된 심플렉스(simplex) 내부에 위치하도록 보장하며, 따라서 품질이 담보된다. 또한 모든 항목에 걸쳐 프록시 임베딩을 공유함으로써, 드문 항목은 통일된 모델 구조 내에서 빈번한 항목의 학습 신호를 엔드투엔드 방식으로 차용할 수 있다. 제안하는 방법은 플러그앤플레이 방식의 모델로서, 어떤 신경망 기반 추천 모델이든 항목 인코딩 레이어를 대체할 수 있으며, 훨씬 더 적은 파라미터 사용으로도 일관되게 추천 성능을 향상시킨다. 실제 추천 벤치마크 데이터셋에서 수행한 실험 결과, 본 모델은 10%의 파라미터만 사용하면서도 추천 정확도 면에서 기존 최첨단 모델을 최대 17%까지 능가함을 보여주었다.

https://doi.org/10.1145/3616855.3635824

Computer science

Proxy (statistics)

Recommender system

Embedding

Collaborative filtering

Artificial neural network

Benchmark (surveying)

Artificial intelligence

Machine learning

Set (abstract data type)

Article

인용수 33

2022

Retrieval-Augmented Response Generation for Knowledge-Grounded Conversation in the Wild

Yeonchan Ahn, Sang‐goo Lee, Junho Shim, Jaehui Park

IF 3.9 (2022)

IEEE Access

인터넷 사용자들은 흥미로운 사실이나 주제에 관한 대화를 나누면서 웹으로부터 다양한 지식을 함께 접하는 경우가 흔하다. 그러나 기존의 대부분 지식 기반 대화 모델은 대화의 주제와 관련하여 오직 단일 문서만을 고려한다. 최근 제안된 검색 증강(retrieval-augmented) 모델들은 다수의 문서에 기반하여 응답을 생성하지만, 주어진 주제를 무시하고 대화의 국소적 문맥(local context)만을 사용한다. 이를 위해 본 연구는 주제와 대화의 국소적 문맥 모두와 관련 있는 적절한 범위의 문서를 검색하여 이를 지식 기반 응답 생성에 활용하는 새로운 검색 증강 응답 생성 모델을 제안한다. 우리의 모델은 먼저 전체 대화에서 추출한 주제 단어(topic words)와 응답 이전의 토큰(tokens)을 모두 입력으로 받아 여러 표상(representations)을 산출한다. 그 다음 대화와 문서 인코더에서 각각 처음 N 토큰의 표상과 대화의 키워드 및 문서에서의 키워드 표상을 선택하고, 대화의 표상 그룹을 문서의 표상 그룹과 각각 비교한다. 학습을 위해서는 정답 지식(ground truth knowledge) 없이도 모델이 지식 기반 응답을 생성하도록 유도하는 새로운 데이터 가중치(data-weighting) 방식을 도입한다. 대규모 데이터셋을 사용한 자동 및 사람 평가 결과는, 제안한 모델이 기존 최신(state-of-the-art) 모델에 비해 보다 더 지식이 풍부하고 다양하며 관련성 높은 응답을 생성할 수 있음을 보여준다.

https://doi.org/10.1109/access.2022.3228964

Conversation

Computer science

Context (archaeology)

Security token

Information retrieval

Representation (politics)

Natural language processing

Artificial intelligence

The Internet

World Wide Web

Article

인용수 6

2020

Exploiting Text Matching Techniques for Knowledge-Grounded Conversation

Yeonchan Ahn, Sang‐goo Lee, Jaehui Park

IF 3.367 (2020)

IEEE Access

지식 기반 대화 모델은 외부 지식에 근거하여 주어진 대화 맥락에 대해 유익한 응답을 생성하는 것을 목표로 한다. 유익하고 맥락에 부합하는 응답을 생성하기 위해서는 대화 맥락과 외부 지식을 균형 있게 결합(conjugate)하는 것이 중요하다. 그러나 기존 연구들은 외부 지식원에서 적절한 지식 문장을 찾는 문제를, 정확한 대화 행위(dialogue acts)를 갖는 적절한 문장을 생성하는 문제보다 상대적으로 덜 주목해 왔다. 본 논문에서는 두 가지 지식 선택 전략을 제안한다: 1) Reduce-Match 및 2) Match-Reduce. 그리고 각 전략에 기반한 여러 신경 지식 기반 대화 모델을 탐색한다. Reduce-Match 전략에 기반한 모델은 먼저 전체 대화 맥락을 중요 특징이 보존된 단일 벡터로 압축(distill)한 다음, 이 맥락 벡터를 지식 문장들의 표현과 비교하여 관련된 지식 문장을 예측한다. Match-Reduce 전략에 기반한 모델은 먼저 맥락의 각 발화를 지식 문장과 매칭(match)하여 세밀한 상호작용을 포착하고, 정보 손실을 최소화하면서 이를 집계하여 지식 문장을 예측한다. 실험 결과는 각 지식 선택 전략을 사용하는 대화 모델이 지식 선택 정확도뿐 아니라 응답 생성 성능에서도 경쟁 기준선(competitive baselines)보다 우수함을 보여준다. 또한 Match-Reduce를 기반으로 한 최우수 모델은 Wizard of Wikipedia 데이터셋을 대상으로 한 비교 실험에서 기준선들을 능가한다. 아울러 Reduce-Match를 기반으로 한 최우수 모델은 CMU Document Grounded Conversations 데이터셋에서 기준선들을 능가한다.

https://doi.org/10.1109/access.2020.3007893

Conversation

Computer science

Matching (statistics)

Natural language processing

Psychology

Communication

전체 논문

Preprint

인용수 9

2024

Proxy-based Item Representation for Attribute and Context-aware Recommendation

Jinseok Seol, Minseok Gang, Sang‐goo Lee, Jaehui Park

https://doi.org/10.1145/3616855.3635824

Computer science

Proxy (statistics)

Recommender system

Embedding

Collaborative filtering

Artificial neural network

Benchmark (surveying)

Artificial intelligence

Machine learning

Set (abstract data type)

Article

인용수 33

2022

Retrieval-Augmented Response Generation for Knowledge-Grounded Conversation in the Wild

Yeonchan Ahn, Sang‐goo Lee, Junho Shim, Jaehui Park

IF 3.9 (2022)

IEEE Access

https://doi.org/10.1109/access.2022.3228964

Conversation

Computer science

Context (archaeology)

Security token

Information retrieval

Representation (politics)

Natural language processing

Artificial intelligence

The Internet

World Wide Web

Article

인용수 6

2020

Exploiting Text Matching Techniques for Knowledge-Grounded Conversation

Yeonchan Ahn, Sang‐goo Lee, Jaehui Park

IF 3.367 (2020)

IEEE Access

https://doi.org/10.1109/access.2020.3007893

Conversation

Computer science

Matching (statistics)

Natural language processing

Psychology

Communication

Preprint

인용수 0

2025

Enhancing Text-to-Image Retrieval by Addressing Parts-of-Speech Imbalance in Vision-Language Models

D. Kim, Hyesu Hwang, Jaehui Park, Yongjin Kwon

SSRN Electronic Journal

https://doi.org/10.2139/ssrn.5078743

Computer science

Image (mathematics)

Natural language processing

Speech recognition

Artificial intelligence

Language model

Article

인용수 1

2024

Bridging the Lexical Gap: Generative Text-to-Image Retrieval for Parts-of-Speech Imbalance in Vision-Language Models

Hyesu Hwang, Daeun Kim, Jaehui Park, Yongjin Kwon

시각과 언어 표현을 정렬하는 일이 비자명하므로, 텍스트를 기반으로 관련 이미지를 검색하는 것은 어렵다. 최근 연구에서는 정렬에 대한 사전학습 지식을 활용하기 위해 CLIP과 같은 대규모 비전-언어 모델이 널리 사용된다. 그러나 우리의 관찰에 따르면 명사 쿼리에 비해 동사, 형용사, 부사 쿼리에서는 성능이 60.8% 감소한다. 예비 연구를 통해, 널리 사용되는 비전-언어 모델들에서 특정 품사에 대한 이미지-텍스트 정렬이 충분하지 않음을 확인하였다. 또한 명사가 비전-언어 모델의 텍스트-대-이미지 검색 결과에 높은 영향을 미친다는 점도 관찰하였다. 이를 바탕으로 본 논문은 쿼리 재작성 과정의 일부로서 명사 기반 쿼리를 생성하는 방법을 제안한다. 먼저, 대규모 언어 모델이 초기 쿼리와 관련된 명사를 추출하고, 비전-언어 모델에서의 품사 정렬에 가장 잘 부합하는 가상 쿼리를 생성한다. 그런 다음 해당 가상 쿼리가 원래 쿼리의 의도를 보존하는지 검증하고, 이를 반복적으로 재작성한다. 실험 결과, 본 방법은 텍스트-대-이미지 검색 성능을 유의미하게 향상시킬 수 있으며, 비전-언어 모델이 어휘 지식을 이해하는 방식을 부각한다.

https://doi.org/10.1145/3689091.3690089

Bridging (networking)

Computer science

Generative grammar

Natural language processing

Artificial intelligence

Generative model

Speech recognition

Article

인용수 18

2023

Contrastive learning for unsupervised image-to-image translation

Hanbit Lee, Jinseok Seol, Sang‐goo Lee, Jaehui Park, Junho Shim

IF 7.2 (2023)

Applied Soft Computing

https://doi.org/10.1016/j.asoc.2023.111170

Computer science

Image (mathematics)

Translation (biology)

Artificial intelligence

Image translation

Natural language processing

Computer vision

Pattern recognition (psychology)

Article

인용수 2

2023

Integrating Heterogeneous Graphs Using Graph Transformer Encoder for Solving Math Word Problems

Soyun Shin, Jaehui Park, Moonwook Ryu

IF 3.4 (2023)

IEEE Access

본 논문은 수학 단어 문제를 해결하기 위해 구조적 정보를 딥 뉴럴 모델 학습에 통합하는 새로운 방법을 제안한다. 선행 연구들은 입력 문장에 내재된 풍부한 정보를 표현하기 위해 그래프 구조를 채택해 왔다. 그러나 문장 내 다른 부분들 사이의 서로 다른 관계 유형을 고려하지 않았다. 다양한 유형의 구조 정보를 일관된 방식으로 제공하기 위해, 우리는 다양한 입력 표현의 이질적 그래프를 통합하는 그래프 트랜스포머 인코더를 제안한다. 우리는 두 가지 유형의 그래프 구조를 개발하였다. 첫째, Dependency Graph는 단어와 수량 사이의 장거리 어휘 의존성을 유지한다. 둘째, Question Overlap Graph는 문제 본문 내의 핵심을 포착한다. 두 그래프는 그래프 변환을 위해 단일 그래프로 인코딩된다. 실험 결과, 제안 방법은 기본 방법들(baselines)과 비교하여 경쟁력 있는 성능을 보인다. 또한 우리의 모델은 SVAMP 벤치마크에서 방정식 및 답변 정확도에 대해 약 3 퍼센트가량 우수한 성능을 최신(기존) 모델들보다 달성한다. 더 나아가, 서로 다른 유형의 텍스트적 특성을 통합하면 자연어 문장으로부터 수학적 논리 추론의 품질을 향상시킬 수 있음을 논의한다.

https://doi.org/10.1109/access.2023.3257571

Computer science

Transformer

Theoretical computer science

Inference

Encoder

Graph

Dependency graph

Artificial intelligence

Natural language processing

Algorithm

Preprint

인용수 1

2023

Contrastive Learning for Unsupervised Image-to-Image Translation

Hanbit Lee, Jinseok Seol, Sang‐Goo Lee, Jaehui Park, Junho Shim

SSRN Electronic Journal

http://dx.doi.org/10.2139/ssrn.4548639

Translation (biology)

Image (mathematics)

Artificial intelligence

Computer science

Natural language processing

Image translation

Computer vision

Pattern recognition (psychology)

Chemistry

Article

인용수 2

2023

Improving Complex Scene Generation by Enhancing Multi-Scale Representations of GAN Discriminators

Hanbit Lee, Sang‐goo Lee, Jaehui Park, Junho Shim

IF 3.4 (2023)

IEEE Access

최근 GAN 모델의 발전은 다양한 대상 이미지의 사진과 유사한 합성을 가능하게 했지만, 다수의 물체가 포함된 장면과 같은 더 복잡한 이미지 분포를 모델링하는 데에는 여전히 과제가 남아 있다. 이러한 어려움은 장면 이미지의 높은 구조적 복잡성에 있으며, 판별기(discriminator)는 실제 장면 이미지와 가짜 장면 이미지 사이의 복잡한 구조적 차이를 판별하는 데 큰 부담을 지닌다. 따라서 판별기의 판별 능력을 향상시키는 것은 GAN 모델의 생성 성능을 개선하는 효과적인 전략 중 하나가 될 수 있다. 본 논문에서는 시각 표현 학습에 관한 두 가지 최근 패러다임인 자기지도학습(self-supervised learning)과 전이학습(transfer learning)을 활용하여 판별 능력을 향상시키는 방법을 탐구한다. 첫 번째 접근으로, 판별기의 다중 스케일(multi-scale) 표현을 강화하기에 적합한 자기지도 보조 과제를 제안한다. 두 번째 접근으로는, 다양한 장면 이해(scene understanding) 모델로부터 사전학습된 표현을 활용하여 판별기를 추가로 강화한다. 다수의 전문가 모델로부터의 지식을 충분히 활용하기 위해, 다중 스케일 특징 앙상블(multi-scale feature ensemble)을 제안하여 다중 스케일 표현을 혼합한다. 도전적인 장면 데이터셋에서의 실험 결과는 제안된 전략들이 생성 성능을 유의하게 향상시켜 복잡한 장면 이미지의 다양하고 사진과 같은 합성을 가능하게 함을 보여준다.

http://dx.doi.org/10.1109/access.2023.3270561

Discriminator

Discriminative model

Computer science

Artificial intelligence

Representation (politics)

Feature (linguistics)

Feature learning

Pattern recognition (psychology)

Scale (ratio)

Object (grammar)

Article

인용수 2

2019

Selectively Connected Self-Attentions for Semantic Role Labeling

Jaehui Park

IF 2.474 (2019)

Applied Sciences

의미 역할 라벨링(Semantic role labeling)은 자연어 문장에서 단어 관계와 관련된 내재적 의미를 이해하기 위한 효과적인 접근법이다. 특히 최근의 연구들은 심층 신경망, 그중에서도 순환 신경망(recurrent neural networks)을 사용하여 기존의 얕은 모델을 크게 향상시켰다. 그러나 순환적 업데이트의 한계로 인해 대규모 데이터 집합에 대해 긴 학습 시간이 필요하다. 또한 이들은 언어의 계층적 구조를 포착하지 못한다. 본 연구에서는 의미 역할 라벨링을 위해 주의(attentive) 표현들 사이의 선택적 연결(selective connections)을 제공하는 새로운 심층 신경 모델을 제안하며, 이를 통해 순환 업데이트를 제거한다. 실험 결과, 본 모델은 최신 연구들(state-of-the-art)에 비해 정확도가 더 우수함을 보였다. 본 모델은 각각 CoNLL 2005 및 CoNLL 2012 공유 과제에서 86.6 F1 점수와 83.6 F1 점수를 달성하였다. 연결 모듈(connection module)을 통해 계층 정보를 포착함으로써 정확도 향상이 향상되었다. 또한 본 모델은 반복적 업데이트를 회피하도록 병렬화될 수 있음을 보인다. 그 결과, 본 모델은 기준선 대비 학습 시간을 62%p(퍼센트 포인트) 감소시켰다.

https://doi.org/10.3390/app9081716

Computer science

Artificial intelligence

Baseline (sea)

Natural language processing

Recurrent neural network

Semantic role labeling

Deep neural networks

Word (group theory)

Training set

Set (abstract data type)

주요 논문

*2026년 기준 최근 6년 이내 논문에 한해 Impact Factor가 표기됩니다.

Preprint

인용수 9

2024

Proxy-based Item Representation for Attribute and Context-aware Recommendation

Jinseok Seol, Minseok Gang, Sang‐goo Lee, Jaehui Park

https://doi.org/10.1145/3616855.3635824

Computer science

Proxy (statistics)

Recommender system

Embedding

Collaborative filtering

Artificial neural network

Benchmark (surveying)

Artificial intelligence

Machine learning

Set (abstract data type)

Article

인용수 33

2022

Retrieval-Augmented Response Generation for Knowledge-Grounded Conversation in the Wild

Yeonchan Ahn, Sang‐goo Lee, Junho Shim, Jaehui Park

IF 3.9 (2022)

IEEE Access

https://doi.org/10.1109/access.2022.3228964

Conversation

Computer science

Context (archaeology)

Security token

Information retrieval

Representation (politics)

Natural language processing

Artificial intelligence

The Internet

World Wide Web

Article

인용수 6

2020

Exploiting Text Matching Techniques for Knowledge-Grounded Conversation

Yeonchan Ahn, Sang‐goo Lee, Jaehui Park

IF 3.367 (2020)

IEEE Access

https://doi.org/10.1109/access.2020.3007893

Conversation

Computer science

Matching (statistics)

Natural language processing

Psychology

Communication

전체 논문

Preprint

인용수 9

2024

Proxy-based Item Representation for Attribute and Context-aware Recommendation

Jinseok Seol, Minseok Gang, Sang‐goo Lee, Jaehui Park

https://doi.org/10.1145/3616855.3635824

Computer science

Proxy (statistics)

Recommender system

Embedding

Collaborative filtering

Artificial neural network

Benchmark (surveying)

Artificial intelligence

Machine learning

Set (abstract data type)

Article

인용수 33

2022

Retrieval-Augmented Response Generation for Knowledge-Grounded Conversation in the Wild

Yeonchan Ahn, Sang‐goo Lee, Junho Shim, Jaehui Park

IF 3.9 (2022)

IEEE Access

https://doi.org/10.1109/access.2022.3228964

Conversation

Computer science

Context (archaeology)

Security token

Information retrieval

Representation (politics)

Natural language processing

Artificial intelligence

The Internet

World Wide Web

Article

인용수 6

2020

Exploiting Text Matching Techniques for Knowledge-Grounded Conversation

Yeonchan Ahn, Sang‐goo Lee, Jaehui Park

IF 3.367 (2020)

IEEE Access

https://doi.org/10.1109/access.2020.3007893

Conversation

Computer science

Matching (statistics)

Natural language processing

Psychology

Communication

Preprint

인용수 0

2025

Enhancing Text-to-Image Retrieval by Addressing Parts-of-Speech Imbalance in Vision-Language Models

D. Kim, Hyesu Hwang, Jaehui Park, Yongjin Kwon

SSRN Electronic Journal

https://doi.org/10.2139/ssrn.5078743

Computer science

Image (mathematics)

Natural language processing

Speech recognition

Artificial intelligence

Language model

Article

인용수 1

2024

Bridging the Lexical Gap: Generative Text-to-Image Retrieval for Parts-of-Speech Imbalance in Vision-Language Models

Hyesu Hwang, Daeun Kim, Jaehui Park, Yongjin Kwon

https://doi.org/10.1145/3689091.3690089

Bridging (networking)

Computer science

Generative grammar

Natural language processing

Artificial intelligence

Generative model

Speech recognition

Article

인용수 18

2023

Contrastive learning for unsupervised image-to-image translation

Hanbit Lee, Jinseok Seol, Sang‐goo Lee, Jaehui Park, Junho Shim

IF 7.2 (2023)

Applied Soft Computing

https://doi.org/10.1016/j.asoc.2023.111170

Computer science

Image (mathematics)

Translation (biology)

Artificial intelligence

Image translation

Natural language processing

Computer vision

Pattern recognition (psychology)

Article

인용수 2

2023

Integrating Heterogeneous Graphs Using Graph Transformer Encoder for Solving Math Word Problems

Soyun Shin, Jaehui Park, Moonwook Ryu

IF 3.4 (2023)

IEEE Access

https://doi.org/10.1109/access.2023.3257571

Computer science

Transformer

Theoretical computer science

Inference

Encoder

Graph

Dependency graph

Artificial intelligence

Natural language processing

Algorithm

Preprint

인용수 1

2023

Contrastive Learning for Unsupervised Image-to-Image Translation

Hanbit Lee, Jinseok Seol, Sang‐Goo Lee, Jaehui Park, Junho Shim

SSRN Electronic Journal

http://dx.doi.org/10.2139/ssrn.4548639

Translation (biology)

Image (mathematics)

Artificial intelligence

Computer science

Natural language processing

Image translation

Computer vision

Pattern recognition (psychology)

Chemistry

Article

인용수 2

2023

Improving Complex Scene Generation by Enhancing Multi-Scale Representations of GAN Discriminators

Hanbit Lee, Sang‐goo Lee, Jaehui Park, Junho Shim

IF 3.4 (2023)

IEEE Access

http://dx.doi.org/10.1109/access.2023.3270561

Discriminator

Discriminative model

Computer science

Artificial intelligence

Representation (politics)

Feature (linguistics)

Feature learning

Pattern recognition (psychology)

Scale (ratio)

Object (grammar)

Article

인용수 2

2019

Selectively Connected Self-Attentions for Semantic Role Labeling

Jaehui Park

IF 2.474 (2019)

Applied Sciences

https://doi.org/10.3390/app9081716

Computer science

Artificial intelligence

Baseline (sea)

Natural language processing

Recurrent neural network

Semantic role labeling

Deep neural networks

Word (group theory)

Training set

Set (abstract data type)