조재민 연구실은 인간-컴퓨터 상호작용을 중심으로 정보 시각화, 고차원 데이터 분석, 멀티모달 표현, 생성형 AI 기반 분석 지원 시스템을 연구하며, 사용자의 의도와 맥락을 반영한 지능형 인터페이스를 설계하고 복잡한 데이터를 더 신뢰성 있게 이해할 수 있는 시각 분석 도구를 개발하는 컴퓨팅 융합 연구를 수행하고 있다.
GhostUMAP2: Measuring and Analyzing $(r,d)$-Stability of UMAP
Myeongwon Jung, Toshifumi Fujiwara, Jaemin Jo
IEEE Transactions on Visualization and Computer Graphics
Despite the widespread use of Uniform Manifold Approximation and Projection (UMAP), the impact of its stochastic optimization process on the results remains underexplored. We observed that it often produces unstable results where the projections of data points are determined mostly by chance rather than reflecting neighboring structures. To address this limitation, we introduce (r,d)-stability to UMAP: a framework that analyzes the stochastic positioning of data points in the projection space. To assess how stochastic elements-specifically, initial projection positions and negative sampling-impact UMAP results, we introduce "ghosts", or duplicates of data points representing potential positional variations due to stochasticity. We define a data point's projection as (r,d)-stable if its ghosts perturbed within a circle of radius r in the initial projection remain confined within a circle of radius d for their final positions. To efficiently compute the ghost projections, we develop an adaptive dropping scheme that reduces a runtime up to 60% compared to an unoptimized baseline while maintaining approximately 90% of unstable points. We also present a visualization tool that supports the interactive exploration of the (r,d)-stability of data points. Finally, we demonstrate the effectiveness of our framework by examining the stability of projections of real-world datasets and present usage guidelines for the effective use of our framework.
UMATO: Bridging Local and Global Structures for Reliable Visual Analytics With Dimensionality Reduction
Hyeon Jeon, K. Ko, Soohyun Lee, Jake Hyun, Taoxi Yang, Gyeong‐Tak Go, Jaemin Jo, Jinwook Seo
IEEE Transactions on Visualization and Computer Graphics
Due to the intrinsic complexity of high-dimensional (HD) data, dimensionality reduction (DR) techniques cannot preserve all the structural characteristics of the original data. Therefore, DR techniques focus on preserving either local neighborhood structures (local techniques) or global structures such as pairwise distances between points (global techniques). However, both approaches can mislead analysts to erroneous conclusions about the overall arrangement of manifolds in HD data. For example, local techniques may exaggerate the compactness of individual manifolds, while global techniques may fail to separate clusters that are well-separated in the original space. In this research, we provide a deeper insight into Uniform Manifold Approximation with Two-phase Optimization (UMATO), a DR technique that addresses this problem by effectively capturing local and global structures. UMATO achieves this by dividing the optimization process of UMAP into two phases. In the first phase, it constructs a skeletal layout using representative points, and in the second phase, it projects the remaining points while preserving the regional characteristics. Quantitative experiments validate that UMATO outperforms widely used DR techniques, including UMAP, in terms of global structure preservation, with a slight loss in local structure. We also confirm that UMATO outperforms baseline techniques in terms of scalability and stability against initialization and subsampling, making it more effective for reliable HD data analysis. Finally, we present a case study and a qualitative demonstration that highlight UMATO's effectiveness in generating faithful projections, enhancing the overall reliability of visual analytics using DR.