유재민 교수 연구실 | 서울대학교 컴퓨터공학부

유재민 연구실

서울대학교 컴퓨터공학부

유재민 교수

유재민 교수 연구실

서비스 플랜

연구실 검색

프로젝트 공고

정부 과제 추천

AI 기반 기업 서칭

홈

기본 정보

연구 분야

프로젝트

발행물

구성원

홈

유재민 연구실

서울대학교 컴퓨터공학부 유재민 교수

유재민 연구실은 기계학습 및 지식처리, 데이터마이닝을 기반으로 그래프·하이퍼그래프 학습, 자기지도 표현학습, 이상탐지, 그래프 신경망, 모델 해석가능성과 효율화 기술을 연구하며, 복잡한 관계형 데이터의 구조를 정교하게 분석하고 이를 추천, 분류, 보안, 시계열 및 실세계 대규모 데이터 문제에 적용하는 인공지능 연구를 수행하고 있다.

대표 연구 분야

연구 영역 전체보기

그래프·하이퍼그래프 기반 기계학습과 데이터마이닝

자기지도 학습 기반 이상탐지와 표현학습

그래프 학습의 해석가능성·증강·효율화

주요 논문

논문 전체보기

article

hybrid

인용수 5

2024

Representative and Back-In-Time Sampling from Real-world Hypergraphs

Minyoung Choe, Jaemin Yoo, Geon Lee, Woonsung Baek, U Kang, Kijung Shin

IF 4.8

ACM Transactions on Knowledge Discovery from Data

Graphs are widely used for representing pairwise interactions in complex systems. Since such real-world graphs are large and often evergrowing, sampling subgraphs is useful for various purposes, including simulation, visualization, stream processing, representation learning, and crawling. However, many complex systems consist of group interactions (e.g., collaborations of researchers and discussions on online Q&A platforms) and thus are represented more naturally and accurately by hypergraphs than by ordinary graphs. Motivated by the prevalence of large-scale hypergraphs, we study the problem of sampling from real-world hypergraphs, aiming at answering (Q1) how can we measure the goodness of sub-hypergraphs, and (Q2) how can we efficiently find a “good” sub-hypergraph. Regarding Q1, we distinguish between two goals: (a) representative sampling , which aims at capturing the characteristics of the input hypergraph, and (b) back-in-time sampling , which aims at closely approximating a past snapshot of the input time-evolving hypergraph. To evaluate the similarity of the sampled sub-hypergraph to the target (i.e., the input hypergraph or its past snapshot), we consider 10 graph-level, hyperedge-level, and node-level statistics. Regarding Q2, we first conduct a thorough analysis of various intuitive approaches using 11 real-world hypergraphs. Then, based on this analysis, we propose MiDaS and MiDaS-B , designed for representative sampling and back-in-time sampling, respectively. Regarding representative sampling, we demonstrate through extensive experiments that MiDaS , which employs a sampling bias toward high-degree nodes in hyperedge selection, is (a) Representative : finding overall the most representative samples among 15 considered approaches, (b) Fast : several orders of magnitude faster than the strongest competitors, and (c) Automatic : automatically tuning the degree of sampling bias. Regarding back-in-time sampling, we demonstrate that MiDaS-B inherits the strengths of MiDaS despite an additional challenge—the unavailability of the target (i.e., past snapshot). It effectively handles this challenge by focusing on replicating universal evolutionary patterns, rather than directly replicating the target.

https://doi.org/10.1145/3653306

Hypergraph

Snapshot (computer storage)

Pairwise comparison

Computer science

Sampling (signal processing)

Crawling

Theoretical computer science

Data mining

Graph

Artificial intelligence

article

인용수 10

2023

Reciprocity in directed hypergraphs: measures, findings, and generators

Sunwoo Kim, Minyoung Choe, Jaemin Yoo, Kijung Shin

IF 4.3

Data Mining and Knowledge Discovery

https://doi.org/10.1007/s10618-023-00955-3

Reciprocity (cultural anthropology)

Axiom

Hypergraph

Computer science

Theoretical computer science

Graph

Directed graph

Measure (data warehouse)

Directed acyclic graph

Pairwise comparison

article

인용수 8

2022

Graph-based PU learning for binary and multiclass classification without class prior

Jaemin Yoo, Junghun Kim, Hoyoung Yoon, Geonsoo Kim, Changwon Jang, U Kang

IF 3.1

Knowledge and Information Systems

https://doi.org/10.1007/s10115-022-01702-8

Binary classification

Computer science

Artificial intelligence

Graph

Classifier (UML)

Binary number

Machine learning

Semi-supervised learning

Markov chain

Graphical model

정부 과제

과제 전체보기

2024년 5월-2025년 5월

|200,000,000원

내부직원 정보유출 위협평가를 위한 AI 적용 보안솔루션 개발

o 사람에 의한 기술유출 징후를 평가하고 사람별 위협정도를 언어형태로 레포트하는 독자적인 LLM 활용 솔루션을 개발o Pre-train된 LLM을 활용, 기술유출 평가에 특화된 독자 LLM fine-tuningo 대용량 데이터 처리 및 환각문제 해결 위한 독자적인 기술 개발o 인물별 정보유출 위협평가와 대응 및 후속조치를 포함한 직관적 레포트 자동생성

내부자위협

기술유출

정보유출

거대언어모델

악의적내부자

지능형 보안솔루션

위협평가

인물별 위협평가

2024년 3월-2029년 3월

|238,377,000원

시공간 그래프 이상탐지를 수행하는 자기지도 학습 프레임워크 개발

본 연구의 목표는 시공간 그래프가 주어졌을 때 빠른 속도로 그래프 내에 존재하는 이상 데이터를 찾고 이를 실시간으로 처리할 수 있는 새로운 이상탐지 프레임워크를 개발하는 것이다. 동시에, 이상탐지 문제에서 세계 최고 수준의 성능을 달성하는 것을 목표로 한다. 시공간 그래프 이상탐지 기법은 시·공간적 정보를 포함하는 모든 데이터에 바로 적용될 수 있으며, 다...

시공간 그래프

이상 탐지

자기지도 학습

다변량 시계열

그래프 신경망

2019년 3월-2028년 12월

|4,400,000,000원

인공지능대학원지원(한국과학기술원)

ㆍ 세계 최고 연구 2018년 NIPS와 ICML 논문수가 세계 10위인 KAIST의 AI 연구 역량을 본 사업 1단계 종료 시점인 2023년까지 세계 5위 안으로 진입시킬 것임. ㆍ 글로벌 Top 인재양성 KAIST가 배출한 세계적인 AI 연구자인 뉴욕대학교 조경현 교수 (Facebook AI Research 연구원 겸직), 김윤 SK텔레콤 AI 센터장 ...

세계 Top 5 AI대학원

아시아 AI밸리

AI 글로벌 네트워크

최신 특허

특허 전체보기

상태	출원연도	과제명	출원번호
공개	2024	하이퍼그래프 신경망을 위한 생성형 자기지도학습 방법 및 시스템	1020240074251
등록	2021	정점의 특성을 예측하는 장치 및 방법	1020210172385
등록	2021	데이터 증강 장치 및 방법	1020210169909

전체 특허

하이퍼그래프 신경망을 위한 생성형 자기지도학습 방법 및 시스템

상태

공개

출원연도

2024

출원번호

1020240074251

상세 정보 바로가기

정점의 특성을 예측하는 장치 및 방법

상태

등록

출원연도

2021

출원번호

1020210172385

상세 정보 바로가기

데이터 증강 장치 및 방법

상태

등록

출원연도

2021

출원번호

1020210169909

상세 정보 바로가기