논문 | 이재윤 교수 연구실 | 서울대학교 데이터사이언스학과

이재윤 교수 연구실

홈

기본 정보

연구 분야

프로젝트

논문

구성원

논문

연구 성과 추이

표시된 성과는 수집된 데이터 기준으로 산출되며, 일부 차이가 있을 수 있습니다.

5개년 연도별 논문 게재 수

13총합

5개년 연도별 피인용 수

40총합

주요 논문

*2026년 기준 최근 6년 이내 논문에 한해 Impact Factor가 표기됩니다.

article

인용수 8

2024

RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation

Kiseung Kim, Jay-Yoon Lee

검색 증강 생성(Retrieval Augmented Generation, RAG) 프레임워크는 매개변수 지식과 외부 지식을 결합하여 오픈 도메인 질의응답(opendomain question answering) 과제에서 최첨단 성능을 보이는 방식을 활용한다. 그러나 RAG 프레임워크는 질의에 무관련 컨텍스트가 함께 제공될 때 성능 저하가 발생한다. 본 연구에서는 기존 리랭커(reranker)들이 제공하던 컨텍스트 간 상대적 관련성뿐 아니라, 주어진 컨텍스트가 해당 질문에 답하는 데 유용한지 분류하는 데 활용할 수 있는 신뢰도(confidence)를 제공하는 관련성 추정기(relevance estimator, RE)를 도입한 RE-RAG 프레임워크를 제안한다. 우리는 정답 컨텍스트에 대한 레이블 없이도 질의-답변(question-answer) 데이터만을 단순히 활용하여 RE를 훈련하기 위한 약지도(weakly supervised) 방법을 제안한다. 소형 생성기(small language model; sLM)로 학습된 RE는 RE와 함께 미세조정된 sLM의 성능을 향상시킬 뿐만 아니라, 이전에 참조되지 않았던 대규모 언어 모델(LLMs)의 성능도 향상시킬 수 있음을 보인다. 또한 우리는 RE가 측정한 신뢰도를 활용하는 새로운 디코딩 전략을 조사한다. 예를 들어, 검색된 컨텍스트를 바탕으로 해당 질문에 답하는 것이 "불가능(unanswerable)"하다고 사용자에게 알리도록 선택하거나, 무관련 컨텍스트에 의존하기보다는 LLM의 매개변수 지식에 의존하도록 선택하는 방법 등이 있다.

https://doi.org/10.18653/v1/2024.emnlp-main.1236

Interpretability

Computer science

Relevance (law)

Domain (mathematical analysis)

Estimator

Information retrieval

Relevance feedback

Artificial intelligence

Natural language processing

Mathematics

article

인용수 3

2024

Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval

Jonghyun Song, Cheyon Jin, Wenlong Zhao, Andrew McCallum, Jay-Yoon Lee

일반적인 검색-재랭킹 패러다임은 빠른 바이인코더(BE)로 방대한 집합에서 관련 후보를 검색한 뒤, 비용이 크지만 정확한 크로스인코더(CE)를 제한된 후보 집합에 적용하는 방식으로 이루어진다. 그러나 이와 같은 작은 부분 집합에 의존할 경우 바이인코더로부터의 오류 전파에 취약해지며, 이는 전체 성능을 제한한다. 이러한 문제를 해결하기 위해 우리는 Comparing Multiple Candidates(CMC) 프레임워크를 제안한다. CMC는 쿼리와 유사한 후보의 다중 임베딩(즉, 이웃)을 얕은 self-attention 계층을 통해 비교하여, 서로 간에 맥락화된 풍부한 표현을 제공한다. 또한 CMC는 다수의 비교를 동시에 처리할 수 있을 만큼 확장 가능하다. 예를 들어 CMC로 10K 후보를 비교하는 데 걸리는 시간은 CE로 16개 후보를 비교하는 것과 유사하다. ZeSHEL 데이터셋에서의 실험 결과, BE와 CE 사이에 CMC를 매끄러운 중간 재랭커(BE-CMC-CE)로 삽입하면, 단지 바이인코더만 사용하는 경우(BE-CE)에 비해 recall@k가 효과적으로 향상됨을 보이며(R@16에서 +6.7%-p, R@64에서 +3.5%-p), 지연은 미미한 수준(<7%)이다. 또한 상위 1단 정확도를 개선하는 최종 단계 재랭커로서의 CMC의 효과를 검증하기 위해, 엔티티, 패시지, 대화 랭킹과 같은 다운스트림 태스크에서 실험을 수행한다. 그 결과, CMC는 단지 더 빠를 뿐만 아니라(11배) 종종 크로스인코더보다 더 효과적이며, 예측 정확도 향상으로 이어짐을 확인했다. 구체적으로 위키피디아 엔티티 링크에서는 +0.7%-p, DSTC7 대화 랭킹에서는 +3.3%-p의 개선이 나타났다.

https://doi.org/10.18653/v1/2024.emnlp-main.1242

Computer science

Information retrieval

preprint

인용수 0

2024

Locate&Edit: Energy-based Text Editing for Efficient, Flexible, and Faithful Controlled Text Generation

Hye Ryung Son, Jay-Yoon Lee

arXiv (Cornell University)

최근의 제어된 텍스트 생성(CTG) 접근법은 대개 디코딩 시점에서 기본 언어 모델(LM)의 가중치 또는 로짓(logits)을 조작하는 방법을 포함한다. 그러나 이러한 방법들은 최신의 블랙박스 LMs에는 적용할 수 없으며, 기본 LM이 원래 생성한 결과의 핵심 의미를 보존하는 데에도 비효율적이다. 본 연구에서는 블랙박스가 아닌 텍스트 생성 접근인 CTG를 위한 효율적이고 유연한 에너지 기반 접근법인 Locate&Edit(L&E)를 제안한다. 이는 시판(off-the-shelf) 에너지 모델을 사용하여 기본 LM의 텍스트 출력을 편집한다. 기본 LM으로부터 텍스트 출력이 주어지면, L&E는 먼저 에너지 모델을 활용해 제약(예: 독성)과 가장 관련 있는 구간(span)을 위치(Locate)시키고, 이어서 이러한 구간을 더 적절한 대안으로 대체하여 편집(Edit)한다. 중요하게도, 본 방법은 텍스트 출력만 필요하므로 블랙박스 LMs와 호환 가능하다. 또한 L&E는 구성 요소 모델에 대해 특정 아키텍처를 요구하지 않으므로, 다양한 조합의 이용 가능한 시판 에너지 모델과 함께 동작할 수 있다. 더 나아가 L&E는 제약과 관련된 양상만을 선택적으로 수정하고 나머지는 변경하지 않음으로써, 기본 LM의 원래 생성 결과를 보존한다. 이러한 표적 편집은 또한 L&E가 효율적으로 동작하도록 보장한다. 우리의 실험 결과는 L&E가 기본 LM 생성 결과의 의미 보존과 속도에서 우수함을 달성하는 한편, 제약 충족에서도 경쟁력 있거나 향상된 성능을 동시에 얻음을 확인하였다. 뿐만 아니라, 에너지 분포의 과립성(granularity)이 CTG 성능에 미치는 영향을 분석한 결과, 기존의 이진 분류기 기반 에너지 모델에 비해 미세한(granular) 회귀(regression) 기반 에너지 모델이 제약 충족을 향상시키는 것으로 나타났다.

http://arxiv.org/abs/2407.00740

Text generation

Computer science

Natural language processing

Information retrieval

preprint

인용수 1

2024

Case-Based Reasoning Approach for Solving Financial Question Answering

Yi Kyung Kim, Jay-Yoon Lee

arXiv (Cornell University)

기계가 인간 언어를 이해하는 정도를 측정하는 일은 종종 그 추론 능력, 즉 질문에 대한 답을 도출하기 위한 논리적 과정의 평가를 포함한다. 최근의 언어 모델들은 텍스트 기반 과제에서 놀라운 성능을 보여주었으나, 텍스트, 표, 수치와 같은 이질적인 정보를 포함하는 복잡한 추론 문제에서의 효율성은 여전히 불확실하다. 이러한 공백을 메우기 위해 FinQA는 금융 문서를 위한 수치 추론 데이터셋을 도입하는 동시에 프로그램 생성(program generation) 접근법을 제안하였다. 본 연구는 오류의 절반(48%)이 생성되는 연산의 부정확성에서 비롯됨을 확인하였다. 이 문제를 해결하기 위해, 우리는 인공지능 패러다임인 사례 기반 추론(case based reasoning, CBR)을 활용하여 수치 추론 문제를 다루는 새로운 접근법을 제안한다. CBR은 유사한 사례(즉, 유사한 질문과 그에 대응하는 논리 프로그램)를 제공함으로써 문제 해결에 대한 지침을 제공한다. 우리의 모델은 주어진 질문에 대해 관련 사례를 검색한 뒤, 검색된 사례와 문맥 정보를 바탕으로 답을 생성한다. FinQA 데이터셋에 대한 실험을 통해 본 접근법의 경쟁력 있는 성능을 입증하였으며, 또한 사례 저장소를 확장함으로써 FinQA가 취약점을 보였던 복잡한 다단계 프로그램의 해결을 돕는다는 점을 추가로 보여주었다.

http://arxiv.org/abs/2405.13044

Question answering

Computer science

Artificial intelligence

preprint

인용수 1

2024

RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation

Kiseung Kim, Jay-Yoon Lee

arXiv (Cornell University)

검색 증강 생성(Retrieval Augmented Generation, RAG) 프레임워크는 매개변수 지식(parametric knowledge)과 외부 지식을 결합하여 개방형 도메인 질의응답(open-domain question answering) 과제에서 최신 수준의 성능을 보이는 것을 입증한다. 그러나 RAG 프레임워크는 질의가 비관련 문맥과 함께 제공될 때 성능이 저하되는 문제를 겪는다. 본 연구에서는 기존의 재순위화기(rerankers)가 제공하던 문맥 간 상대적 관련성뿐만 아니라, 주어진 문맥이 주어진 질문에 대한 답변에 유용한지를 분류하는 데 활용할 수 있는 신뢰도(confidence)까지 함께 제공하는 관련성 추정기(relevance estimator, RE)를 도입한 RE-RAG 프레임워크를 제안한다. 우리는 정답 문맥에 대한 어떠한 라벨도 없이 질의-답변 데이터만을 사용하여 RE를 학습하기 위한 약지도 학습(weakly supervised) 방법을 제안한다. 또한 소형 생성기(small generator, sLM)로 학습한 RE는 RE와 함께 미세조정된 sLM의 성능을 향상시킬 뿐만 아니라, 이전에 참조되지 않았던 대형 언어 모델(large language models, LLMs)의 성능도 향상시킬 수 있음을 보인다. 더 나아가, 검색된 문맥을 바탕으로 질문에 답할 수 없음을 사용자에게 알리도록 선택하는 것, 또는 비관련 문맥 대신 LLM의 매개변수 지식(parametric knowledge)에 의존하도록 선택하는 것과 같이, RE가 측정한 신뢰도를 활용하는 새로운 디코딩 전략을 탐구한다.

http://arxiv.org/abs/2406.05794

Interpretability

Relevance (law)

Estimator

Computer science

Domain (mathematical analysis)

Information retrieval

Artificial intelligence

Mathematics

Statistics

Political science

전체 논문

article

인용수 8

2024

RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation

Kiseung Kim, Jay-Yoon Lee

https://doi.org/10.18653/v1/2024.emnlp-main.1236

Interpretability

Computer science

Relevance (law)

Domain (mathematical analysis)

Estimator

Information retrieval

Relevance feedback

Artificial intelligence

Natural language processing

Mathematics

article

인용수 3

2024

Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval

Jonghyun Song, Cheyon Jin, Wenlong Zhao, Andrew McCallum, Jay-Yoon Lee

https://doi.org/10.18653/v1/2024.emnlp-main.1242

Computer science

Information retrieval

preprint

인용수 0

2024

Locate&Edit: Energy-based Text Editing for Efficient, Flexible, and Faithful Controlled Text Generation

Hye Ryung Son, Jay-Yoon Lee

arXiv (Cornell University)

http://arxiv.org/abs/2407.00740

Text generation

Computer science

Natural language processing

Information retrieval

preprint

인용수 1

2024

Case-Based Reasoning Approach for Solving Financial Question Answering

Yi Kyung Kim, Jay-Yoon Lee

arXiv (Cornell University)

http://arxiv.org/abs/2405.13044

Question answering

Computer science

Artificial intelligence

preprint

인용수 1

2024

RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation

Kiseung Kim, Jay-Yoon Lee

arXiv (Cornell University)

http://arxiv.org/abs/2406.05794

Interpretability

Relevance (law)

Estimator

Computer science

Domain (mathematical analysis)

Information retrieval

Artificial intelligence

Mathematics

Statistics

Political science

article

인용수 1

2025

A survey on large language models in biology and chemistry

Islambek Ashyrmamatov, Su Ji Gwak, Su-Young Jin, Ikhyeong Jun, Umit Volkan Ucak, Jay-Yoon Lee, Juyong Lee

IF 12.9 (2025)

Experimental & Molecular Medicine

인공지능(AI)은 복잡한 생물학적 시스템에 적합한 확장 가능한 계산 프레임워크를 제공함으로써 생의학 연구를 재편하고 있다. 이 혁명의 핵심에는 대형 언어 모델을 포함한 생체/화학 언어 모델이 있으며, 이들은 분자 구조를 고급 계산 기법에 적합한 ‘언어’의 한 형태로 재개념화하고 있다. 본 연구에서는 생물학과 화학에서 이러한 모델이 수행하는 역할을 비판적으로 고찰하고, 분자 표현에서 분자 생성 및 최적화로의 진화 과정을 추적한다. 본 총설은 생물학적 거대분자와 소분자 유기 화합물 모두에 대한 주요 분자 표현 전략을 다루며, 단백질 및 뉴클레오타이드 서열부터 단일세포 데이터, 문자열 기반 화학 포맷, 그래프 기반 인코딩, 3차원 포인트 클라우드에 이르기까지 각 접근법의 상대적 장점과 AI 응용에서의 내재적 한계를 조명한다. 또한 논의에서는 트랜스포머 계열 인코더의 양방향 인코더 표현, 생성형 사전학습 트랜스포머 계열 디코더, 인코더-디코더 트랜스포머와 같은 핵심 모델 아키텍처와 함께, 자기지도 학습, 멀티태스크 학습, 검색 증강 생성(retrieval-augmented generation) 등 정교한 사전학습 전략을 함께 탐색한다. 단백질 구조 및 기능 예측, de novo 단백질 설계, 게놈 분석, 분자 특성 예측, de novo 분자 설계, 반응 예측 및 회고적 합성(retrosynthesis) 등 주요 생의학 응용은 대표 연구와 부상하는 경향을 통해 고찰된다. 마지막으로 본 총설은 에이전틱(agentic) 및 상호작용형(interactive) AI 시스템의 부상하는 지형을 고려하면서, 생의학에서 AI의 미래 궤적을 좌우할 핵심 기술적, 윤리적 및 규제적 고려사항을 함께 다루는 가운데 과학적 발견을 자동화하고 가속할 잠재력을 간략히 제시한다.

https://doi.org/10.1038/s12276-025-01583-1

Representation (politics)

Key (lock)

Function (biology)

Generative grammar

Scalability

Biomedicine

Tracing

Computational model

article

인용수 6

2024

Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation

Jiachen Zhao, Wenlong Zhao, Andrew Drozdov, Benjamin Rozonoyer, Md Arafat Sultan, Jay-Yoon Lee, Mohit Iyyer, Andrew McCallum

Jiachen Zhao, Wenlong Zhao, Andrew Drozdov, Benjamin Rozonoyer, Md Arafat Sultan, Jay-Yoon Lee, Mohit Iyyer, Andrew McCallum. 연례 회의 제62회(Association for Computational Linguistics) 회의록(제1권: 장편 논문). 2024.

https://doi.org/10.18653/v1/2024.acl-long.766

Distillation

Computer science

Sequence (biology)

Language model

Natural language processing

Artificial intelligence

Process engineering

Engineering

Chemistry

Chromatography

preprint

인용수 0

2024

An Analysis under a Unified Fomulation of Learning Algorithms with Output Constraints

Mooho Song, Jay-Yoon Lee

arXiv (Cornell University)

신경망(NN)은 다양한 작업에서 우수한 성능을 보이지만, 때때로 인간에게는 무의미한 결과를 생성하기도 한다. 대부분의 NN 모델은 주로 (입력, 출력) 쌍으로부터 “오로지” 학습하며, 경우에 따라 인간의 지식과 충돌한다. 다수의 연구는 훈련 중 출력 제약을 완화하여 인간의 지식을 주입하면 모델의 성능을 향상시키고 제약 위반을 줄일 수 있음을 보여준다. 동일한 프로그래밍 프레임워크 하에서 서로 다른 기존 알고리즘을 비교하려는 시도들이 여러 차례 있었으나, 출력 제약을 갖는 학습 알고리즘을 통일된 방식으로 분류한 선행 연구는 없었다. 우리의 기여는 다음과 같다. (1) 우리는 선행 연구를 세 가지 축—사용된 제약 손실의 유형(예: 확률적 소프트 로직, REINFORCE), 제약 위반 예시의 탐색 전략, 주 과제와 제약으로부터의 학습 신호를 통합하는 메커니즘—에 따라 분류한다. (2) 우리는 지속학습(continual-learning) 알고리즘에서 영감을 받아, 주 과제 및 제약 주입의 정보를 통합하는 새로운 알고리즘을 제안한다. (3) 또한 주 과제의 지표와 제약 위반을 동시에 고려하기 위한 지표로

H β

-score를 제안한다. 철저한 분석을 제공하기 위해, 우리는 세 가지 NLP 과제인 자연어 추론(NLI), 합성 전환 예시(STE), 의미 역할 표지(SRL)에서 모든 알고리즘을 검토한다. 우리는 서로 다른 알고리즘들 중 높은

H β

-score 달성과 관련된 핵심 요인들을 탐색하고 규명한다.

http://arxiv.org/abs/2406.01647

Computer science

Algorithm

Artificial intelligence

article

인용수 0

2024

An Analysis under a Unified Formulation of Learning Algorithms with Output Constraints

Mooho Song, Jay-Yoon Lee

http://doi.org/10.18653/v1/2024.acl-srw.41

Computer science

Algorithm

Algorithm design

Artificial intelligence

Mathematical optimization

Mathematics

article

인용수 0

2023

Machine Reading Comprehension using Case-based Reasoning

Dung Thai, D. P. Agarwal, Mudit Chaudhary, Wenlong Zhao, Raj Das, Jay-Yoon Lee, Hannaneh Hajishirzi, Manzil Zaheer, Andrew McCallum

Dung Thai, Dhruv Agarwal, Mudit Chaudhary, Wenlong Zhao, Rajarshi Das, Jay-Yoon Lee, Hannaneh Hajishirzi, Manzil Zaheer, Andrew McCallum. 자연어처리학회(Association for Computational Linguistics)의 연구 결과: EMNLP 2023. 2023.

http://dx.doi.org/10.18653/v1/2023.findings-emnlp.564

Comprehension

Reading (process)

Computer science

Association (psychology)

Natural language processing

Artificial intelligence

Computational linguistics

Linguistics

Reading comprehension

Cognitive science

프로젝트 공고 서비스 문의 자주 묻는 질문 이용약관 개인정보처리방침

주식회사 디써클

대표 장재우,이윤구서울특별시 강남구 역삼로 169, 명우빌딩 2층 (TIPS타운 S2)대표 전화 0507-1312-6417이메일 info@rndcircle.io사업자등록번호 458-87-03380호스팅제공자 구글 클라우드 플랫폼(GCP)

전체 논문

article

인용수 8

2024

RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation

Kiseung Kim, Jay-Yoon Lee

https://doi.org/10.18653/v1/2024.emnlp-main.1236

Interpretability

Computer science

Relevance (law)

Domain (mathematical analysis)

Estimator

Information retrieval

Relevance feedback

Artificial intelligence

Natural language processing

Mathematics

article

인용수 3

2024

Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval

Jonghyun Song, Cheyon Jin, Wenlong Zhao, Andrew McCallum, Jay-Yoon Lee

https://doi.org/10.18653/v1/2024.emnlp-main.1242

Computer science

Information retrieval

preprint

인용수 0

2024

Locate&Edit: Energy-based Text Editing for Efficient, Flexible, and Faithful Controlled Text Generation

Hye Ryung Son, Jay-Yoon Lee

arXiv (Cornell University)

http://arxiv.org/abs/2407.00740

Text generation

Computer science

Natural language processing

Information retrieval

preprint

인용수 1

2024

Case-Based Reasoning Approach for Solving Financial Question Answering

Yi Kyung Kim, Jay-Yoon Lee

arXiv (Cornell University)

http://arxiv.org/abs/2405.13044

Question answering

Computer science

Artificial intelligence

preprint

인용수 1

2024

RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation

Kiseung Kim, Jay-Yoon Lee

arXiv (Cornell University)

http://arxiv.org/abs/2406.05794

Interpretability

Relevance (law)

Estimator

Computer science

Domain (mathematical analysis)

Information retrieval

Artificial intelligence

Mathematics

Statistics

Political science

article

인용수 1

2025

A survey on large language models in biology and chemistry

Islambek Ashyrmamatov, Su Ji Gwak, Su-Young Jin, Ikhyeong Jun, Umit Volkan Ucak, Jay-Yoon Lee, Juyong Lee

IF 12.9 (2025)

Experimental & Molecular Medicine

https://doi.org/10.1038/s12276-025-01583-1

Representation (politics)

Key (lock)

Function (biology)

Generative grammar

Scalability

Biomedicine

Tracing

Computational model

article

인용수 6

2024

Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation

Jiachen Zhao, Wenlong Zhao, Andrew Drozdov, Benjamin Rozonoyer, Md Arafat Sultan, Jay-Yoon Lee, Mohit Iyyer, Andrew McCallum

https://doi.org/10.18653/v1/2024.acl-long.766

Distillation

Computer science

Sequence (biology)

Language model

Natural language processing

Artificial intelligence

Process engineering

Engineering

Chemistry

Chromatography

preprint

인용수 0

2024

An Analysis under a Unified Fomulation of Learning Algorithms with Output Constraints

Mooho Song, Jay-Yoon Lee

arXiv (Cornell University)

H β

H β

-score 달성과 관련된 핵심 요인들을 탐색하고 규명한다.

http://arxiv.org/abs/2406.01647

Computer science

Algorithm

Artificial intelligence

article

인용수 0

2024

An Analysis under a Unified Formulation of Learning Algorithms with Output Constraints

Mooho Song, Jay-Yoon Lee

http://doi.org/10.18653/v1/2024.acl-srw.41

Computer science

Algorithm

Algorithm design

Artificial intelligence

Mathematical optimization

Mathematics

article

인용수 0

2023

Machine Reading Comprehension using Case-based Reasoning

Dung Thai, D. P. Agarwal, Mudit Chaudhary, Wenlong Zhao, Raj Das, Jay-Yoon Lee, Hannaneh Hajishirzi, Manzil Zaheer, Andrew McCallum

http://dx.doi.org/10.18653/v1/2023.findings-emnlp.564

Comprehension

Reading (process)

Computer science

Association (psychology)

Natural language processing

Artificial intelligence

Computational linguistics

Linguistics

Reading comprehension

Cognitive science

주요 논문

*2026년 기준 최근 6년 이내 논문에 한해 Impact Factor가 표기됩니다.

article

인용수 8

2024

RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation

Kiseung Kim, Jay-Yoon Lee

https://doi.org/10.18653/v1/2024.emnlp-main.1236

Interpretability

Computer science

Relevance (law)

Domain (mathematical analysis)

Estimator

Information retrieval

Relevance feedback

Artificial intelligence

Natural language processing

Mathematics

article

인용수 3

2024

Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval

Jonghyun Song, Cheyon Jin, Wenlong Zhao, Andrew McCallum, Jay-Yoon Lee

https://doi.org/10.18653/v1/2024.emnlp-main.1242

Computer science

Information retrieval

preprint

인용수 0

2024

Locate&Edit: Energy-based Text Editing for Efficient, Flexible, and Faithful Controlled Text Generation

Hye Ryung Son, Jay-Yoon Lee

arXiv (Cornell University)

http://arxiv.org/abs/2407.00740

Text generation

Computer science

Natural language processing

Information retrieval

preprint

인용수 1

2024

Case-Based Reasoning Approach for Solving Financial Question Answering

Yi Kyung Kim, Jay-Yoon Lee

arXiv (Cornell University)

http://arxiv.org/abs/2405.13044

Question answering

Computer science

Artificial intelligence

preprint

인용수 1

2024

RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation

Kiseung Kim, Jay-Yoon Lee

arXiv (Cornell University)

http://arxiv.org/abs/2406.05794

Interpretability

Relevance (law)

Estimator

Computer science

Domain (mathematical analysis)

Information retrieval

Artificial intelligence

Mathematics

Statistics

Political science