Think Just Enough: Leveraging Self-Assessed Confidence for Adaptive Reasoning in Language Models
Junyeob Kim, Sang-goo Lee, Taeuk Kim
Abstract

Recent reinforcement learning (RL)-trained language models have demonstrated strong performance on complex reasoning tasks by producing long and detailed reasoning traces. However, despite these advancements, they often struggle to find the right balance in reasoning length: some terminate prematurely before reaching a correct answer (underthinking), while others continue reasoning beyond necessity, leading to inefficiency or even degraded accuracy (overthinking). To address these challenges, we propose a method for optimizing reasoning length via self-assessed confidence. By prompting the model to evaluate its own confidence at intermediate reasoning steps, we enable dynamic stopping once sufficient reasoning is achieved. Experiments across multiple reasoning benchmarks show that our approach improves computational efficiency without compromising answer quality. Furthermore, we find that confidence estimates from RL-trained reasoning models are more reliable than those from standard LLMs, making confidence a valuable internal signal for controlling reasoning depth.
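The stopping mechanism described in the abstract can be sketched as a simple control loop: generate a reasoning step, ask the model to rate its own confidence, and halt once the rating clears a threshold. The sketch below is illustrative only — `generate_step`, `assess_confidence`, and the threshold value are assumptions, not the paper's actual implementation, and toy stand-ins replace a real language model.

```python
from typing import Callable, List, Tuple

def adaptive_reasoning(
    generate_step: Callable[[List[str]], str],
    assess_confidence: Callable[[List[str]], float],
    threshold: float = 0.75,
    max_steps: int = 8,
) -> Tuple[List[str], float]:
    """Append reasoning steps until self-assessed confidence clears the threshold.

    Hypothetical interface: both callables would wrap prompts to the same model
    in a real system. `max_steps` caps the trace to avoid unbounded reasoning.
    """
    trace: List[str] = []
    confidence = 0.0
    for _ in range(max_steps):
        trace.append(generate_step(trace))     # one intermediate reasoning step
        confidence = assess_confidence(trace)  # model rates its own confidence
        if confidence >= threshold:            # enough reasoning: stop early
            break
    return trace, confidence

# Toy stand-ins for a language model (purely for demonstration):
steps = ["identify knowns", "set up equation", "solve", "verify"]
gen = lambda trace: steps[len(trace)]
conf = lambda trace: 0.25 * len(trace)  # confidence grows with each step

trace, c = adaptive_reasoning(gen, conf, threshold=0.75)
# Stops after 3 of 4 possible steps, once confidence reaches 0.75.
```

With a real model, `assess_confidence` might parse a prompted self-rating or derive a score from token probabilities; the loop itself is unchanged either way.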

Keywords
Language model, Language understanding, Natural language, Non-monotonic logic, Automated reasoning
Type
article
IF / Citations
- / 0
Publication Year
2026