주요 논문
3
*2026년 기준 최근 6년 이내 논문에 한해 Impact Factor가 표기됩니다.
1
article
|
인용수 16
·
2025Reasoning Abilities of Large Language Models: In-Depth Analysis on the Abstraction and Reasoning Corpus
Seungpil Lee, Woochang Sim, Donghyeon Shin, Wongyu Seo, Jiwon Park, S. C. Lee, Sanha Hwang, Sejin Kim, Sundong Kim
ACM Transactions on Intelligent Systems and Technology
The existing methods for evaluating the inference abilities of Large Language Models (LLMs) have been predominantly results-centric, making it challenging to assess the inference process comprehensively. We introduce a novel approach using the Abstraction and Reasoning Corpus (ARC) benchmark to evaluate the inference and contextual understanding abilities of LLMs in a process-centric manner, focusing on three key components from the Language of Thought Hypothesis (LoTH): Logical Coherence, Compositionality, and Productivity. Our carefully designed experiments reveal that while LLMs demonstrate some inference capabilities, they still significantly lag behind human-level reasoning in these three aspects. The main contribution of this article lies in introducing the LoTH perspective, which provides a method for evaluating the reasoning process that conventional results-oriented approaches fail to capture, thereby offering new insights into the development of human-level reasoning in artificial intelligence systems.
https://doi.org/10.1145/3712701
Computer science
Abstraction
Natural language processing
Artificial intelligence
Language model
Qualitative reasoning
2
article
|
인용수 10
·
2023Explainable Product Classification for Customs
Eunji Lee, Sihyeon Kim, Sundong Kim, Soyeon Jung, Heeja Kim, Meeyoung Cha
IF 7.2 (2023)
ACM Transactions on Intelligent Systems and Technology
The task of assigning internationally accepted commodity codes (aka HS codes) to traded goods is a critical function of customs offices. Like court decisions made by judges, this task follows the doctrine of precedent and can be nontrivial even for experienced officers. Together with the Korea Customs Service (KCS), we propose a first-ever explainable decision supporting model that suggests the most likely subheadings (i.e., the first six digits) of the HS code. The model also provides reasoning for its suggestion in the form of a document that is interpretable by customs officers. We evaluated the model using 5,000 cases that recently received a classification request. The results showed that the top-3 suggestions made by our model had an accuracy of 93.9% when classifying 925 challenging subheadings. A user study with 32 customs experts further confirmed that our algorithmic suggestions accompanied by explainable reasonings, can substantially reduce the time and effort taken by customs officers for classification reviews.
https://doi.org/10.1145/3635158
Computer science
Task (project management)
AKA
Code (set theory)
Product (mathematics)
Function (biology)
Service (business)
Doctrine
Commodity
Artificial intelligence
3
article
|
hybrid
·
인용수 8·
2022Active Learning for Human-in-the-Loop Customs Inspection
Sundong Kim, Tung-Duong Mai, Sungwon Han, Sungwon Park, Duc‐Hung Nguyen, Jaechan So, Karandeep Singh, Meeyoung Cha
IF 8.9 (2022)
IEEE Transactions on Knowledge and Data Engineering
We study the human-in-the-loop customs inspection scenario, where an AI-assisted algorithm supports customs officers by recommending a set of imported goods to be inspected. If the inspected items are fraudulent, the officers can levy extra duties. These logs are then used as additional training data for the next iterations. Choosing to inspect suspicious items first leads to an immediate gain in customs revenue, yet such inspections may not bring new insights for learning dynamic traffic patterns. On the other hand, inspecting uncertain items can help acquire new knowledge, which will be used as a supplementary training resource to update the selection systems. Based on multiyear customs datasets from three countries, we demonstrate that some degree of exploration is necessary to cope with domain shifts in the trade data. The results show that a hybrid strategy of selecting likely fraudulent and uncertain items will eventually outperform the exploitation-only strategy.
https://doi.org/10.1109/tkde.2022.3144299
Computer science
Revenue
Training set
Domain (mathematical analysis)
Set (abstract data type)
Human-in-the-loop
Machine learning
Artificial intelligence
Business
Finance