TelAgentBench: A Multi-faceted Benchmark for Evaluating LLM-based Agents in Telecommunications
Sunwoo Lee, Daseong Jang, Dhammiko Arya, Gyoung-eun Han, Injee Song, SaeRom Kim, Sangjin Kim, Seojin Lee, Seokyoung Hong, Sereimony Sek, Seung-Mo Cho, Sohee Park, Sungbin Yoon, Wonbeom Jang, Eric Davis
Sunwoo Lee, Daseong Jang, Dhammiko Arya, Gyoung-eun Han, Injee Song, SaeRom Kim, Sangjin Kim, Seojin Lee, Seokyoung Hong, Sereimony Sek, Seung-Mo Cho, Sohee Park, Sungbin Yoon, Wonbeom Jang, Eric Davis. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track. 2025.
https://doi.org/10.18653/v1/2025.emnlp-industry.83
Benchmark (computing)
Empirical research
Telecommunications network
Natural language
Telecommunications service