발행물
컨퍼런스
CUPUM 2025
,
CityAPIBench: A Benchmark of Domain-Adapted Large Language Models for Urban Big Data Query
NAACL 2025
SweEval: Do LLMs Really Swear? A Safety Benchmark for Testing Limits for Enterprise Use
PAKDD 2025
2013.06
DialRet: Enhancing Dialogue Retention for Multi-Session Conversations
Dynamic Spuriosity Bias Harmonizer: Combatting Spurious Features for Domain Generalization
A Unified Detector for Both Adversarial Attacks and Out-of-distribution Samples based on Kernel Path Distribution