발행물
컨퍼런스
Seminar | Data Science Lab
2025
,
RouteLLM: Learning to Route LLMs with Preference Data
ARES : Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs
The Hyperfitting Phenomenon:Sharpening and Stabilizing LLMs for Open-Ended Text Generation
Generation of Complex 3D Human Motion by Temporal and Spatial Composition of Diffusion Models