발행물
컨퍼런스
Seminar | Data Science Lab
2025
,
Transformers without Normalization
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
MixLLM: Dynamic Routing in Mixed Large Language Models