2025 Neural Information Processing Systems
AVCD: Mitigating Hallucinations in Audio-Visual Large Language Models through Contrastive Decoding
2025
The Thirty-Ninth Annual Conference on Neural Information Processing Systems
Model-Guided Dual-Role Alignment for High-Fidelity Open-Domain Video-to-Audio Generation
2025
The Thirty-Ninth Annual Conference on Neural Information Processing Systems
Video Diffusion Models Excel at Tracking Similar-Looking Objects Without Supervision
2025
Findings of the Association for Computational Linguistics: EMNLP 2025
Dub-S2ST: Textless Speech-to-Speech Translation for Seamless Dubbing
2025
ACM International Conference on Multimedia (MM) 2025
AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation
2025