Publications
Conference
WACV 2024
2024
Localization and Manipulation of Immoral Visual Cues for Safe Text-to-Image Generation
BEVMap: Map-Aware BEV Modeling for 3D Perception
EMNLP 2023
2023
Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models
BMVC 2023
Distillation for High-Quality Knowledge Extraction via Explainable Oracle Approach
ICCV 2023
The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion