Stream and Query-guided Feature Aggregation for Efficient and Effective 3D Occupancy Prediction | Prof. Jinkyu Kim's Lab | Department of Computer Science, Korea University

preprint · green · Citations: 0 · 2025

Stream and Query-guided Feature Aggregation for Efficient and Effective 3D Occupancy Prediction

Seokha Moon, Janghyun Baek, G.H. Kim, Jinkyu Kim, SeonKyung Choi

ArXiv.org

Abstract

3D occupancy prediction has become a key perception task in autonomous driving, as it enables comprehensive scene understanding. Recent methods enhance this understanding by incorporating spatiotemporal information through multi-frame fusion, but they face a trade-off: dense voxel-based representations provide high accuracy at significant computational cost, whereas sparse representations improve efficiency but lose spatial detail. To mitigate this trade-off, we introduce DuOcc, which employs a dual aggregation strategy that retains dense voxel representations to preserve spatial fidelity while maintaining high efficiency. DuOcc consists of two key components: (i) Stream-based Voxel Aggregation, which recurrently accumulates voxel features over time and refines them to suppress warping-induced distortions, preserving a clear separation between occupied and free space; and (ii) Query-guided Aggregation, which compensates for the limitations of voxel accumulation by selectively injecting instance-level query features into the voxel regions occupied by dynamic objects. Experiments on the widely used Occ3D-nuScenes and SurroundOcc datasets demonstrate that DuOcc achieves state-of-the-art performance in real-time settings while reducing memory usage by over 40% compared to prior methods.
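The two aggregation steps named in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the refinement module, the warping operator, or how query features are fused, so everything below is an assumption made for illustration. In particular, the integer ego-motion shift, the fixed `momentum` blend, the additive injection, and the names `warp`, `stream_aggregate`, and `query_inject` are all hypothetical. The real modules are presumably learned networks operating on dense tensors.

```python
from typing import Dict, List, Tuple

Index = Tuple[int, int, int]
Grid = Dict[Index, List[float]]  # voxel index -> feature vector; missing keys mean zeros


def warp(prev: Grid, shift: Index, size: Index) -> Grid:
    """Move last frame's features into the current ego frame (assumed integer shift)."""
    warped: Grid = {}
    for (x, y, z), feat in prev.items():
        nx, ny, nz = x + shift[0], y + shift[1], z + shift[2]
        # Voxels that leave the grid after warping are dropped.
        if 0 <= nx < size[0] and 0 <= ny < size[1] and 0 <= nz < size[2]:
            warped[(nx, ny, nz)] = feat
    return warped


def stream_aggregate(prev: Grid, cur: Grid, shift: Index, size: Index,
                     dim: int, momentum: float = 0.5) -> Grid:
    """Recurrent update: blend warped history with the current frame's features.

    The fixed blend stands in for the paper's refinement step (an assumption).
    """
    warped = warp(prev, shift, size)
    zero = [0.0] * dim
    out: Grid = {}
    for idx in set(warped) | set(cur):
        h = warped.get(idx, zero)
        c = cur.get(idx, zero)
        out[idx] = [momentum * a + (1.0 - momentum) * b for a, b in zip(h, c)]
    return out


def query_inject(voxels: Grid, queries, dim: int, weight: float = 1.0) -> Grid:
    """Add each instance query's feature to the voxels inside its (inclusive) box."""
    out: Grid = {idx: list(f) for idx, f in voxels.items()}
    for (lo, hi), qfeat in queries:
        for x in range(lo[0], hi[0] + 1):
            for y in range(lo[1], hi[1] + 1):
                for z in range(lo[2], hi[2] + 1):
                    base = out.get((x, y, z), [0.0] * dim)
                    out[(x, y, z)] = [b + weight * q for b, q in zip(base, qfeat)]
    return out


# Tiny example on a 4 x 4 x 1 grid with 2-dimensional features.
size, dim = (4, 4, 1), 2
prev = {(0, 0, 0): [1.0, 0.0]}   # accumulated state from the previous frame
cur = {(1, 0, 0): [0.0, 1.0]}    # features observed in the current frame
state = stream_aggregate(prev, cur, shift=(1, 0, 0), size=size, dim=dim)

# One hypothetical dynamic object covering voxels (2..3, 2..3, 0).
queries = [(((2, 2, 0), (3, 3, 0)), [1.0, 1.0])]
fused = query_inject(state, queries, dim)
```

In this sketch the stream state carries history forward at the cost of one grid per frame, while the query step touches only the voxels inside object boxes, which mirrors the division of labor the abstract describes between static structure and dynamic objects.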

Keywords

Voxel, Key (lock), Pattern recognition (psychology), Task (project management), Feature (linguistics), Dual (grammatical number), Spatial analysis

Type

preprint

IF / Citations

- / 0

Source

http://arxiv.org/abs/2503.22087

Publication year

2025