Empilo: Realizing Immersive Mobile 3D Video Conferencing through Parameterized Communication | 이경한 교수 연구실 | 서울대학교 전기·정보공학부

이경한 교수 연구실

서비스 플랜

연구실 검색

프로젝트 공고

정부 과제 추천

AI 기반 기업 서칭

홈

기본 정보

연구 분야

프로젝트

발행물

구성원

article|

gold

·인용수 0

·2025

Empilo: Realizing Immersive Mobile 3D Video Conferencing through Parameterized Communication

Donggyu Yang, Wooseung Nam, Bong-Soo Kang, Kyunghan Lee

초록

In this work, we explore a new communication paradigm for immersive 3D video conferencing, termed parameterized communication, which dramatically reduces bandwidth usage by eliminating the need to exchange excessive volumetric data. Instead, this approach extracts a compact set of informative parameters representing key elements in the 3D space, transmits only these parameters, and reconstructs the scene on the receiving end. Translating this concept into practice, we present Empilo, a mobile 3D conferencing system composed of a face parameter extractor and a neural rendering-based scene generator. However, while neural rendering excels at synthesizing arbitrary views of objects without explicit 3D models, its heavy computational demands present a major obstacle for mobile deployment. To overcome this challenge, we propose a novel technique called truncated ray marching, which significantly reduces computational overhead by replacing iterative MLP inferences with a single-pass of a shallow neural network. Furthermore, to ensure a consistently immersive experience, we structure the neural-free lightweight renderer as a decoupled component, dedicated to delivering rapid responsiveness to dynamic viewpoint changes. These breakthroughs on computation together enable Empilo to rely entirely on mobile resources, achieving real-time performance with a frame generation time of 30.3 ms and a re-rendering latency of just 6.6 ms—all while operating at an exceptionally low bitrate of 24 kbps. Our approach provides valuable guidance for the practical deployment of 3D conferencing, envisioning accessibility on par with platforms like FaceTime and Zoom.

키워드

Rendering (computer graphics)Mobile deviceParameterized complexityComputationVideophoneTeleconferenceSet (abstract data type)Latency (audio)Key (lock)

타입

article

IF / 인용수

- / 0

원문

https://doi.org/10.1145/3711875.3729140

게재 연도

2025