기본 정보
연구 분야
프로젝트
논문
구성원
preprint|
green
·인용수 1
·2025
Gesture-Aware Zero-Shot Speech Recognition for Patients with Language Disorders
Seungbae Kim, Daeun Lee, Brielle C. Stark, Jinyoung Han
ArXiv.org
초록

Individuals with language disorders often face significant communication challenges due to their limited language processing and comprehension abilities, which also affect their interactions with voice-assisted systems that mostly rely on Automatic Speech Recognition (ASR). Despite advancements in ASR that address disfluencies, there has been little attention on integrating non-verbal communication methods, such as gestures, which individuals with language disorders substantially rely on to supplement their communication. Recognizing the need to interpret the latent meanings of visual information not captured by speech alone, we propose a gesture-aware ASR system utilizing a multimodal large language model with zero-shot learning for individuals with speech impairments. Our experiment results and analyses show that including gesture information significantly enhances semantic understanding. This study can help develop effective communication technologies, specifically designed to meet the unique needs of individuals with language impairments.

키워드
GestureZero (linguistics)Shot (pellet)Computer scienceSpeech recognitionGesture recognitionNatural language processingLinguisticsArtificial intelligence
타입
preprint
IF / 인용수
- / 1
게재 연도
2025

주식회사 디써클

대표 장재우,이윤구서울특별시 강남구 역삼로 169, 명우빌딩 2층 (TIPS타운 S2)대표 전화 0507-1312-6417이메일 info@rndcircle.io사업자등록번호 458-87-03380호스팅제공자 구글 클라우드 플랫폼(GCP)

© 2026 RnDcircle. All Rights Reserved.