Publications

All papers

388

11

Differentiable Duration Refinement Using Internal Division for Non-Autoregressive Text-to-Speech
장준혁
IEEE SIGNAL PROCESSING LETTERS, 202411

12

Efficient Lightweight Speaker Verification With Broadcasting CNN-Transformer and Knowledge Distillation Training of Self-Attention Maps
장준혁
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 202409

13

Enhancing multimodal emotion recognition through ASR error compensation and LLM fine-tuning
장준혁
Interspeech, 202409

14

Neural ATSM: Fully neural network-based adaptive time-scale modification using sentence-specific dynamic control
장준혁
Interspeech, 202409

15

Whisper multilingual downstream task tuning using task vectors
장준혁
Interspeech, 202409

16

Online subloop search via uncertainty quantization for efficient test-time adaptation
장준혁
Interspeech, 202409

17

Few-shot keyword-incremental learning with total calibration
장준혁
Interspeech, 202409

18

Efficient speaker embedding extraction using a twofold sliding window algorithm for speaker diarization
장준혁
Interspeech, 202409

19

Balanced-Wav2Vec: Enhancing stability and robustness of representation learning through sample reweighting techniques
장준혁
Interspeech, 202409

20

H4C-TTS: Leveraging multi-modal historical context for conversational text-to-speech
장준혁
Interspeech, 202409