발행물
컨퍼런스
Interspeech 2022
,
CTRL: Continual representation learning to transfer information of pre-trained for wav2vec 2.0
Convolutional recurrent neural network with auxiliary stream for robust variable-length acoustic scene classification
One-shot speaker adaptation based on initialization by generative adversarial networks for TTS
FiLM conditioning with enhanced feature to the transformer-based end-to-end noisy speech recognition
Regularizing transformer-based acoustic models by penalizing attention weights for robust speech recognition