學術演講

TIGP (SNHCC) -- Deep Learning-based Speech Assessment Metrics and its Applications

講者李安德博士 (中央研究院資訊科技創新研究中心)
邀請人：TIGP (SNHCC)
時間2023-12-04 (Mon.) 14:00 ~ 16:00
地點資訊所新館106演講廳

摘要

Most conventional speech assessment metrics require a golden clean reference to calculate the evaluation score. Such a scenario has limited applicability in real-world scenarios since clean reference is not always accessible. To address this limitation, non-intrusive speech assessment metrics have caught great attention in recent years. Recently, with the emergence of the deep learning model and the availability of training data, many studies have involved the deep learning model to deploy a non-intrusive speech assessment model. However, despite the good performance achieved by the deep learning-based speech assessment model, the generalization of the model remains a challenge. In this talk, we would like to introduce several approaches to improve the generalization of the deep learning-based speech assessment model. Additionally, we aim to introduce the direct integration between deep learning-based speech assessment models and speech enhancement systems.

BIO

Dr. Ryandhimas E. Zezario received a Ph.D. degree in Computer Science and Information Engineering from National Taiwan University in 2023. He is currently a Postdoctoral Researcher at the Research Center for Information Technology Innovation, Academia Sinica, Taipei, Taiwan. He was awarded the Gold Prize for the best non-intrusive system and 1st place for the Hearing Industry Research Consortium student prizes at the Clarity Prediction Challenge 2022. His research interests include speech enhancement, non-intrusive quality assessment, speech processing, speech/speaker recognition.

中央研究院資訊科學研究所

活動訊息

學術演講

TIGP (SNHCC) -- Deep Learning-based Speech Assessment Metrics and its Applications

摘要

BIO