中央研究院 資訊科學研究所

活動訊息

友善列印

學術演講

[資訊所/資創]前瞻科技演講系列_Deep-learning-based Speech Enhancement

  • 曹昱 博士 (中央研究院資訊科技創新研究中心)
    邀請人:鐘楷閔、楊得年、蘇黎
  • 2020-09-29 (Tue.) 10:00 – 11:30
  • 實體: 資訊所新館106演講廳
線上串流

ID:170 899 0812

Passcode:9TwsRMKNm63

Link: https://asmeet.webex.com/asmeet/j.php?MTID=m5721372e8a70d0c06c44aa40b6043d65

*此系列演講主要開放對象為本院資訊所及資創同仁

**本所具與會者參加資格之認定,為確保演講品質,必要時得將解除與會權限(現場及視訊)

摘要

goal of SE is to enhance the speech signals by reducing distortions caused by additive and convoluted noises in order to achieving improved human-human and human-machine communication efficacy. In the this talk, we will review the system architecture and fundamental theories of deep learning based SE approaches. Next, we will present more recent advances, including end-to-end and goal-driven based SE systems as well as the SE systems with improved architectures and feature extraction procedure. The reinforcement learning and generative adversarial network (GAN)-based SE methods will also be presented. Finally, we will discuss some applications based on the deep learning SE systems, including impaired speech transformation and noise reduction for assistive hearing devices.

 

BIO

Yu Tsao received the B.S. and M.S. degrees in electrical engineering from National Taiwan University, Taipei, Taiwan, in 1999 and 2001, respectively, and the Ph.D. degree in electrical and computer engineering from the Georgia Institute of Technology, Atlanta, GA, USA, in 2008. From 2009 to 2011, he was a Researcher with the National Institute of Information and Communications Technology, Tokyo, Japan, where he engaged in research and product development in automatic speech recognition for multilingual speech-to-speech translation. He is currently an Associate Research Fellow with the Research Center for Information Technology Innovation, Academia Sinica, Taipei. His research interests include speech and speaker recognition, acoustic and language modeling, audio coding, and bio-signal processing. He is currently an Associate Editor for the IEEE/ACM Transactions on Audio, Speech, and Language Processing and IEICE Transactions on Information and Systems and a Distinguished Lecturer of APSIPA. He was the recipient of the Academia Sinica Career Development Award in 2017, the National Innovation Award in 2018 and 2019, Future Tech Breakthrough Award 2019,and the Outstanding Elite Award, Chung Hwa Rotary Educational Foundation 2019–2020.