您的瀏覽器不支援JavaScript語法,網站的部份功能在JavaScript沒有啟用的狀態下無法正常使用。

中央研究院 資訊科學研究所

活動訊息

友善列印

列印可使用瀏覽器提供的(Ctrl+P)功能

學術演講

:::

Segmental recurrent neural networks

  • 講者Lingpeng Kong 先生 (Carnegie Mellon University)
    邀請人:古倫維
  • 時間2016-11-18 (Fri.) 10:30 ~ 12:30
  • 地點資訊所新館106演講廳
摘要

We introduce segmental recurrent neural networks (SRNNs) which define, given an input sequence, a joint probability distribution over segmentations of the input and labelings of the segments. Representations of the input segments (i.e., contiguous subsequences of the input) are computed by encoding their constituent tokens using bidirectional recurrent neural nets, and these “segment embeddings” are used to define compatibility scores with output labels. These local compatibility scores are integrated using a global semi-Markov conditional random field. Both fully supervised training—in which segment boundaries and labels are observed—as well as partially supervised training—in which segment boundaries are latent—are straightforward. Experiments on handwriting recognition and joint Chinese word segmentation/POS tagging show that, compared to models that do not explicitly represent segments such as BIO tagging schemes and connectionist temporal classification (CTC), SRNNs obtain substantially higher accuracies. We also performed experiments on the TIMIT dataset. where we achieved 17.3% phone error rate (PER) from the first-pass decoding — the best reported result using CRFs, despite the fact that we only used a zeroth-order CRF and without using any language model.

BIO

Lingpeng Kong is a Ph.D. candidate in School of Computer Science, Carnegie Mellon University, co-advised by Prof. Noah Smith and Prof. Chris Dyer. His main research interests are in designing algorithms to tackle the core problems in natural language processing (NLP). His work utilizes methods from machine learning, optimization and combinatorial algorithms with applications related to syntactic parsing, machine translation, and social media. Prior to CMU, he worked in IBM China Systems and Technology Lab.