Previous [ 1] [ 2] [ 3] [ 4] [ 5] [ 6] [ 7] [ 8] [ 9] [ 10] [ 11]

@

Journal of Information Science and Engineering, Vol. 29 No. 4, pp. 729-742 (July 2013)


Minimum Classification Error Training of Hidden Conditional Random Fields for Speech and Speaker Recognition


WEI-TYNG HONG
Department of Communications Engineering
Yuan Ze University
Chungli, 320 Taiwan

Hidden conditional random fields (HCRFs) are derived from the theory of conditional random fields with hidden-state probabilistic framework. It directly models the conditional probability of a label sequence given observations. Compared to hidden Markov models, HCRFs provide a number of benefits in the acoustic modeling of speech signals. Prior works for training on HCRFs were accomplished with gradient descent based algorithms by conditional maximum likelihood criterion. In this paper, we extend that methodology by applying minimum classification error criterion-based training technique on HCRFs. Specifically, we adopt generalized probabilistic descent (GPD)- based training algorithm with HCRF framework to improve the discrimination capabilities of acoustic models for speech and speaker recognition. Two tasks including a speaker identification and a Mandarin continuous syllable recognition are applied to evaluate the proposed approach. We present the results on the MAT2000 database and these results confirm that the HCRF/GPD approach has good capabilities for speech recognition and speaker identification regardless of the length of the test and training speech or the presence of noise. We note that the HCRF/GPD enjoys its potential for development in acoustic modeling.

Keywords: speech recognition, speaker recognition, hidden conditional random field, discriminative training algorithm, Mandarin syllable recognition

Full Text () Retrieve PDF document (201307_08.pdf)

Received February 3, 2012; accepted June 18, 2012.
Communicated by Vincent Shin-Mu Tseng.