In this talk, I will discuss four key issues in explicit discourse relation labelling for Chinese text: (1) discourse connective extraction, (2) linking ambiguity resolution, (3) relation type disambiguation, and (4) argument boundary identification. In a pipelined Chinese discourse parser, we identify connective candidates by string matching, filter out non-discourse usages with a binary classifier, resolve linking ambiguities among connective components by ranking, disambiguate relation types with a multiway classifier, and determine argument boundaries with conditional random fields. Experiments on the Chinese Discourse Treebank show that the pipelined parser achieves F1 scores of 0.7506, 0.7693, 0.7458, and 0.3134 for discourse usage disambiguation, linking disambiguation, relation type disambiguation, and argument boundary identification, respectively.
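As a rough illustration of the first pipeline stage only, identifying connective candidates by string matching, a minimal sketch might look like the following. The tiny lexicon, the function name `find_candidates`, and the example sentence are assumptions made for illustration; they are not taken from the talk or from the Chinese Discourse Treebank, and the later stages (binary classifier, ranking, multiway classifier, conditional random fields) are not shown.

```python
# Hypothetical mini-lexicon of connective strings (illustrative only);
# a real system would presumably use a much larger lexicon.
CONNECTIVES = ["因為", "所以", "雖然", "但是", "而"]


def find_candidates(sentence, lexicon=CONNECTIVES):
    """Return sorted (start, end, connective) spans for every lexicon hit."""
    candidates = []
    for conn in lexicon:
        start = sentence.find(conn)
        while start != -1:
            candidates.append((start, start + len(conn), conn))
            # Continue searching after the current match position.
            start = sentence.find(conn, start + 1)
    return sorted(candidates)


# Example: "Because it rained, the match was cancelled."
sentence = "因為下雨,所以比賽取消。"
candidates = find_candidates(sentence)
```

Every hit is only a candidate: string matching cannot tell discourse usages from non-discourse usages, nor decide whether 因為 and 所以 here form one paired connective, which is why the abstract's classifier and ranking stages follow.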
Hsin-Hsi Chen is a professor in the Department of Computer Science and Information Engineering, National Taiwan University. His research interests are computational linguistics, Chinese language processing, information retrieval and extraction, and web mining. He served as President (2007-2009) and Advisory Board Chair (2009-2011) of the Association for Computational Linguistics and Chinese Language Processing (ACLCLP), and as Chair (2011-2013) of ACL SIGHAN. He was also an Editorial Board member (2000-2008) of the International Journal of Computational Linguistics and Chinese Language Processing, an Associate Editor (2007-2010) of ACM Transactions on Asian Language Information Processing, and an Editor (2010-) of the Journal of Information Science and Engineering. He was system demonstration co-chair of ACL-IJCNLP 2015, conference chair of IJCNLP 2013, and program co-chair of ACM SIGIR 2010 and ICADL 2012. He has also served as a steering committee member of AIRS; a senior PC member of ACM SIGIR 2006, 2007, 2008, and 2009; an area/track chair of ACL 2012, ACL-IJCNLP 2009, and ACM CIKM 2008; and a PC member of many conferences (IJCAI, SIGIR, AIRS, ACL, COLING, EMNLP, NAACL, EACL, IJCNLP, WWW, and so on). He won Google research awards in 2007 and 2012, Microsoft Research Asia awards in 2008 and 2009, the NTU EECS Academic Award in 2011, and the NTU Award for Outstanding Service in 2011.