Richard Tzong-Han Tsai

蔡宗翰 (cai zong han)

l   Assistant Professor, Dept. of Computer Science & Engineering, Yuan Ze Univ.

l   Director of Open Source Software Center, Yuan Ze Univ.

l   Visiting Scholar of IIS, Academia Sinica, Taiwan

 

135 Yuan-Tung Rd., Chung-Li, Taoyuan, Taiwan, R.O.C., 32003

TEL: 886-3-4638800 ext. 3004

FAX: 886-3-4638850

E-mail: thtsai@saturn.yzu.edu.tw

I am really fascinated by the application of machine learning and information extraction technology to the solution of natural language processing and text mining problems.

News!! My biomedical text mining demo page, including two functions: protein interaction article identification (IAS) and gene mention tagging (GM)

is currectly available at http://bioner.mytw.net/, welcome collaboration!

Education

Experience

Publications

VITA

Research Interests

 

Publications

PhD Dissertation

Richard Tzong-Han Tsai (2006). Biomedical Named Entity Recognition, Semantic Role Labeling, and Their Application to Question Answering. Ph.D. dissertation, 2006.
Advisor: Dr. Wen-Lian Hsu and Dr. Jieh Hsiang


Journal

 

Richard Tzong-Han Tsai, Wen-Chi Chou, Yu-Chun Lin, Ying-Shan Su, Cheng-Lung Sung, Hong-Jie Dai, Irene Tzu-Hsuan Yeh, Wei Ku, Ting-Yi Sung & Wen-Lian Hsu (2007).  BIOSMILE: A semantic role labeling system for biomedical verbs using a maximum-entropy model. To appear in BMC Bioinformatics. (SCI Impact Factor: 4.96)

Richard Tzong-Han Tsai , Cheng-Lung Sung, Hong-Jie Dai, Hsi-Chuan Hung, Ting-Yi Sung, & Wen-Lian Hsu (2006). NERBio: Using Selected Word Conjunction, Term Normalization, and Global Patterns to Improve Biomedical Named Entity Recognition. BMC Bioinformatics, 7(Suppl 5):S11. (SCI Impact Factor: 4.96)

Richard Tzong-Han Tsai , Shih-Hung Wu , Wen-Chi Chou, Yu-Chun Lin, Ding He, Jieh Hsiang, Ting-Yi Sung, & Wen-Lian Hsu (2006). Various Criteria in the Evaluation of Biomedical Named Entity Recognition. BMC Bioinformatics, 7(92). (SCI Impact Factor: 4.96) PDF

Richard Tzong-Han Tsai , Wen-Chi Chou, Shih-Hung Wu, Ting-Yi Sung, Jieh Hsiang, & Wen-Lian Hsu (2006). Integrating Linguistic Knowledge into a Conditional Random Field Framework to Identify Biomedical Named Entities. Expert Systems with Applications, 30(1), 117-128. (SCI Impact Factor: 1.247) PDF

Min-Yuh Day, Richard Tzong-Han Tsai , Cheng-Lung Sung, Chiu-Chen Hsieh, Cheng-Wei Lee, Shih-Hung Wu, Kun-Pin Wu, Chorng-Shyong Ong, & Wen-Lian Hsu (2007). Reference Metadata Extraction Using a Hierarchical Knowledge Representation Framework. Decision Support Systems, 43(1), 152-167. (SCI Impact Factor: 0.946)

Chia-Wei Wu, Richard Tzong-Han Tsai , & Wen-Lian Hsu (2005). Learning to Integrate Web Taxonomies with Fine-Grained Relations: A Case Study Using Maximum Entropy Model. LNCS, 3689, 190-205. (SCI Impact Factor=0.402, EI)

Richard Tzong-Han Tsai , Shih-Hung Wu, & Wen-Lian Hsu (2004). Mencius: A Chinese Named Entity Recognizer Based on a Maximum Entropy Framework. Computational Linguistics and Chinese Language Processing, 9(1), 65-82. PDF

Chia-Wei Wu, Richard Tzong-Han Tsai* , & Wen-Lian Hsu (2007). Web Taxonomy Integration System with Hierarchical Shrinkage and Fine-Grained Relations . Submitted to Expert Systems with Applications .

 

Shared Task

[2007]

Chosen to present the task paper “A Novel Feature Representation: Integrate GM Results into Interaction Abstract Identification” in the Second BioCreAtIvE Challenge Workshop - Critical Assessment of Information Extraction in Molecular Biology ( 國際分子生物資訊擷取競賽優勝 ) , Madrid, Spain.

1st place in the Korean-Chinese Cross Language Information Retrieval task ( 日本 NII 韓中跨語檢索競賽第一名 ) of the 6 th NTCIR workshop , Tokyo, Japan.

 

[2006]

1st place out of 13 teams in the Chinese word segmentation task of SIGHAN bakeoff 2006 (CTU corpus) (the 2 nd place is Microsoft Research Asia), Sydney (ACL 漢語計算小組中文斷詞競賽香港語料分項第一名 , 第二名為微軟亞洲研究院 )

2nd place out of 9 teams in the Chinese word segmentation task of SIGHAN bakeoff 2006 (CKIP corpus) (the 1 st place is Microsoft Research Asia) , Sydney (ACL 漢語計算小組中文斷詞競賽台灣語料分項第二名 , 第一名為微軟亞洲研究院 )

2nd place out of 8 teams in the Chinese named entity recognition task of SIGHAN bakeoff 2006 (CTU corpus), Sydney (ACL 漢語計算小組中文專有名詞辨識競賽香港語料分項第二名 )

 

[2005]

1st place out of 9 teams in the CLQA Chinese to Chinese Question-Answering task of the 5 th NTCIR workshop , Tokyo , Japan . ( 日本 NII 中文問答系統競賽第一名 )

6th place out of 32 teams in the ad-hoc retrieval task of Genomic TREC 2005, Washington DC . ( 美國國家標準局 TREC 基因文獻檢索比賽第六名 )

6th place out of 22 teams in the CoNLL-2005 shared task, Ann Arbor , Michigan . (ACL 計算語言自動學習學會語意角色標註系統競賽第六名 , 前五名為 UIUC, Stanford 等名校 )

 

Translation Book (著作或譯作書籍)
Foundations of Algorithms Using C++ Pseudocode , 3rd ed., 演算法:使用C++虛擬碼 , 碁峰出版社, ISBN: 9864215892, 蔡宗翰譯
(
本書為目前交大資工, 師大資工, 輔大圖資, 聯合科大資管, 南開科大資工等系所採用之演算法課程教科書)

Conference

[2007]

Hsi-Chuan Hung, Richard Tzong-Han Tsai , & Wen-Lian Hsu (2007). Identifying Protein Interaction Abstracts with Contextual Bag of Words . Paper presented at the AAAI-07 Student Workshop.

Hong-Jie Dai, Hsi-Chuan Hung, Richard Tzong-Han Tsai*, & Wen-Lian Hsu (2007). IASL Systems in the Gene Mention Tagging Task and Protein Interaction Article Sub-task . Paper presented at the Second BioCreAtIvE Challenge Workshop.

Richard Tzong-Han Tsai, Hong-Jie Dai, Hsi-Chuan Hung, & Wen-Lian Hsu (2007). Exploiting Unlabeled Internal Data in Conditional Random Fields to Reduce Word Segmentation Errors for Chinese Texts. Paper presented at the Interspeech-2007 Conference.

[2006]

Richard Tzong-Han Tsai , Hong-Jie Dai, Cheng-Lung Sung, Hsi-Chuan Hung, Min-Yuh Day, & Wen-Lian Hsu (2006). Chinese Word Segmentation with Minimal Linguistic Knowledge: An Improved Conditional Random Fields Coupled with Character Clustering and Automatically Discovered Template Matching. Paper presented at the IEEE IRI-06 Conference. (EI)

Richard Tzong-Han Tsai , Hsi-Chuan Hung, Cheng-Lung Sung, Hong-Jie Dai, & Wen-Lian Hsu (2006). On Closed Task of Chinese Word Segmentation: An Improved CRF Model Coupled with Character Clustering and Automatically Generated Template Matching. Paper presented at the SIGHAN-06 Workshop. PDF

Wen-Chi Chou, Richard Tzong-Han Tsai* , Ying-Shan Su, Wei Ku, Ting-Yi Sung, & Wen-Lian Hsu (2006). A Semi-Automatic Method for Annotating a Biomedical Proposition Bank. Paper presented at the ACL Workshop on Frontiers in Linguistically Annotated Corpora (LINC-06). PDF

Chia-Wei Wu, Shyh-Yi Jan, Richard Tzong-Han Tsai , & Wen-Lian Hsu (2006) . On Using Ensemble Methods for Chinese Named Entity Recognition . Paper presented at the SIGHAN-06 Workshop.

Richard Tzong-Han Tsai (2006). A Hybrid Approach to Biomedical Named Entity Recognition and Semantic Role Labeling. Paper presented at the HLT/NAACL-06 Doctoral Consortium.

Richard Tzong-Han Tsai , Wen-Chi Chou, Yu-Chun Lin, Wei Ku, Ying-Shan Su, Ting-Yi Sung, & Wen-Lian Hsu (2006). BIOSMILE: Adapting Semantic Role Labeling for Biomedical Verbs: An Exponential Model Coupled with Automatically Generated Template Features. Paper presented at the BioNLP-06 Conference, New York . Acceptance rate: 27.5%. PDF

 

[2005]

Min -Yuh Day, Richard Tzong-Han Tsai , Cheng-Lung Sung, Cheng-Wei Lee, Shih-Hung Wu, Ong, C.-S., et al. (2005). A knowledge-based approach to citation extraction. Paper presented at the IEEE IRI-05 Conference. (EI) PDF

Richard Tzong-Han Tsai , Chia-Wei Wu, Yu-Chun Lin, & Wen-Lian Hsu (2005). Exploiting full parsing information to label semantic roles using an ensemble of ME and SVM via integer linear programming. Paper presented at the CoNLL-05 Conference.PDF

Richard Tzong-Han Tsai , Shih-Hung Wu, & Wen-Lian Hsu (2005). Exploitation of Linguistic Features Using a CRF-based Biomedical Named Entity Recognizer. Paper presented at the ACL Workshop on Linking Biological Literature, Ontologies and Databases: Mining Biological Semantics (BioLINK-05), Detroit .

Richard Tzong-Han Tsai, Chia-Wei Wu, Hsi-Chuan Hung, Yu-Chun Wang, Ding He, Yi-Feng Lin, Cheng-Wei Lee, Ting-Yi Sung, and Wen-Lian Hsu. (2005). Enhance Genomic IR with Term Variation and Expansion: Experiences of the IASL Group at Genomic Track 2005. Paper presented at the TREC-05 Conference. PDF

Chia-Wei Wu, Richard Tzong-Han Tsai , & Wen-Lian Hsu (2005). Learning to Integrate Web Taxonomies with Fine-Grained Relations: A Case Study Using Maximum Entropy Model. LNCS, 3689, 190-205. Acceptance rate: 23%. (SCI, EI) PDF

 

[2004]

C.-W. Shih, Richard Tzong-Han Tsai, S.-H. Wu, C.-C. Hsieh, & W.-L. Hsu (2004). The Construction of a Chinese Named Entity Tagged Corpus: CNEC1.0. Fifteenth Conference on Computational Linguistics and Speech Processing (ROCLING XVI). PDF

Y.-F. Lin, Richard Tzong-Han Tsai, C. K.-P. Wu, T.-Y. Sung, & W.-L. Hsu (2004). A Maximum Entropy Approach to Biomedical Named Entity Recognition. 4th ACM SIGKDD Workshop on Data Mining in Bioinformatics (BioKDD-2004). Acceptance rate: 37%. PDF

 

[2003]

Richard Tzong-Han Tsai, S.-H. Wu, & W.-L. Hsu, (2003). Mencius: A Chinese Named Entity Recognizer Using Hybrid Model. Fifteenth Research on Computational Linguistics International Conference (ROCLING XV), pp. pp.193-209. PDF

S.-H. Wu, Richard Tzong-Han Tsai, and W.-L. Hsu, (2003). Domain Event Extraction and Representation with Domain Ontology. IJCAI 2003 Workshop on Information Integration on the Web. PDF

Wu, S.-H., Richard Tzong-Han Tsai, and Hsu, W.-L. (2003). Text Categorization Using Automatically Acquired Domain Ontology. the Sixth International Workshop on Information Retrieval with Asian Languages (IRAL-03), Sapporo, Japan. PDF

 

[2002]

G. Hsieh, Richard Tzong-Han Tsai, D. Wible, & W.-L. Hsu, (2002). Exploiting Knowledge Representation in an Intelligent Tutoring System for English Lexical Errors. ICCE 2002. Acceptance rate: 23%. PDF

Richard Tzong-Han Tsai (2002). A Dialogue System with Digression Handling - An Ontology-Based Approach. AAAI 2002 Doctoral Consortium. PDF

 

Honor

[2006]

Ph.D. Dissertation Award, Association for Computational Linguistics and Chinese Language Processing ( 中華民國計算語言學會博士論文獎 )

Chosen to present the thesis proposal in NAACL-06 Doctoral Consortium with Scholarship, New York ( 北美計算語言研討會菁英博士論壇獎學金 )

[2002]

Chosen to present the thesis proposal in AAAI-02 Doctoral Consortium with Scholarship, Edmonton, Canada ( 美國人工智慧研討會菁英博士生論壇獎學金 )

 ▲ TOP

Education

   Ph.D., June, 2006

National Taiwan University
Department of Computer Science and Information Engineering
Advisor: Dr. Wen-Lian Hsu and Dr. Jieh Hsiang

  

M.S., July, 1999 (GPA 4.0/4.0)

National Taiwan University
Department of Computer Science and Information Engineering
Thesis: Machine Learning Classification Techniques for Personalized Email
Advisor: Dr. Jane Yung-Jen Hsu              

B.S.,  July, 1997 (GPA 3.6/4.0)

National Taiwan University
Department of Computer Science and Information Engineering

▲ TOP

Experience

  

Aug 07-

Sep 06 – Aug 07

Dept. of Computer Science & Engineering, Yuan Ze Univ., Chung-Li, Taoyuan, Taiwan, R.O.C., Assistant Professor

Institute of Information Science , Academia Sinica, Taipei , Taiwan , R.O.C. Post-doc

Oct 99 – Aug 06

Institute of Information Science , Academia Sinica, Taipei , Taiwan , R.O.C. Research Assistant

Services

▲ TOP

Research Interests

▲ TOP

 

 Last update: Aug 29, 2007