Page 36 - untitled
P. 36

Bio-informatics
 Bio-informatics
                    The yearly flu epidemics often occur as a   tion integration systems for biologists. One of the
 Principal Investigators:  Wen-Lian Hsu, Jan-Ming Ho, Chun-Nan Hsu, Ming-Tat Ko, Chung-Yen
 Lin, Der-Tsai Lee, Arthur Chun-Chieh Shih and Ting-Yi Sung   result of the lack of immunity against these new   agent-based systems is to support an SNP project of
 Postdoctoral Fellows:  Han-Shen Huang, Laurent Lin, Wen-Dar Lin, and Kun-Pin Wu   strains. The World Health Organization (WHO) an-  National Genotyping Center (NGC) at IBMS. Fast-

               nually recommends three influenza strains currently   SNP allows the researchers at NGC to efficiently
 Our research can be divided into the follow-  ence on human-mouse synteny mapping by UniMark-  in worldwide circulation for vaccine manufacturers   prioritize risk-causing SNPs to conduct genotyping
 ing areas: 1. Genomics; 2. Proteomics; 3. Structural   er-synteny method, we plan to develop an efficient   to massively produce the vaccines. However, each   and enables them to successfully identify a novel

 bioinformatics; 4. Biological literature mining; and 5.   synteny mapping and annotation system for vertebrate   year Taiwan needs four million flu vaccines, import-  promoter polymorphism that contributes to Adverse
 Systems biology. Since 2003, at IIS we have started   genomes.  ed from USA or other countries. To avoid a lack of   Drug Responses (ADRs). More identification will
 recruiting Ph.D. students in Bioinformatics of the Tai-  drugs and vaccines in case of a global fl u outbreak,   be announced in the near future. One of the agent-  Research Groups
 Large-scale and large-number sequence analysis and
 wan International Graduate Program (TIGP). There   the Department of Health, Executive Yuan, has   based systems won the best paper award in a local
 its applications to phyloinformatics
 are 15 students currently. The number of applicants   planned to establish a domestic GMP plant to pro-  bioinformatics conference in 2003. An article about
 this year is more than the total number of applicants   In the post-genomic era, the number of sequenc-  duce local flu vaccines. Beginning in 2006 in col-  FastSNP will appear in the 2006 Web Server Special
 for the past three years.  es available for reconstructing the evolutionary history   laboration with IBMS we participate in a three-year   Issue of Nucleid Acid Research.
 of genes and species has increased 20-fold in the past   CDC (Center for Disease Control, Taiwan) project
 1. Genomics                                                    Development of Gene Ontology browsing/
                                                                                                                  Research Groups
 decade. But most tools available to date are limited to   for development of influenza vaccine. We plan to
                                                                maninpulating utility
 EST annotation pipeline system  processing only small data sets. Especially, the cost   study the evolutionary mechanisms of influenza
 for constructing the tree of a large group of organisms   viruses from sequence data as well as to establish   Due to the increasing requirements for func-
 Large scale DNA sequencing has become a
 is very high when using an optimal character-based   a computational model that can predict potentially   tionally annotating gene products with Gene Ontol-
 worldwide-used methodology. EST sequencing is a
 method such as maximum likelihood. Thus, to de-  emerging strains in coming epidemic seasons. One   ogy (GO), more and more pipelines are producing
 feasible way for studying functional genomics under
 velop new computational tools for building very large   of our goals is to be able to provide a quantitative   GO-related biological data. In order to facilitate bi-
 limited resources, so there are a lot of EST sequencing
 phylogenetic trees and for comparing and visualizing   model for vaccine strain selection.  ologists§ browsing and manipulation of their data,
 projects all over the world. However, the essential step
 huge data sets is urgent. In this project, we focus on         we designed a Java-based software called GOBU.
 of an EST sequencing project is the bioinformatics   Agent-based information integration for
 the study and processing of large-number sequence              The underlying architecture of GOBU enables arbi-
 analysis. A good annotation pipeline can provide biol-  bioinformatics
 alignments and their phylogenies. We plan to build a           trary data description, data integration and extend-
 ogists the most complete information about their EST
 problem-solving environment, develop effi cient algo-  More than 600 Web-based bioinformatics   able user interface, and thus is feasible for browsing
 data and help them with further studies. In the last few
 rithms for constructing large-scale phylogenetic trees,   resources, including databases, links, and analysis   annotations for various pipelines. We have applied
 years, we have in collaboration with biologists in the
 such as supertrees, and develop effective visualization   tools, are available for biological knowledge dis-  this software to our EST annotation pipeline and
 Institute of Biomedical Sciences (IBMS) of Academia
 tools for visualizing supertrees. We have developed a   covery.  But to integrate them requires tremendous   microarray experiments.
 Sinica, provided the EST annotation service systems,
 versatile alignment visualization system, SinicView   efforts.  We have been working on an intelligent
 Bio101 and Bio301, for some functional genomics                2. Proteomics
 (Sequence-aligning INnovative and Interactive Com-  agent-based approach to information integration
 projects and also developed a fast gene ontology an-
 parison VIEWer), which enables users to efficiently   and have built a tool for rapidly training intel-  Quantitative proteomics based on high-throughput
 notator for the EST annotation, called ESTFastAnno-
 compare and evaluate assorted alignment results ob-  ligent agents.  This tool provides bioinformatics   mass spectrometry data
 tator.
 tained by different tools. This work appeared in BMC   researchers a scalable, flexible and extensible solu-

 An efficient synteny mapping and annotation system   Bioinformatics 2006. More information can be found   tion to query these resources with a single uniform   Most current quantitative experiments aim to
 for vertebrate genomes  at the Computational Genomics Lab. website at http://  interface.  With this tool, users can train an agent   determine relative protein expression levels in dif-
 biocomp.iis.sinica.edu.tw/ in the Genomics Research   to learn a specialized session of Web browsing   ferent cell states or cells grown under different con-
 Synteny mapping, or detecting regions that are                 ditions. Various stable isotope labeling techniques,
 Center.       and extraction without programming. Instead, they
 orthologous between genomes, is a key step in stud-            e.g., ICAT and iTRAQ, followed by liquid chroma-
               can train intelligent agents by click-and-drag in a
 ies of comparative genomics. Synteny mapping of all   Influenza virus study and vaccine strain selection   tography-tandem mass spectrometry (LC-MS/MS)
               programming-by-example manner.  As a result, a
 model vertebrate genomes will make the global view             are frequently used to quantify protein expressions.
 In recent years, the study of molecular phylog-  large number of agents can be created to integrate
 of genome evolution clear, enhance the functional              To expedite the analysis of vast amounts of spectral
 enies of influenza viruses has become increasingly   a large number of Web sites in a matter of days to
 decoding of genomes in a comprehensive way beyond              data generated, we have developed a fully automat-
 important for epidemiologists and evolutionary biolo-  answer biologists’ queries. We have successfully
 genes so far, and serve as a foundation of comparative         ed tool for multiplexed quantitation using iTRAQ
 gists.        built and deployed many agent-based informa-
 vertebrate genomics. Based on our successful experi-



 24                                                                                                               25
   31   32   33   34   35   36   37   38   39   40   41