TR-IIS-04-019 PDF format
Palindrome-like Patterns in Genomes
Arthur Chun-Chieh Shih, D. T. Lee, Chi-Fang Chin, Hong-Yuan Mark Liao, and Wen-Hsiung Li
Abstract
Motivation: It has been of great
interest to know whether some simple nucleotide-level patterns or permutation
structures can give rise to genome-wide general properties. In this study, we
wish to find such patterns or structures. For this purpose, we develop a representation
method different from those used to detect long range correlations (LRCs) in
genomes. Specifically, we propose a set of crystalline-like walking spaces in
which DNA sequences can be coded into a set of 3D curves by following a set
of walking rules.
Results: Our representation method reveals that genomic DNA
sequences in both prokaryotes and eukaryotes have linear trends and discernable
orientations in the walking spaces. Some palindrome-like sequences are then
found to have the trends and orientations similar to those found in genomic
DNA sequences. After a series of verifications and simulations, we speculate
that these patterns may be derived from a very old ancestral genome and some
of the patterns might still possess certain functions in present-day genomes.
These patterns may explain the existence of LRCs in genomes and why Chargaff's
second parity rule holds approximately in many genomic regions.