ISB Home



- Article -





Volume 6


Full article

In Silico Biology 6, 0019 (2006); ©2006, Bioinformation Systems e.V.  



Are categorical periodograms and indicator sequences of genomes spectrally equivalent?

Achuthsankar S. Nair1 and T. Mahalakshmi2

Department of Computer Science, University of Kerala, India - 695 581
1 present address: Centre for Bioinformatics, University of Kerala, Thiruvananthapuram, India 695 581
   sankar.achuth@gmail.com
2 present address: Sree Narayana Institute of Technology, Kollam, India 691 010 and National Institute of Computer Technology, Kollam, India 691 001
   mlakshmi@sancaharnet.in

EEdited by H. Michael; received November 17, 2005; revised and accepted March 22, 2006; published May 20, 2006


Abstract

This paper reports a novel symbol-to-signal mapping for DNA sequences, based on the concept of categorical periodograms. A categorical periodogram is a numeric sequence with the n-th element of the sequence indicating the number of occurrences of cycles with period n in it. The period of the cycle is defined as the number of intervening events plus one. Spectral analysis studies have been conducted on Cumulative Categorical Periodogram (CCP) of 10 genes from the data set of Burset and Guigo. It is observed that the spectral signatures in CCP are functionally equivalent to the established N/3 peak in the spectrum of indicator sequences of genomes. Being a single sequence compared to four sequences in the case of indicator sequence representation, the method is claimed to be functionally equivalent, but computationally better for identification of gene coding regions in sequences.


Keywords: digital signature, categorical periodogram, cumulative categorical periodogram, mapping, indicator sequences, genomic signal processing