Coding Region Prediction Based on a Universal DNA Sequence Representation Method - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue Journal of Computational Biology Année : 2008

Coding Region Prediction Based on a Universal DNA Sequence Representation Method

Résumé

Graphical representation of DNA sequences provides a simple and intuitive way of viewing, anchoring, and comparing various gene structures, so a simple and non-degenerate method is attractive to both biologists and computational biologists. In this study, a universal graphical representation method for DNA sequences based on S.S.-T. Yau's method is presented. The method adopts a trigonometric function to represent the four nucleotides A, G, C, and T. Some interesting characteristics of the universal representation are introduced. We exploit frequency analysis with our representation method on DNA sequences, demonstrating possible applications in coding region prediction, and sequence analysis. Based on the statistically experimental results from this frequency analysis, a simple coding region predictor and an optimized one are presented. An experiment on the broadly accepted ROSETTA data set demonstrates that the performance of the optimized predictor is comparable to that of other popular methods.
Fichier non déposé

Dates et versions

inria-00347594 , version 1 (16-12-2008)

Identifiants

Citer

Dominique Lavenier, Xianyang Jiang, Stephen Yau. Coding Region Prediction Based on a Universal DNA Sequence Representation Method. Journal of Computational Biology, 2008, 15 (10), pp.1237-1256. ⟨10.1089/cmb.2008.0041⟩. ⟨inria-00347594⟩
103 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More