Modelling Intermolecular Structures and Defining Ambiguity in Gene Sequences using Matrix Insertion-Deletion Systems

Abstract : Gene insertion and deletion are considered as the basic operations in DNA processing and RNA editing. Based on these evolutionary transformations, a computing model has been formulated in formal language theory known as insertion-deletion systems. Recently, in [6], a new computing model named Matrix insertion-deletion system has been introduced to model various bio-molecular structures such as hairpin, stem and loop, pseudoknot, attenuator, cloverleaf, dumbbell that occur at intramolecular level. In this paper, we model some of the intermolecular structures such as double strand languages, nick languages, hybrid molecules (with R-loops), holliday structure, replication fork and linear hybridization (ligated) languages using Matrix insertion-deletion system. In [2], the ambiguity in gene sequence was defined as deriving more than one structure for a single gene sequence. Here, we propose a different view of understanding the ambiguity in gene sequences: A gene sequence is obtained by more than one way such that their intermediate sequences are different. We further classify the ambiguity into many levels based on the components axiom, string (order of deletion/insertion) and contexts (order of the used contexts). We notice that some of the inter and intramolecular structures obey the newly defined ambiguity levels.
Document type :
Book sections
Complete list of metadatas

Cited literature [10 references]  Display  Hide  Download

https://hal.inria.fr/hal-00659487
Contributor : Eric Villemonte de la Clergerie <>
Submitted on : Thursday, January 12, 2012 - 7:13:51 PM
Last modification on : Thursday, February 7, 2019 - 2:37:02 PM
Long-term archiving on : Friday, May 11, 2012 - 4:06:19 PM

File

Interplay_2011_camera_ready_co...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00659487, version 1

Collections

Citation

Lakshmanan Kuppusamy, Anand Mahendran, Éric Villemonte de la Clergerie. Modelling Intermolecular Structures and Defining Ambiguity in Gene Sequences using Matrix Insertion-Deletion Systems. Gemma Bel-Enguix, Veronica Dahl, M. Dolores Jiménez-López. Biology, Computation and Linguistics, new interdisciplinary paradigms, IOS Press, pp.71-82, 2011, Frontiers in Artificial Intelligence and Applications, 978-1-60750-761-1. ⟨hal-00659487⟩

Share

Metrics

Record views

273

Files downloads

302