Book sections

To detect and analyze sequence repeats whatever be their origin

Jacques Nicolas 1, *
* Corresponding author
1 Dyliss - Dynamics, Logics and Inference for biological Systems and Sequences
IRISA-D7 - GESTION DES DONNÉES ET DE LA CONNAISSANCE, Inria Rennes – Bretagne Atlantique
Abstract : The development of numerous programs for identifying mobile elements raises the question of which founding concepts their designs share. Clarifying these concepts is necessary for at least three reasons. First, the cost of designing, developing, debugging and maintaining software risks distracting biologists from their main bioanalysis tasks, which are already demanding. A few key concepts on exact repeats always underlie the search for genomic repeats, and we recall the most important ones. Throughout the chapter, we try to select practical tools that may help in designing new identification pipelines. Second, the huge increase in sequence production capacity requires the most efficient data structures and algorithms, so that tools can scale up in the face of the data deluge. This paper provides an up-to-date glimpse of the state of the art in string indexing and string matching. Third, knowledge of the architecture of mobile elements is growing, built from the literature and from the analysis of results generated by these pipelines. Beyond data management, which has led to the discovery of new families or new elements of a family, the community increasingly needs knowledge management tools to compare, validate or simply keep track of mobile element models. We end the paper with preliminary considerations on what could help such research on models in the near future.

Cited literature [45 references]

https://hal.inria.fr/hal-00730207
Contributor : Jacques Nicolas
Submitted on : Friday, September 7, 2012 - 8:17:19 PM
Last modification on : Monday, October 19, 2020 - 11:07:34 AM
Long-term archiving on : Saturday, December 8, 2012 - 3:42:25 AM

File

Chapter4.pdf
Files produced by the author(s)

Citation

Jacques Nicolas. To detect and analyze sequence repeats whatever be their origin. Yves Bigot. Mobile genetic elements : protocols and genomic applications, 859, Springer, pp.69-90, 2012, Springer Protocols, 978-1-61779-602-9. ⟨10.1007/978-1-61779-603-6⟩. ⟨hal-00730207⟩
