Wolfram Library Archive


Courseware Demos MathSource Technical Notes
All Collections Articles Books Conference Proceedings
Title

Asymptotic Number of Hairpins of Saturated RNA Secondary Structures
Authors

Peter Clote
Organization: Boston College
Department: Department of Biology
Evangelos Kranakis
Organization: Carleton University
Department: School of Computer Science
Danny Krizanc
Organization: Wesleyan University
Department: Department of Mathematics and Computer Science
Journal / Anthology

Bulletin of Mathematical Biology
Year: 2013
Volume: 75
Issue: 12
Page range: 2410-2430
Description

In the absence of chaperone molecules, RNA folding is believed to depend on the distribution of kinetic traps in the energy landscape of all secondary structures. Kinetic traps in the Nussinov energy model are precisely those secondary structures that are saturated, meaning that no base pair can be added without introducing either a pseudoknot or base triple. In this paper, we compute the asymptotic expected number of hairpins in saturated structures. For instance, if every hairpin is required to contain at least θ = 3 unpaired bases and the probability that any two positions can base-pair is p = 3/8, then the asymptotic number of saturated structures is 1.34685 n −3/2 1.62178n, and the asymptotic expected number of hairpins follows a normal distribution with mean 0.06695640 n + 0.01909350 √ n N. Similar results are given for values θ = 1, 3, and p = 1, 1/2, 3/8; for instance, when θ = 1 and p = 1, the asymptotic expected number of hairpins in saturated secondary structures is 0.123194 n, a value greater than the asymptotic expected number 0.105573 n of hairpins over all secondary structures. Since RNA binding targets are often found in hairpin regions, it follows that saturated structures present potentially more binding targets than nonsaturated structures, on average. Next, we describe a novel algorithm to compute the hairpin profile of a given RNA sequence: given RNA sequence a1, . . . , an, for each integer k, we compute that secondary structure Sk having minimum energy in the Nussinov energy model, taken over all secondary structures having k hairpins. We expect that an extension of our algorithm to the Turner energy model may provide more accurate structure prediction for particular RNAs, such as tRNAs and purine riboswitches, known to have a particular number of hairpins. Mathematica computations, C and Python source code, and additional supplementary information are available at the website http://bioinformatics.bc.edu/clotelab/RNAhairpinProfile/.
Subjects

*Applied Mathematics
*Science > Biology
Keywords

RNA secondary structure, Algebraic combinatorics