Saturday, June 01, 2013

A New Biological Constant?

Earlier, I gave evidence for a surprising relationship between the amount of G+C (guanine plus cytosine) in DNA and the amount of "purine loading" on the message strand in coding regions. The fact that message strands are often purine-rich is not new, of course; it's called Szybalski's Rule. What's new and unexpected is that the amount of G+C in the genome lets you predict the amount of purine loading. Also, Szybalski's rule is not always right.

Genome A+T content versus message-strand purine content (A+G) for 260 bacterial genera. Chargaff's second parity rule predicts a horizontal line at Y = 0.50. (Szybalski's rule says that all points should lie at or above 0.50.) Surprisingly, as A+T approaches 1.0, A/T approaches the Golden Ratio.
When you look at coding regions from many different bacterial species, you find that if a species has DNA with a G+C content below about 68%, it tends to have more purines than pyrimidines on the message strand (thus purine-rich mRNA). On the other hand, if an organism has extremely GC-rich DNA (G+C > 68%), a gene's message strand tends to have more pyrimidines than purines. What it means is that Szybalski's Rule is correct only for organisms with genome G+C content less than 68%. And Chargaff's second parity rule (which says that A=T an G=C even within a single strand of DNA) is flat-out wrong all the time, except at the 68% G+C point, where Chargaff is right now and then by chance.

Since the last time I wrote on this subject, I've had the chance to look at more than 1,000 additional genomes. What I've found is that the relationship between purine loading and G+C content applies not only to bacteria (and archaea) and eukaryotes, but to mitochondrial DNA, chloroplast DNA, and virus genomes (plant, animal, phage), as well.

The accompanying graphs tell the story, but I should explain a change in the way these graphs are prepared versus the graphs in my earlier posts. Earlier, I plotted G+C along the X-axis and purine/pyrmidine ratio on the Y-axis. I now plot A+T on the X-axis instead of G+C, in order to convert an inverse relationship to a direct relationship. Also, I now plot A+G (purines, as a mole fraction) on the Y-axis. Thus, X- and Y-axes are now both expressed in mole fractions, hence both are normalized to the unit interval (i.e., all values range from 0..1).

The graph above shows the relationship between genome A+T content and purine content of message strands in genomes for 260 bacterial genera. The straight line is regression-fitted to minimize the sum of squared absolute error. (Software by The line conforms to:

y = a + bx
a =  0.45544384965539358
b =  0.14454244707261443

The line predicts that if a genome were to consist entirely of G+C (guanine and cytosine), it would be 45.54% guanine, whereas if (in some mythical creature) the genome were to consist entirely of A+T (adenine and thymine), adenine would comprise 59.99% of the DNA. Interestingly, the 95% confidence interval permits a value of 0.61803 at X = 1.0, which would mean that as guanine and cytosine diminish to zero, A/T approaches the Golden Ratio.

Do the most primitive bacteria (Archaea) also obey this relationship? Yes, they do. In preparing the graph below, I analyzed codon usage in 122 Archaeal genera to obtain A, G, T,  and C relative proportions in coding regions of genes. As you can see, the same basic relationship exists between purine content and A+T in Archaea as in Eubacteria. Regression analysis yielded a line with a slope of 0.16911 and a vertical offset 0.45865. So again, it's possible (or maybe it's just a very strange coincidence) that A/T approaches the Golden Ratio as A+T approaches unity.

Analysis of coding regions in 122 Archaea reveals that the same relationship exists between A+T content and purine mole-fraction (A+G) as exists in eubacteria.
For the graph below, I analyzed 114 eukaryotic genomes (everything from fungi and protists to insects, fish, worms, flowering and non-flowering plants, mosses, algae, and sundry warm- and cold-blooded animals). The slope of the generated regression line is 0.11567 and the vertical offset is 0.46116.

Eukaryotic organisms (N=114).

Mitochondria and chloroplasts (see the two graphs below) show a good bit more scatter in the data, but regression analysis still comes back with positive slopes (0.06702 and .13188, respectively) for the line of least squared absolute error.

Mitochondrial DNA (N=203).
Chloroplast DNA (N=227).
To see if this same fundamental relationship might hold even for viral genetic material, I looked at codon usage in 229 varieties of bacteriophage and 536 plant and animal viruses ranging in size from 3Kb to over 200 kilobases. Interestingly enough, the relationship between A+T and message-strand purine loading does indeed apply to viruses, despite the absence of dedicated protein-making machinery in a virion.

Plant and animal viruses (N=536).
Bacteriophage (N=229).
For the 536 plant and animal viruses (above left), the regression line has a slope of 0.23707 and meets the Y-axis at 0.62337 when X = 1.0. For bacteriophage (above right), the line's slope is 0.13733 and the vertical offset is 0.46395. (When inspecting the graphs, take note that the vertical-axis scaling is not the same for each graph. Hence the slopes are deceptive.) The Y-intercept at X = 1.0 is 0.60128. So again, it's possible A/T approaches the golden ratio as A+T approaches 100%.

The fact that viral nucleic acids follow the same purine trajectories as their hosts perhaps shouldn't come as a surprise, because viral genetic material is (in general) highly adapted to host machinery. Purine loading appropriate to the A+T milieu is just another adaptation.

It's striking that so many genomes, from so many diverse organisms (eubacteria, archaea, eukaryotes, viruses, bacteriophages, plus organelles), follow the same basic law of approximately

A+G = 0.46 + 0.14 * (A+T)

The above law is as universal a law of biology as I've ever seen. The only question is what to call the slope term. It's clearly a biological constant of considerable significance. Its physical interpretation is clear: It's the rate at which purines are accumulated in mRNA as genome A+T content increases. It says that a 1% increase in A+T content (or a 1% decrease in genome  G+C content) is worth a 0.14% increase in purine content in message strands. Maybe it should be called the purine rise rate? The purine amelioration rate?

Biologists, please feel free to get in touch to discuss. I'm interested in hearing your ideas. Reach out to me on LinkedIn, or simply leave a comment below.


  1. You're fitting noise.

    1. The development of artificial intelligence (AI) has propelled more programming architects, information scientists, and different experts to investigate the plausibility of a vocation in machine learning. Notwithstanding, a few newcomers will in general spotlight a lot on hypothesis and insufficient on commonsense application. machine learning projects for final year In case you will succeed, you have to begin building machine learning projects in the near future.

      Projects assist you with improving your applied ML skills rapidly while allowing you to investigate an intriguing point. Furthermore, you can include projects into your portfolio, making it simpler to get a vocation, discover cool profession openings, and Final Year Project Centers in Chennai even arrange a more significant compensation.

      Data analytics is the study of dissecting crude data so as to make decisions about that data. Data analytics advances and procedures are generally utilized in business ventures to empower associations to settle on progressively Python Training in Chennai educated business choices. In the present worldwide commercial center, it isn't sufficient to assemble data and do the math; you should realize how to apply that data to genuine situations such that will affect conduct. In the program you will initially gain proficiency with the specialized skills, including R and Python dialects most usually utilized in data analytics programming and usage; Python Training in Chennai at that point center around the commonsense application, in view of genuine business issues in a scope of industry segments, for example, wellbeing, promoting and account.

      The Nodejs Projects Angular Training covers a wide range of topics including Components, Angular Directives, Angular Services, Pipes, security fundamentals, Routing, and Angular programmability. The new Angular TRaining will lay the foundation you need to specialise in Single Page Application developer. Angular Training

  2. You're double-blind.

  3. Kas,

    No comment on the two posts above, but:

    a = 0.45544384965539358
    b = 0.14454244707261443


    I don't think there's any truth anywhere beyond the third or fourth significant digit. That looks like "a little under half" and "about a seventh" to me.


  4. Anonymous10:30 AM

    If you think it might clarify presentation of the relationships you have found, please consider posting graphs that display the 95% confidence levels.

    James Phillips

  5. Good suggestion by anon. I can see the point you're making. Phi is an incredible ratio found all over in nature. Why not DNA? It's curious, indeed.

  6. This golden ratio is interesting because I read on the internet the other day that after 31,500 generations of bacterial cell division in a petri dish bacteria mutate
    to be able to process a new chemical.The 31,500 is a figure only known to plus or minus 500 generations so it could be 31,400 which is 10,000 times pi.Since dna helices rotate as you go go along them we should not be surprised by the association of a circle ratio with this phenomenon.

    1. Geometry could be the key to mutation.

  7. this is great and awesome article.will share this valuable doc via my gb whatsapp account.

  8. interesting article, got many new things.

  9. Thanks for sharing this post, it was great reading this article! would like to know more! keep in touch and stay connected! Also Check here
    gbwhatsapp apk

    Vidmate App

    Vidmate Apk

    Vidmate For Pc

    Vidmate For IOS

    cheapest smm panel

  10. Your article is very useful to me, thank you very much for your sorting out and sharing.Vidmate app is a powerful aggregated audio-video player & live broadcast software, you can watch a large number of videos from over 1000+ sites in overseas countries. Download here! Vidmate apk

  11. Nice blog, thank you so much for sharing such an amazing blog with us. Visit for the best Website Designing and SEO Services in Delhi, India.
    Website Designing Company in India

  12. This is a nice blog to watch out for and we provided information on what is machine learning language? make sure you can check it out and keep on visiting and please share our blog.

  13. Looking for PUBG Names? Here is a huge collection of all types of Best & Unique PUBG names like stylish pubg names, cool pubg names etc.

  14. Thanks for this amazing blog, visit Mutualfundwala for the best Mutual Fund Agent and Mutual Fund Distributor.
    Mutual Fund Agent


    tvtap app for android

  16. For more tutorial videos, download Likee app.


  17. Tutu Helper is the one of the best ios,android App store to get the tons of free app

    and game. Here the latest version of TutuApp of free.
    Tutu Helper Apk
    Tutu App
    TutuApp Apk iOS

  18. Great blog, thank you so much for sharing with us. Get a custom mobile app development services at Appslure WebSolution by the professional ios app designers and developers and also get e-commerce app Services
    App development company in mumbai

  19. This is a nice blog to watch out and we provided informative articles on headphones, click here to reach our website.Make sure you can check it out and keep on visiting our blog.

  20. You must try this new version of Yowhatsapp through this site

  21. I really need this information, can I share it on my profile? Will I write the coppy source and author below. Thank word counter tool

  22. Very informative put up! This submit gives sincerely exceptional statistics. I locate that this put up is simply first rate. Thanks for this explanation and very nice information.

  23. Thanks for giving me this information. It is very helpfull for me.
    tutuapp ios
    Thanks you so much.
    whatsapp web

  24. Hi! This is my first comment here so I just wanted to give a quick shout out and say I genuinely enjoy reading your blog posts. Can you recommend any other Beauty Write For Us blogs that go over the same topics? Thanks a ton!

  25. Obrigado alguém por compartilhar isso por ser tão útil para mim e para todos.
    whatsapp gb 2020

  26. I got this amazing card check out spotify codes

  27. This article is really helpful for people who like to research like me. I will share it on WhatsApp Aero so my group friends can view it. People can also join our research team

  28. If you are at your office and want to order the food, you can check and view all restaurants menu price list on your desk and get easily the food that you are craving for.

    So here are the details of all the food items from starter to desserts you wanted to check out and compare the easy and tasty food by your own at any place click here

  29. Thanks for the interesting content. I like your post and your blog is amazing.
    If you are interested in Video Downloader apps you can check my blog site. It is new and really informative.

    VidMate 2011

  30. Thats great post !! I like ur every post they always give me some new knowledge.

    VidMate | VidMate for PC |
    VidMate 2014

  31. Thats great post !! I like ur every post they always give me some new knowledge.
    VidMate 2017 | VidMate Online | VidMate download 2018


Add a comment. Registration required because trolls.