Understanding Molecular Biology Part I – DNA Structure and Organization

I usually write about recent scientific publications that I find interesting and important, mostly in the general area of biology. My goal is to promote scientific literacy and an awareness of the many remarkable advances in all areas of science. I try to provide some background context for the published study and then summarize the salient findings. As I’ve written these blogs over the last few years, I’ve realized that my readers may lack the foundational knowledge that underpins much of the biological literature. To further my goal of educating the public, I’ve decided to also write posts that focus on teaching concepts and principles. I’ll still write about current publications but will periodically post installments of my “Understanding XXX” series. Each series will address a specific area or topic in biology and will be a mini-course on that subject. My first series is Understanding Molecular Biology. I hope you enjoy these blogs and find them useful learning tools. If there are topics you’d like me to address, please let me know and I’ll add them to my list.

All life on Earth uses DNA as the genetic material, with the exception of some viruses that use a related molecule called RNA, which we will discuss in a future blog. DNA stands for DeoxyriboNucleic Acid and is a long polymer chain made up of four different subunits (abbreviated A, C, G, and T) called deoxyribonucleotides, or deoxynucleotides for short (Fig. 1).

**Fig. 1. The DNA polymer.** Shown is a short piece of linear, single-stranded DNA. DNA is a polymer composed of four deoxyribonucleotides (abbreviated A, C, G, and T) that are linked together to form an unbroken chain. The chain in the figure is 16 nucleotides long, but the chains can be enormous. For example, the largest single piece of DNA in the human genome is around 250 million deoxyribonucleotides long. Figure created with BioRender.com.

The deoxynucleotides are the basic building blocks of DNA, and each deoxynucleotide is in turn composed of three components: a phosphate group, a sugar molecule (called deoxyribose), and a molecule called a base (Fig. 2).

**Fig. 2. The Structure of Deoxyribonucleotides.** All deoxyribonucleotides are composed of three parts as indicated: a phosphate group and a deoxyribose sugar, which are the same in every deoxyribonucleotide, and one of the four bases (adenine, guanine, cytosine, or thymine). Shown on the left is the structure of the adenine-containing deoxyribonucleotide (A). To the right are the other three bases found in DNA that would replace adenine to form their respective deoxyribonucleotides. Adenine and guanine have similar structures and are called purines, while cytosine and thymine are related to each other and are called pyrimidines. Figure created with BioRender.com.

There are just 4 bases in DNA, 2 purines (adenine [A] and guanine [G]) and 2 pyrimidines (cytosine [C] and thymine [T]). Just like individual letters of the alphabet are the building blocks of words, in the language of DNA the 4 bases (A, G, C, and T) are the letters of the genetic alphabet.
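To make the alphabet analogy concrete, here is a minimal Python sketch that treats a DNA strand as a string of letters and sorts each letter into its chemical family. The example sequence and the names `PURINES`, `PYRIMIDINES`, and `classify_bases` are made up for illustration only.

```python
# The four DNA bases, grouped by chemical family as described in the text.
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_bases(sequence):
    """Count how many purines and pyrimidines a DNA sequence contains."""
    counts = {"purine": 0, "pyrimidine": 0}
    for base in sequence.upper():
        if base in PURINES:
            counts["purine"] += 1
        elif base in PYRIMIDINES:
            counts["pyrimidine"] += 1
        else:
            # Anything other than A, G, C, or T is not a letter of the DNA alphabet.
            raise ValueError(f"Not a DNA base: {base!r}")
    return counts


# A made-up 16-letter sequence (hypothetical, not taken from a real genome).
example = "ATGCCGTAAGCTTGCA"
print(len(example), classify_bases(example))
```

Representing DNA as a plain string is a simplification, but it is how sequences are usually written down and shared.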

With the exception of some DNA viruses that have single-stranded genomes, the DNA of all life forms is a double-stranded molecule that forms the canonical double helix (Fig. 3A).

**Fig. 3. DNA Structures. A.** Double-stranded DNA in its helical form. The red and blue vertical rectangles represent the paired deoxyribonucleotides. **B.** A short, double-stranded DNA shown in linear form with the paired deoxyribonucleotides indicated. A can only pair with T, and C can only pair with G. Because of this inherent pairing property, the opposite strands of a double-stranded DNA are always complementary to each other. **C.** A depiction of chromatin. The blue discs are complexes of histone proteins. The double-stranded DNA (black line) wraps around the discs to compact the DNA. Each disc with its associated DNA is called a nucleosome. **D.** A depiction of a chromosome pair. The chromatin is highly compacted to form the chromosomes that are visible by microscopy within our cells. The position where the two paired chromosomes attach is called the centromere. Figure created with BioRender.com.

The two strands are held together by hydrogen bonds between the bases of deoxyribonucleotides on opposite strands. Importantly, the two strands of the double helix have complementary sequences: an A base always pairs with a T base, and a C base always pairs with a G base (Fig. 3B). This complementary pairing means that either strand can serve as a template for replicating the other strand. The complementarity of base pairing is also critical for the production of messenger RNA (mRNA), which will be described in a future blog.
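The pairing rule is simple enough to write as a few lines of Python. The sketch below is only an illustration: the names `PAIRS`, `complement`, and `is_complementary` are my own, the sequence is made up, and the strands are compared position by position as drawn in Fig. 3B, ignoring details such as strand direction.

```python
# Pairing rules from the text: A pairs with T, and C pairs with G.
PAIRS = {"A": "T", "T": "A", "C": "G", "G": "C"}


def complement(strand):
    """Return the partner base for every position of the given strand."""
    return "".join(PAIRS[base] for base in strand)


def is_complementary(strand_a, strand_b):
    """Check that two equal-length strands pair correctly at every position."""
    return len(strand_a) == len(strand_b) and complement(strand_a) == strand_b


top = "ATGCCGTA"          # hypothetical example sequence
bottom = complement(top)  # the only sequence that can pair with it
print(top)
print(bottom)

# Either strand is enough to rebuild the other, which is why each one
# can act as a template when the DNA is copied.
assert complement(bottom) == top
```

Because the lookup works in both directions, knowing one strand always tells you the other.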

In the cell, DNA is associated with many proteins, including a family of proteins called histones, which wrap the DNA into structures called nucleosomes (Fig. 3C); histone-bound DNA is referred to as chromatin. Wrapping and compacting the DNA using proteins is important for fitting the extremely long DNA molecule into the tiny space of a cell. In addition, the human genome is not a single piece of chromatin but instead consists of 23 pieces called chromosomes. In our somatic cells (every cell except egg and sperm cells), each chromosome exists in two copies, one from our mother and one from our father (Fig. 3D). Because these cells have 2 copies of each chromosome, their genome is called diploid. Egg and sperm cells have haploid genomes because these specialized cells contain only one copy of each chromosome. When a sperm fertilizes an egg, the resulting cell has both the maternal and paternal chromosome copies and is diploid. That fertilized egg can now grow and divide to give rise to all the parts of the body. Every time a cell divides, its entire chromosome content must be duplicated so that each of the two new daughter cells receives the complete genome. In the next blog, I’ll discuss DNA replication and its consequences.
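As a quick recap of the chromosome bookkeeping described above, here is a small Python sketch of the arithmetic. It simply counts chromosome copies using the numbers given in the text; the names `HAPLOID_SET`, `chromosome_count`, and `divide` are hypothetical, and the model is deliberately simplified.

```python
HAPLOID_SET = 23  # number of different human chromosomes, as stated in the text


def chromosome_count(copies_of_each):
    """Total chromosomes in a cell holding this many copies of each chromosome."""
    return HAPLOID_SET * copies_of_each


def divide(cell_total):
    """Duplicate the chromosome content, then share it between two daughter cells."""
    duplicated = cell_total * 2
    return duplicated // 2, duplicated // 2


egg = chromosome_count(1)    # haploid: one copy of each chromosome
sperm = chromosome_count(1)  # haploid: one copy of each chromosome
fertilized_egg = egg + sperm # diploid: maternal plus paternal copies

daughter_1, daughter_2 = divide(fertilized_egg)
print(egg, sperm, fertilized_egg, daughter_1, daughter_2)
```

Duplicating before dividing is what keeps the count constant: each daughter cell ends up with the same diploid number as the cell it came from.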
