Skip to content

Biological Macromolecules

Summary

All living matter is built from four classes of biological macromolecules, and this chapter provides a thorough examination of each. Beginning with carbohydrates as the primary energy currency and structural material, the chapter moves through lipids (including phospholipids as membrane components), proteins (from amino acid sequence to quaternary structure and denaturation), and nucleic acids (DNA and RNA structure). Understanding macromolecular structure-function relationships is the conceptual foundation for Cell Biology, Cellular Energetics, and Molecular Biology.

Concepts Covered

This chapter covers the following 23 concepts from the learning graph:

Carbohydrates

  1. Carbohydrates
  2. Monosaccharides
  3. Disaccharides
  4. Polysaccharides
  5. Glycosidic Bonds

Lipids

  1. Lipids
  2. Fatty Acids
  3. Saturated and Unsaturated Fats
  4. Triglycerides
  5. Phospholipids
  6. Sterols and Cholesterol

Proteins

  1. Proteins
  2. Amino Acids
  3. Peptide Bonds
  4. Primary Protein Structure
  5. Secondary Protein Structure
  6. Tertiary Protein Structure
  7. Quaternary Protein Structure
  8. Protein Denaturation

Nucleic Acids

  1. Nucleic Acids
  2. Nucleotides
  3. DNA Structure
  4. RNA Structure

Prerequisites

This chapter builds on concepts from:


Gregor Welcomes You!

Gregor welcomes you Welcome to Chapter 3, investigators! If Chapter 2 gave us the chemical vocabulary of life, this chapter introduces the cast of characters — the four families of molecules that build, power, protect, and inform every living cell. By the time we finish, you will be able to look at any biological molecule and identify its class, predict its properties, and understand its role. These are the molecules the AP exam tests constantly, so every section here is high-yield. Let's investigate!

Introduction

A single human cell contains thousands of different molecular species, yet nearly all of them belong to just four chemical families: carbohydrates, lipids, proteins, and nucleic acids. Each family is defined by a characteristic set of monomers, the types of covalent bonds that link those monomers into polymers, and the functional groups that determine the family's chemical behavior. The common thread introduced in Chapter 2 — that condensation reactions build polymers and hydrolysis reactions break them — applies consistently across all four classes.

What distinguishes the macromolecules from each other is not the chemistry of polymerization but the diversity of their monomers and the information encoded in the sequence of those monomers. Twenty amino acid types can be arranged into an essentially unlimited number of protein sequences, each folding into a unique three-dimensional shape that performs a specific task. Four nucleotide types, arranged in precise order along a DNA strand, encode every heritable instruction an organism possesses. Understanding how monomer sequence determines macromolecular structure, and how structure determines function, is the central theme of this chapter and of molecular biology as a whole.


Part 1: Carbohydrates

Carbohydrates

Carbohydrates are organic molecules with the general empirical formula \(\ce{(CH2O)_n}\), meaning they contain carbon, hydrogen, and oxygen in approximately a 1:2:1 ratio. The name reflects this composition: "carbo" (carbon) + "hydrate" (water). Carbohydrates serve two primary biological roles: energy storage and structural support. Glucose is the universal cellular fuel, starch and glycogen store energy in plants and animals respectively, while cellulose and chitin provide rigid structural frameworks. A third role — cell-surface recognition — is performed by short carbohydrate chains attached to membrane proteins (glycoproteins) and lipids (glycolipids), which form the glycocalyx that coats the outside of animal cells.

Monosaccharides

Monosaccharides are the monomers of carbohydrates. They are single-sugar units that cannot be hydrolyzed into simpler carbohydrates. The most biologically important monosaccharide is glucose (\(\ce{C6H12O6}\)), a six-carbon sugar (hexose) that is the primary substrate for cellular respiration. Other important hexoses include fructose (fruit sugar, found in many fruits and in sucrose) and galactose (found in lactose, milk sugar). Ribose and deoxyribose are five-carbon sugars (pentoses) that form the backbone of RNA and DNA, respectively.

Monosaccharides exist predominantly in ring forms in aqueous solution. Glucose cyclizes from its open-chain form into a six-membered pyranose ring when the C-1 aldehyde reacts with the C-5 hydroxyl group. This cyclization creates a new asymmetric carbon at C-1, producing two stereoisomers: α-glucose (hydroxyl group on C-1 pointing down) and β-glucose (hydroxyl group on C-1 pointing up). This seemingly trivial structural difference has enormous consequences: polymers built from α-glucose are digestible (starch), while polymers built from β-glucose are indigestible by most animals (cellulose).

Disaccharides and Glycosidic Bonds

Disaccharides are formed when two monosaccharides are joined by a glycosidic bond — the condensation-reaction covalent bond between the anomeric carbon (C-1) of one sugar and a hydroxyl group on another sugar, with loss of one water molecule. The orientation of the glycosidic bond (α or β) and the specific carbons involved determine the properties of the resulting disaccharide.

Three biologically important disaccharides are:

  • Sucrose — glucose + fructose (α-1,2-glycosidic bond); table sugar; the primary transport form of carbohydrates in plants
  • Lactose — galactose + glucose (β-1,4-glycosidic bond); milk sugar; hydrolyzed by the enzyme lactase (lactase deficiency causes lactose intolerance)
  • Maltose — glucose + glucose (α-1,4-glycosidic bond); produced by starch hydrolysis during digestion and germination

Polysaccharides

Polysaccharides are polymers of hundreds to thousands of monosaccharide units linked by glycosidic bonds. The type and orientation of the glycosidic bond, and the degree of branching, determine whether the polymer is used for energy storage or structural support.

Polysaccharide Monomer Bond Branching Function Organism
Starch (amylose) α-glucose α-1,4 None Energy storage Plants
Starch (amylopectin) α-glucose α-1,4 main; α-1,6 at branches Yes (~every 30 units) Energy storage Plants
Glycogen α-glucose α-1,4 main; α-1,6 at branches Extensive (~every 10 units) Energy storage Animals, fungi
Cellulose β-glucose β-1,4 None Structural (cell walls) Plants
Chitin N-acetylglucosamine β-1,4 None Structural (exoskeletons, fungal walls) Animals, fungi

The extensive branching of glycogen (more than starch) gives it a larger number of free ends, allowing many glucose units to be released simultaneously during periods of high energy demand — a structural adaptation to the rapid metabolic needs of animal cells.

Gregor's Tip: α vs. β — A Structural Choice with Huge Consequences

Gregor tip The AP exam frequently asks why humans can digest starch but not cellulose, even though both are glucose polymers. The answer is the glycosidic bond orientation: α-1,4 bonds in starch are hydrolyzed by amylase, but humans lack the enzyme (cellulase) needed to break β-1,4 bonds in cellulose. Herbivores digest cellulose using cellulase-producing bacteria in their gut. This single structural difference — α vs. β — determines whether glucose is food or fiber.

Diagram: Carbohydrate Structures Explorer

View Fullscreen

Carbohydrate Structures Explorer MicroSim

Type: infographic sim-id: carbohydrate-structures
Library: p5.js
Status: Specified

Bloom Level: Understand (L2) Bloom Verb: explain Learning Objective: Students will explain the structural difference between α- and β-glucose, identify the glycosidic bond connecting monosaccharide monomers, and contrast the structures of starch and cellulose in terms of bond orientation and biological function.

Canvas layout: - Left panel (30%): Clickable list of molecules: α-Glucose, β-Glucose, Sucrose, Starch (short chain), Cellulose (short chain) - Center panel (50%): Structural diagram of the selected molecule, drawn at large scale using p5.js Haworth projections (ring structures) - Right panel (20%): Properties box showing type, bond type, function, and one key biological fact

Visual elements: - Haworth projection ring diagrams for each molecule with correct ring shapes (pyranose 6-membered ring for glucose, furanose 5-membered ring for fructose within sucrose) - For α-glucose: C-1 –OH highlighted in red pointing DOWN from ring plane - For β-glucose: C-1 –OH highlighted in blue pointing UP from ring plane - For disaccharides: glycosidic bond shown as a thick colored line connecting two rings, with bond type label (α-1,4, β-1,4, α-1,2) - For polysaccharides: 4-unit chain shown with consistent bond orientation; α-1,4 chain bends into a coil (starch); β-1,4 chain stays straight and linear (cellulose) - Water molecule shown departing at each glycosidic bond connection - Color scheme: carbon atoms dark gray, oxygen atoms red, hydroxyl groups green; glycosidic bond orange

Interactive controls: - Left panel: clicking any molecule name updates center and right panels - Hovering over any atom in the center panel shows atom label and brief chemical description - Toggle button "Show bond orientation labels" — highlights the anomeric C-1 position and the O atom of the glycosidic bond in both the α and β forms

Default state: α-Glucose selected

Behavior: smooth 200 ms transition when new molecule selected; all ring positions correctly drawn for each molecule

Instructional Rationale: Seeing the structural drawings of α- and β-glucose side by side, with the C-1 hydroxyl orientation explicitly highlighted, makes concrete the abstract claim that "one bond direction determines digestibility." The polymer chain view reinforces that bond orientation propagates into dramatically different 3D shapes.

Canvas size: 660 × 460 px Responsive: Must respond to window resize events


Part 2: Lipids

Lipids

Lipids are a chemically diverse group of biological molecules unified by their hydrophobicity: they are nonpolar, largely or entirely hydrocarbon compounds that do not dissolve in water but do dissolve in nonpolar organic solvents. Unlike carbohydrates, proteins, and nucleic acids, most lipids are not true polymers built from repeating monomers; instead, they are assembled from glycerol and fatty acid components by ester bonds.

Lipids serve four major biological functions:

  • Long-term energy storage — triglycerides store more than twice as much energy per gram as carbohydrates (9 kcal/g vs. 4 kcal/g) because their hydrocarbon chains are highly reduced (low oxygen content)
  • Membrane structure — phospholipids form the bilayer matrix of all biological membranes
  • Insulation and protection — subcutaneous fat insulates endotherms; lipid coatings on nerve fibers (myelin) speed signal conduction
  • Signaling — steroid hormones (derived from cholesterol) and certain lipid-derived molecules act as chemical messengers

Fatty Acids

Fatty acids are long hydrocarbon chains (typically 14–22 carbons) with a carboxyl group (\(\ce{-COOH}\)) at one end. The carboxyl group is the only polar, hydrophilic part of a fatty acid; the rest of the molecule is an entirely nonpolar hydrocarbon tail. This amphipathic character — having both polar and nonpolar regions — is exploited in the architecture of biological membranes.

Saturated and Unsaturated Fats

The critical structural variable in fatty acids is the presence or absence of double bonds in the hydrocarbon chain:

  • Saturated fatty acids contain no C=C double bonds. Every carbon is "saturated" with hydrogen atoms. The chain is straight and flexible, allowing saturated fatty acid chains to pack tightly together. At room temperature, saturated fats are solid (butter, lard, coconut oil). Palmitic acid (16:0) and stearic acid (18:0) are common examples.
  • Unsaturated fatty acids contain one or more C=C double bonds. Each double bond introduces a rigid kink (bend) in the chain. These kinks prevent tight packing, making unsaturated fats liquid at room temperature (oils). Oleic acid (18:1, one double bond — monounsaturated) and linoleic acid (18:2 — polyunsaturated) are biologically important examples.
  • Trans fatty acids (artificial, from partial hydrogenation of vegetable oils) have a double bond in the trans configuration, which straightens the chain like a saturated fat. Trans fats pack tightly, raise LDL cholesterol, and are associated with cardiovascular disease.

Membrane fluidity depends critically on fatty acid saturation. Unsaturated fatty acid tails, with their kinks, keep membrane phospholipids from packing too closely, maintaining fluidity at physiological temperatures. Organisms that live in cold environments have a higher proportion of unsaturated fatty acids in their membranes — a phenomenon called homeoviscous adaptation.

Triglycerides

A triglyceride (triacylglycerol) consists of a glycerol molecule esterified to three fatty acid chains via ester bonds (formed by condensation between the carboxyl group of each fatty acid and one of the three hydroxyl groups of glycerol). Triglycerides are entirely nonpolar and are the primary form of long-term energy storage in animals (stored in adipose tissue) and plants (stored in seeds as oils).

\[\text{Glycerol} + 3\ \text{Fatty acids} \xrightarrow{\text{3 condensations}} \text{Triglyceride} + 3\ \ce{H2O}\]

When energy is needed, lipases hydrolyze the ester bonds, releasing glycerol and fatty acids. Fatty acids are then broken down by beta-oxidation in the mitochondrial matrix, generating acetyl-CoA, NADH, and FADH₂ that feed into the citric acid cycle and electron transport chain — topics developed fully in Chapter 8.

Phospholipids

Phospholipids are the structural foundation of biological membranes. A phospholipid resembles a triglyceride, but one of the three fatty acid chains is replaced by a phosphate group linked to a small, charged organic head group (choline, ethanolamine, serine, or inositol). This creates a molecule with a distinctly amphipathic structure:

  • Hydrophilic head — the phosphate-head group; polar and charged; faces water
  • Hydrophobic tails — two fatty acid chains; nonpolar; face away from water

When placed in an aqueous environment, phospholipids spontaneously form a bilayer: two sheets of phospholipids with their tails pointing inward (toward each other) and their heads pointing outward (toward the surrounding water). This thermodynamically stable arrangement — driven by the hydrophobic effect described in Chapter 2 — forms the basis of every biological membrane.

Diagram: Cell Membrane — Phospholipid Bilayer

View Fullscreen

Cell Membrane — Fluid Mosaic Model (existing sim)

Type: diagram sim-id: cell-membrane
Library: p5.js
Status: Complete

This interactive diagram shows the full fluid mosaic model of the cell membrane, including the phospholipid bilayer, embedded integral proteins, peripheral proteins, glycolipids, glycoproteins, and cholesterol. Explore mode reveals the structure and function of each labeled component. Use this sim to see how the amphipathic phospholipid architecture creates the bilayer and how membrane proteins carry out transport and signaling.

Sterols and Cholesterol

Sterols are lipids with a characteristic four-ring hydrocarbon skeleton. Cholesterol is the most important sterol in animals and is a critical component of animal cell membranes, where it inserts between phospholipid tails. Cholesterol has a dual effect on membrane fluidity:

  • At high temperatures, cholesterol restricts phospholipid movement, reducing fluidity and preventing the membrane from becoming too loose
  • At low temperatures, cholesterol prevents phospholipid tails from packing together and crystallizing, maintaining fluidity

This fluidity buffer role makes cholesterol essential for maintaining membrane function across a range of temperatures. Beyond its membrane role, cholesterol is the precursor for the synthesis of all steroid hormones (including testosterone, estrogen, cortisol, and aldosterone) and bile salts (which emulsify fats during digestion).

Lipid Class Glycerol? Ester Bonds Polar Head? Primary Role
Triglyceride Yes 3 No Energy storage
Phospholipid Yes 2 (+ phosphate group) Yes Membrane structure
Wax No (long-chain alcohol) 1 No Waterproofing (cuticle, feathers)
Sterol (cholesterol) No (ring skeleton) None Weak (–OH) Membrane fluidity; hormone precursor

Part 3: Proteins

Amino Acids

Proteins are the most structurally and functionally diverse macromolecules in biology. They serve as enzymes, structural materials (collagen, keratin, actin, tubulin), transporters (hemoglobin, membrane channels), hormones (insulin, growth hormone), antibodies, and molecular motors (myosin, kinesin). This extraordinary functional diversity arises from the diversity of amino acids — the 20 monomers from which all proteins are built.

Every amino acid shares a common core structure: a central alpha carbon (\(C_\alpha\)) bonded to a hydrogen atom, an amino group (\(\ce{-NH2}\)), a carboxyl group (\(\ce{-COOH}\)), and a distinctive R group (side chain) that is unique to each amino acid. At cellular pH (~7), the amino group is protonated (\(\ce{-NH3+}\)) and the carboxyl group is deprotonated (\(\ce{-COO-}\)), making each amino acid a zwitterion.

The R group determines every chemical property that distinguishes one amino acid from another: size, shape, charge, polarity, and reactivity. The 20 standard amino acids are grouped by R-group properties:

Category Examples R-Group Properties Protein Location
Nonpolar, aliphatic Glycine, Alanine, Valine, Leucine, Isoleucine, Proline Hydrophobic hydrocarbon Protein interior (buried)
Aromatic Phenylalanine, Tyrosine, Tryptophan Hydrophobic; Tyr/Trp have –OH or indole (polar capacity) Interior; Tyr in active sites
Polar, uncharged Serine, Threonine, Cysteine, Methionine, Asparagine, Glutamine Polar –OH, –SH, –CONH₂ Interior and surface
Positively charged (+) Lysine, Arginine, Histidine Carry positive charge at pH 7 Protein surface; active sites
Negatively charged (−) Aspartate, Glutamate Carry negative charge at pH 7 Protein surface; active sites

Peptide Bonds

Amino acids are joined into polypeptide chains by peptide bonds — condensation reactions between the carboxyl group of one amino acid and the amino group of the next, releasing water:

\[\ce{H2N-CHR1-COOH + H2N-CHR2-COOH -> H2N-CHR1-CO-NH-CHR2-COOH + H2O}\]

The peptide bond has partial double-bond character due to electron delocalization, which makes it planar and rigid — it cannot rotate freely. This rigidity is critical for determining the allowed conformations of the polypeptide backbone. The polypeptide chain is directional: it has a free amino terminus (N-terminus) at one end and a free carboxyl terminus (C-terminus) at the other. By convention, polypeptide sequences are always written and read from N to C.

A polypeptide of \(n\) amino acids contains \(n-1\) peptide bonds and \(n-1\) water molecules were released during its synthesis. A mature protein may consist of one or multiple polypeptide chains folded into a specific three-dimensional structure.

The Four Levels of Protein Structure

Protein structure is described at four hierarchical levels, each adding a layer of organization:

Key Insight: Structure Determines Function

Gregor thinking The four levels of protein structure are not just a classification scheme — they describe how the one-dimensional information in a gene (a DNA sequence) becomes a three-dimensional molecular machine. Level 1 (primary structure) encodes all the information needed to determine levels 2, 3, and 4. When we study mutations in Chapter 14, we will see how a single amino acid change in the primary structure can alter the tertiary structure and destroy a protein's function entirely.

Primary Structure is the linear sequence of amino acids along the polypeptide chain, held together by covalent peptide bonds. Primary structure is determined entirely by the nucleotide sequence of the gene (via the central dogma: DNA → mRNA → protein). All higher levels of protein structure are ultimately determined by the primary sequence, since the R groups that interact to drive folding are specified at this level.

Secondary Structure refers to regular, repeating local conformations of the polypeptide backbone, stabilized by hydrogen bonds between the peptide bond carbonyl oxygen (C=O) and the amide hydrogen (N–H) in the backbone. The two most common secondary structures are:

  • α-helix — a right-handed coil in which each C=O forms a hydrogen bond with the N–H four residues further along the chain. The R groups point outward from the helix axis. Approximately 3.6 amino acids per turn; pitch of 0.54 nm per turn.
  • β-pleated sheet — two or more segments of the polypeptide chain lie side by side in an extended conformation, forming hydrogen bonds between C=O and N–H groups on adjacent strands. Sheets can be parallel (strands run in the same N→C direction) or antiparallel (strands alternate in direction); antiparallel sheets are more stable.

Tertiary Structure is the overall three-dimensional shape of a single polypeptide chain — how the regions of secondary structure fold onto each other. Tertiary structure is stabilized by interactions between R groups (side chains):

  • Hydrophobic interactions — nonpolar R groups cluster in the interior, away from water (the primary driving force of protein folding)
  • Hydrogen bonds between polar R groups
  • Ionic bonds (salt bridges) between oppositely charged R groups
  • Disulfide bridges (\(\ce{-S-S-}\) bonds) between cysteine residues — covalent bonds that stabilize extracellular proteins (e.g., antibodies, insulin)
  • Van der Waals forces — weak interactions that become significant when many atoms pack closely together in the protein interior

Quaternary Structure exists only in proteins composed of two or more polypeptide chains (subunits). It describes the arrangement and interactions of those subunits. The same types of noncovalent forces that stabilize tertiary structure (hydrophobic interactions, hydrogen bonds, ionic bonds) hold the subunits together. Hemoglobin is a classic example: it has quaternary structure consisting of two α subunits and two β subunits arranged in a symmetric tetramer. Many enzymes are multimeric, and quaternary structure is often essential for cooperative binding and allosteric regulation.

Diagram: Protein Structure Levels Explorer

View Fullscreen

Protein Structure Levels Explorer MicroSim

Type: infographic sim-id: protein-structure-levels
Library: p5.js
Status: Specified

Bloom Level: Understand (L2) Bloom Verb: explain Learning Objective: Students will explain the defining features and stabilizing forces at each of the four levels of protein structure, and identify the type of bonds or interactions that maintain each level.

Canvas layout: - Top row: Four labeled panels side by side: "Primary", "Secondary", "Tertiary", "Quaternary" - Each panel is clickable; clicking expands it to fill the center of the canvas with a detailed diagram - Bottom strip: "Stabilizing forces" legend showing bond/interaction types and their icons

Visual elements — Primary panel: - Linear chain of 8 labeled colored circles (amino acids) connected by lines labeled "Peptide bond" - N-terminus labeled with "H₂N–" on left, C-terminus with "–COOH" on right - Each circle a different color representing a different amino acid type (nonpolar gray, polar blue, charged red/orange)

Visual elements — Secondary panel: - Side-by-side: α-helix (coil) and β-pleated sheet, each drawn with p5.js - Hydrogen bonds shown as dashed green lines between backbone C=O and N–H - R groups shown as stubs projecting outward from the helix - Labels: "H-bonds stabilize both secondary structures"

Visual elements — Tertiary panel: - Schematic globular protein showing helices (coils) and sheets (arrows) connected by loops, folded into a compact shape - Color-coded regions of secondary structure: helices orange, sheets blue, loops gray - Callout labels with arrows pointing to: "Hydrophobic core", "Disulfide bridge (–S–S–)", "Salt bridge (+/–)", "H-bonds between R groups"

Visual elements — Quaternary panel: - Hemoglobin tetramer: two α subunits (light blue) and two β subunits (light red) arranged symmetrically - Heme groups shown as flat disk icons within each subunit - Subunit interfaces labeled "Noncovalent interactions hold subunits together" - Small oxygen molecules (O₂) shown binding to heme groups

Interactive controls: - Clicking any panel header expands that level to full-canvas view with more detail - "Back to overview" button returns to four-panel view - Hovering over any labeled element shows a tooltip with definition and biological example

Default state: four-panel overview visible

Instructional Rationale: A step-through from linear (primary) to 3D multimeric (quaternary) builds the scaffold for understanding structure–function relationships. The overview lets students see all four levels simultaneously, reinforcing the hierarchical nature of protein organization.

Canvas size: 700 × 480 px Responsive: Must respond to window resize events

Protein Denaturation

Denaturation is the disruption of a protein's three-dimensional structure without breaking the peptide bonds of the primary sequence. Because the three-dimensional shape is maintained by relatively weak noncovalent forces (hydrophobic interactions, hydrogen bonds, ionic bonds), conditions that disrupt these forces can unfold a protein while leaving its amino acid sequence intact.

Conditions that cause denaturation include:

  • Heat — increases molecular motion, disrupting hydrogen bonds and hydrophobic packing
  • Extreme pH — alters the ionization state of R groups, disrupting ionic bonds and hydrogen bonds
  • Organic solvents — compete with hydrophobic interactions
  • Reducing agents — break disulfide bridges (used when preparing proteins for gel electrophoresis)
  • Detergents — interfere with hydrophobic packing (e.g., SDS denatures proteins in SDS-PAGE)

Denaturation destroys the protein's function because function depends entirely on the three-dimensional structure. A denatured enzyme cannot bind its substrate; a denatured receptor cannot respond to its ligand. Denaturation is generally irreversible under biological conditions (cooking an egg denatures the albumin permanently), although some small proteins can renature (refold correctly) when the denaturing conditions are removed — a finding by Anfinsen that confirmed that the primary sequence alone encodes all the information needed for correct folding.

Common Mistake: Denaturation ≠ Hydrolysis

Gregor warning Students often confuse denaturation with hydrolysis. Denaturation disrupts the 3D shape by breaking noncovalent interactions between R groups — the primary sequence (peptide bonds) is unchanged. Hydrolysis breaks covalent peptide bonds and produces free amino acids. A denatured protein still has the same amino acid sequence; a hydrolyzed protein is broken into individual amino acids. On the AP exam, if a question asks what happens when you cook a protein, the answer is denaturation — not hydrolysis.


Part 4: Nucleic Acids

Nucleic Acids and Nucleotides

Nucleic acids — DNA and RNA — are the information-storage and information-transfer macromolecules of life. They are polymers of nucleotides, joined by phosphodiester bonds. Each nucleotide consists of three components:

  • A five-carbon sugar (deoxyribose in DNA, ribose in RNA)
  • A phosphate group (\(\ce{-OPO3^{2-}}\))
  • A nitrogenous base (one of five types)

The five nitrogenous bases are divided into two structural categories:

  • Purines (double-ring): Adenine (A) and Guanine (G) — found in both DNA and RNA
  • Pyrimidines (single-ring): Cytosine (C) — in both DNA and RNA; Thymine (T) — DNA only; Uracil (U) — RNA only

Nucleotides are linked when the 3′ hydroxyl group of one nucleotide's sugar forms a phosphodiester bond with the 5′ phosphate of the next nucleotide, releasing pyrophosphate (\(\ce{PPi}\)). This creates a sugar-phosphate backbone with a defined directionality: one end has a free 5′ phosphate (5′ end) and the other has a free 3′ hydroxyl (3′ end). Nucleic acid sequences are always written 5′ → 3′.

DNA Structure

Deoxyribonucleic acid (DNA) is the molecule of heredity in all living organisms and most viruses. Watson and Crick's 1953 model, built using Chargaff's base-pairing rules and Franklin's X-ray diffraction data, revealed DNA as a double helix: two antiparallel polynucleotide strands wound around a common central axis, with:

  • The sugar-phosphate backbones on the outside, facing the aqueous environment
  • The nitrogenous bases pointing inward, stacked and paired in the interior

The two strands are held together by hydrogen bonds between specific, complementary base pairs — Chargaff's rules:

  • Adenine (A) pairs with Thymine (T) — 2 hydrogen bonds
  • Guanine (G) pairs with Cytosine (C) — 3 hydrogen bonds

G–C pairs are stronger than A–T pairs because they form an additional hydrogen bond. DNA sequences with higher G+C content denature (strands separate) at higher temperatures than A+T-rich sequences — a property exploited in PCR (polymerase chain reaction).

The two strands run antiparallel: one strand runs 5′ → 3′ while its complementary partner runs 3′ → 5′. This antiparallel orientation is required by the geometry of base pairing and the directionality of the phosphodiester backbone.

Diagram: DNA Double Helix Structure

View Fullscreen

DNA Double Helix Structure (existing sim)

Type: diagram sim-id: dna-double-helix
Library: p5.js
Status: Complete

This interactive diagram shows the DNA double helix with labeled structural components: the two antiparallel strands, the sugar-phosphate backbone, base pairs (A–T and G–C), the major and minor grooves, and strand directionality (5′→3′). Explore mode provides detailed descriptions of each labeled element; Quiz mode challenges students to identify structures by name.

RNA Structure

Ribonucleic acid (RNA) differs from DNA in three key structural respects:

  • Sugar — RNA contains ribose (with a 2′–OH group) instead of deoxyribose. The 2′–OH makes RNA less stable than DNA (it can participate in hydrolysis reactions) but also gives RNA the ability to fold into complex 3D structures with catalytic activity.
  • One of the bases — RNA uses uracil (U) instead of thymine (T). Both pair with adenine (A), but uracil lacks the methyl group at C-5 that thymine has.
  • Structure — DNA is predominantly double-stranded; RNA is predominantly single-stranded, though it can form local double-stranded regions by folding back on itself (hairpin loops, stem-loop structures).

The three major functional classes of RNA are:

  • Messenger RNA (mRNA) — carries the genetic message from DNA in the nucleus to the ribosome for translation. Its sequence of codons (three-nucleotide words) specifies the amino acid sequence of a protein.
  • Transfer RNA (tRNA) — adaptor molecules that carry specific amino acids to the ribosome and decode the mRNA codons via complementary anticodons. Each tRNA folds into a characteristic L-shaped tertiary structure.
  • Ribosomal RNA (rRNA) — the major structural and catalytic component of ribosomes. The ribosome's ability to catalyze peptide bond formation resides in rRNA, making the ribosome a ribozyme — an RNA molecule with enzymatic activity.

The comparison between DNA and RNA reveals a structural logic: DNA's stability (no 2′–OH, thymine instead of uracil, double-stranded) makes it an ideal long-term information archive, while RNA's flexibility (single-stranded, capable of folding, 2′–OH enabling catalysis) makes it suited for the diverse information-transfer and catalytic roles it plays in gene expression.

Feature DNA RNA
Sugar Deoxyribose (no 2′–OH) Ribose (has 2′–OH)
Bases A, T, G, C A, U, G, C
Strands Double-stranded (antiparallel) Single-stranded (with local folds)
Stability More stable (long-term storage) Less stable (short-lived message)
Location Nucleus (eukaryotes); cytoplasm (prokaryotes) Nucleus + cytoplasm
Function Information storage and template Information transfer, catalysis, translation

You've Got This!

Gregor encouraging Twenty-three concepts is a lot to take in — but notice that every section follows the same logic: monomer structure → polymer structure → biological function → consequence of disruption. Amino acids → polypeptides → folded protein → denaturation. Nucleotides → polynucleotide chains → double helix → base pairing rules. Once you see this pattern, you can reason about any macromolecule even if you haven't memorized every detail.


Key Connections and Chapter Summary

The four classes of biological macromolecules are the molecular foundation of every biological process studied in AP Biology. The key themes to carry forward are:

  • Carbohydrates — built from monosaccharides by α- or β-glycosidic bonds; α-linkages yield digestible energy-storage polymers (starch, glycogen); β-linkages yield structural, indigestible polymers (cellulose, chitin).
  • Lipids — unified by hydrophobicity, not by polymer structure; saturated fats are solid (tightly packed chains), unsaturated fats are liquid (kinked chains prevent packing); phospholipids self-assemble into bilayers; cholesterol buffers membrane fluidity and is the precursor to steroid hormones.
  • Proteins — built from 20 amino acids by peptide bonds; function depends on four levels of structure; tertiary structure is primarily stabilized by hydrophobic interactions; denaturation disrupts 3D structure without breaking peptide bonds.
  • Nucleic acids — built from nucleotides by phosphodiester bonds; DNA is a stable double helix encoding heritable information; RNA is single-stranded, versatile, and carries, transfers, and catalyzes the expression of genetic information.

Excellent Work!

Gregor celebrating You have just completed one of the most content-dense chapters in AP Biology, investigators! The macromolecules you mastered here will appear in every remaining chapter. Phospholipids in Chapter 5 (membranes and transport). Proteins in Chapter 6 (enzyme kinetics). DNA and RNA in Chapters 13 and 14 (gene expression). Polysaccharides in the context of cell walls and glycolipids throughout. Every concept you learned here is an investment that pays dividends for the rest of the course.

Self-Check: Test Your Understanding — Click to Reveal

Question: A student heats a solution containing the enzyme catalase until the enzyme no longer breaks down hydrogen peroxide. She then cools the solution back to 37°C, but enzyme activity does not return. (a) What process destroyed the enzyme's activity? (b) Which level(s) of protein structure were disrupted? (c) Were any peptide bonds broken? (d) Why does the enzyme not recover when cooled?

Answer: (a) Denaturation — heat disrupted the noncovalent interactions holding the enzyme in its functional 3D shape. (b) Tertiary structure was disrupted (and secondary structure locally). Primary structure was unchanged because peptide bonds were not broken. (c) No peptide bonds were broken — the amino acid sequence is intact. (d) Denaturation of most large proteins is irreversible because the completely unfolded polypeptide has a very low probability of refolding correctly into the precise tertiary structure — it is far more likely to misfold or aggregate with other denatured chains. Only small, simple proteins refold spontaneously under laboratory conditions.