Skip to content

Mutations, Gene Regulation, and Biotechnology

Gregor Welcomes You!

Gregor welcomes you Welcome, investigators! In the last chapter you traced the central dogma — the flow of information from DNA through RNA to protein. But what happens when that information changes? And how do cells decide which genes to turn on and when? This chapter tackles three interconnected themes: mutations that alter the genetic message, the sophisticated regulatory systems cells use to control gene expression, and the powerful biotechnology tools scientists have developed to read, copy, and edit DNA. Let's investigate!

Summary

Gene expression is not static — cells tune it constantly, and errors in the sequence or regulation of genes have profound consequences. This chapter opens with the five classes of point and frameshift mutations, explaining how each alters protein structure and potentially fitness. It then examines gene regulation in prokaryotes (the lac and trp operons as models of inducible and repressible systems) and eukaryotes (transcription factors, enhancers, epigenetic marks, and microRNAs as layers of control). The final section introduces the biotechnology toolkit: restriction enzymes, gel electrophoresis, PCR, DNA cloning, recombinant DNA, CRISPR-Cas9, gene therapy, DNA sequencing, genomics, and bioinformatics.

Concepts Covered

This chapter covers the following 29 concepts from the learning graph:

Mutations

  1. Point Mutations
  2. Frameshift Mutations
  3. Silent Mutations
  4. Missense Mutations
  5. Nonsense Mutations
  6. Mutagens and DNA Damage

Prokaryotic Gene Regulation

  1. Gene Regulation (Prokaryotes)
  2. Operons
  3. Lac Operon
  4. Trp Operon

Eukaryotic Gene Regulation

  1. Gene Regulation (Eukaryotes)
  2. Transcription Factors
  3. Enhancers and Silencers
  4. Epigenetic Regulation
  5. DNA Methylation
  6. Histone Modification
  7. Chromatin Remodeling
  8. MicroRNAs

Biotechnology

  1. Biotechnology Overview
  2. Restriction Enzymes
  3. Gel Electrophoresis
  4. PCR
  5. DNA Cloning
  6. Recombinant DNA Technology
  7. CRISPR-Cas9
  8. Gene Therapy Concepts
  9. DNA Sequencing
  10. Genomics Overview
  11. Bioinformatics Basics

Prerequisites

This chapter builds on concepts from:


Part 1: Mutations

A mutation is any change in the nucleotide sequence of DNA. Mutations are the ultimate source of genetic variation — the raw material for evolution. Some mutations are harmful, some are neutral, and a rare few are beneficial.

Point Mutations

A point mutation (also called a base substitution) is a change in a single nucleotide pair in DNA. The three outcomes depend on how the altered codon is translated:

Silent Mutations

A silent mutation changes a codon but does NOT change the amino acid. This is possible because the genetic code is redundant — for example, both GCU and GCC code for alanine.

  • The mutation typically affects the third position (wobble position) of the codon
  • The protein is unchanged → usually no phenotypic effect
  • Silent mutations can still affect mRNA stability or splicing in some cases

Missense Mutations

A missense mutation changes a codon so that it specifies a different amino acid. The impact depends on the chemical properties of the substituted amino acid:

  • Conservative missense — the new amino acid has similar properties (e.g., replacing one hydrophobic amino acid with another) → protein function may be preserved
  • Non-conservative missense — the new amino acid has very different properties → protein function may be disrupted

Example: Sickle cell disease — a single missense mutation changes the sixth amino acid of \(\beta\)-globin from glutamic acid (hydrophilic) to valine (hydrophobic), causing hemoglobin molecules to polymerize and red blood cells to sickle.

Nonsense Mutations

A nonsense mutation changes a codon that specifies an amino acid into a premature stop codon (UAA, UAG, or UGA). The result is a truncated (shortened) protein that is usually nonfunctional.

Mutation type Base change Codon effect Protein effect
Silent Single base Different codon, same amino acid No change
Missense Single base Different codon, different amino acid May alter function
Nonsense Single base Creates premature stop codon Truncated protein

Frameshift Mutations

Frameshift mutations involve the insertion or deletion of nucleotides (not in multiples of three) that shift the reading frame of all downstream codons. Because codons are read as non-overlapping triplets from the start codon, adding or removing one or two bases changes every subsequent codon.

Frameshifts are typically more devastating than point mutations because they alter every amino acid downstream of the mutation, and they frequently introduce a premature stop codon.

Diagram: Mutation Effects Comparator

View Fullscreen

Mutation Effects Comparator — Specification

Type: MicroSim (p5.js)
sim-id: mutation-effects
Library: p5.js
Status: Specified

Learning objective: Students will be able to differentiate (Bloom's L4: Analyze) among silent, missense, nonsense, and frameshift mutations by observing how each type alters the mRNA codon sequence and resulting amino acid chain.

Instructional Rationale: Side-by-side comparison of normal vs. mutant sequences with concrete amino acid data lets students see exactly why a frameshift is more damaging than a point mutation. Interactive mutation placement empowers exploration.

Canvas: 800 × 480 px, responsive.

Layout:

  • Top: Original DNA template strand with codons labeled and amino acids shown below
  • Center: Mutation selector panel
  • Buttons: Silent, Missense, Nonsense, Insertion (+1 base), Deletion (-1 base)
  • Click any nucleotide position in the sequence to place the mutation there
  • Bottom: Mutant sequence showing changed codons and altered amino acids
  • Changed nucleotides highlighted in red
  • Changed amino acids highlighted in orange
  • Premature stop codons highlighted in bright red

Data Visibility Requirements:

  • Original sequence: AUGGCUUACCGAAACUGA → Met-Ala-Tyr-Arg-Asn-Stop
  • After silent mutation at position 9 (C→U): AUGGCUUAUUGA... → same amino acids
  • After missense: one amino acid changes, shown in orange
  • After nonsense: premature stop, downstream amino acids grayed out with strikethrough
  • After insertion: reading frame shifts, all downstream amino acids change (shown in red), likely premature stop
  • After deletion: same as insertion but removing a base

Interaction: - Click mutation type button → click a position → both sequences update - "Reset" button to clear mutations - "Compare All" mode: show all 5 mutation types simultaneously for the same position - Amino acid property labels (hydrophobic, polar, charged) shown on hover

Colors: Normal: blue text. Mutated base: red. Changed amino acid: orange. Premature stop: bright red background. Frameshift downstream: all red.

Responsive design: Sequences wrap to multiple lines on narrow screens; mutation panel collapses to dropdown.

Mutagens and DNA Damage

Mutagens are agents that increase the mutation rate above the spontaneous background level. Categories include:

  • Chemical mutagens — base analogs (e.g., 5-bromouracil mimics thymine), alkylating agents (e.g., mustard gas adds alkyl groups to bases), deaminating agents (e.g., nitrous acid converts cytosine to uracil)
  • Radiation — UV light causes thymine dimers (covalent bonds between adjacent thymines); ionizing radiation (X-rays, gamma rays) causes single- and double-strand breaks
  • Biological mutagens — transposons ("jumping genes") can insert into genes and disrupt them

Cells combat DNA damage using the repair mechanisms discussed in Chapter 13 (proofreading, mismatch repair, nucleotide excision repair).


Part 2: Gene Regulation in Prokaryotes

Gene Regulation Overview

Not all genes are expressed at all times. Gene regulation is the set of mechanisms that control when, where, and how much of a gene product is made. Regulation saves energy by producing proteins only when they are needed.

Operons

In prokaryotes, genes with related functions are often organized into operons — clusters of genes transcribed together as a single mRNA under the control of a shared promoter and operator.

An operon consists of:

  • Promoter — RNA polymerase binding site
  • Operator — a DNA segment between the promoter and the structural genes where a repressor protein can bind to block transcription
  • Structural genes — the protein-coding genes transcribed as a polycistronic mRNA
  • Regulatory gene — encodes the repressor protein (located elsewhere on the chromosome)

The Lac Operon (Inducible)

The lac operon controls the metabolism of lactose in E. coli. It is an inducible operon — normally OFF, turned ON when lactose is present.

Three structural genes:

  • lacZ — encodes \(\beta\)-galactosidase (cleaves lactose into glucose + galactose)
  • lacY — encodes permease (transports lactose into the cell)
  • lacA — encodes transacetylase

Regulation logic:

Condition Lactose Glucose lac operon
No need Absent Present OFF — repressor binds operator
Lactose available Present Absent ON — allolactose (inducer) binds repressor, releases it from operator; cAMP + CAP activate transcription
Both sugars Present Present LOW — glucose is preferred; low cAMP means weak CAP activation

Key Insight

Gregor thinking The lac operon demonstrates a fundamental biological principle: cells are economical. Why waste energy making lactose-digesting enzymes when glucose — the preferred energy source — is already available? This "catabolite repression" (glucose effect) reflects the same thermodynamic logic from Chapter 6: organisms minimize energy expenditure wherever possible.

The Trp Operon (Repressible)

The trp operon controls the biosynthesis of tryptophan. It is a repressible operon — normally ON, turned OFF when tryptophan levels are high.

  • When tryptophan is absent: the repressor is inactive → operon is ON → enzymes are synthesized to make tryptophan
  • When tryptophan is abundant: tryptophan acts as a corepressor, binding to the repressor and activating it → repressor binds the operator → operon is OFF

This is an example of feedback inhibition at the gene expression level — the end product of a pathway shuts off the genes encoding the pathway's enzymes.

Diagram: Operon Regulation Simulator

View Fullscreen

Operon Regulation Simulator — Specification

Type: MicroSim (p5.js)
sim-id: operon-regulation
Library: p5.js
Status: Specified

Learning objective: Students will be able to compare (Bloom's L4: Analyze) inducible (lac) and repressible (trp) operon regulation, and predict (Bloom's L3: Apply) the transcription state of each operon given different environmental conditions.

Instructional Rationale: An interactive toggle between lactose/glucose/tryptophan conditions with visible molecular consequences (repressor binding/releasing, RNA polymerase proceeding/blocked) makes abstract regulation logic concrete and testable.

Canvas: 800 × 480 px, responsive.

Layout:

  • Top: Toggle buttons — "Lac Operon" / "Trp Operon"
  • Center: Operon diagram showing:
  • Regulatory gene → repressor protein
  • Promoter, operator, structural genes as labeled segments
  • RNA polymerase approaching/transcribing
  • Repressor protein binding/releasing from operator
  • Inducer or corepressor molecules shown
  • Bottom: Condition selector panel
  • Lac mode: toggles for Lactose (present/absent) and Glucose (present/absent)
  • Trp mode: toggle for Tryptophan (present/absent)
  • Right: Result display — "Operon: ON" or "Operon: OFF" with explanation

Interaction: - Toggle environmental conditions → operon diagram animates to show molecular response - Repressor visibly attaches/detaches from operator - RNA polymerase shown transcribing or blocked - mRNA and protein products appear when operon is ON - "Quiz Mode" button: presents a condition and asks student to predict ON/OFF before revealing

Colors: Promoter: green. Operator: orange. Structural genes: blue. Repressor: red (active) / gray (inactive). Inducer: purple. Corepressor: yellow.

Responsive design: Operon diagram scales horizontally; condition panel wraps below on narrow screens.


Part 3: Gene Regulation in Eukaryotes

Eukaryotic gene regulation is far more complex than prokaryotic regulation, with multiple layers of control:

  1. Chromatin level — epigenetic modifications determine accessibility
  2. Transcriptional level — transcription factors, enhancers, silencers
  3. Post-transcriptional level — mRNA processing, alternative splicing, mRNA stability
  4. Translational level — microRNAs, translation initiation factors
  5. Post-translational level — protein modification, degradation

Transcription Factors

Transcription factors are proteins that bind to specific DNA sequences to promote or inhibit transcription. In eukaryotes, RNA polymerase II cannot bind the promoter on its own — it requires a complex of general transcription factors (TFIIA, TFIIB, TFIID, etc.) to assemble at the TATA box.

Beyond these general factors, specific transcription factors (activators and repressors) bind to regulatory DNA sequences to fine-tune expression of individual genes.

Enhancers and Silencers

  • Enhancers — DNA sequences (often thousands of base pairs away from the gene) that increase transcription when bound by activator proteins. DNA looping brings the enhancer-bound activators into contact with the transcription machinery at the promoter.
  • Silencers — DNA sequences that decrease transcription when bound by repressor proteins.

Epigenetic Regulation

Epigenetic regulation involves heritable changes in gene expression that do NOT alter the DNA sequence itself. Instead, chemical modifications to DNA or histone proteins determine whether a gene is accessible for transcription.

DNA Methylation

DNA methylation is the addition of methyl groups (\(\ce{-CH3}\)) to cytosine bases, typically at CpG dinucleotides. Methylation generally silences gene expression by:

  • Blocking transcription factor binding
  • Recruiting proteins that condense chromatin

Methylation patterns can be inherited during cell division, explaining how differentiated cells maintain their identity (a liver cell stays a liver cell) even though all cells contain the same DNA.

Histone Modification

Histones are the protein spools around which DNA wraps to form nucleosomes. Chemical modifications to histone tails affect how tightly DNA is packaged:

  • Acetylation (adding acetyl groups) → loosens chromatin → activates transcription
  • Methylation (of histones, not DNA) → can activate or repress depending on which residue is modified
  • Phosphorylation → typically associated with chromosome condensation during mitosis

Chromatin Remodeling

Chromatin remodeling complexes use ATP energy to physically reposition, eject, or restructure nucleosomes, making DNA more or less accessible to transcription machinery.

  • Euchromatin — loosely packed; transcriptionally active
  • Heterochromatin — tightly packed; transcriptionally silent
Modification Effect on chromatin Effect on transcription
DNA methylation Condenses Silences
Histone acetylation Loosens Activates
Histone deacetylation Condenses Silences
Histone methylation Varies Context-dependent

MicroRNAs

MicroRNAs (miRNAs) are small (~22 nucleotide) non-coding RNA molecules that regulate gene expression post-transcriptionally. They bind to complementary sequences in the 3' untranslated region (UTR) of target mRNAs, leading to:

  • mRNA degradation — the mRNA is broken down
  • Translational repression — the ribosome cannot translate the mRNA

A single miRNA can regulate hundreds of different mRNA targets, and a single mRNA can be regulated by multiple miRNAs.

Diagram: Eukaryotic Gene Regulation Layers

View Fullscreen

Eukaryotic Gene Regulation Layers — Specification

Type: Infographic (p5.js)
sim-id: eukaryotic-gene-regulation
Library: p5.js
Status: Specified

Learning objective: Students will be able to organize (Bloom's L4: Analyze) the multiple layers of eukaryotic gene regulation from chromatin level to post-translational, and explain how each layer contributes to controlling gene expression.

Instructional Rationale: A layered infographic with progressive disclosure (hover to reveal details at each level) avoids overwhelming students with all regulatory mechanisms at once while showing the complete picture.

Canvas: 780 × 480 px, responsive.

Layout: Vertical stack of 5 layers, each represented as a horizontal band:

  1. Chromatin level — shows nucleosomes with methylation/acetylation marks
  2. Transcriptional level — shows promoter, enhancer, transcription factors
  3. Post-transcriptional level — shows mRNA processing, alternative splicing
  4. Translational level — shows miRNA binding to mRNA, ribosome blocked
  5. Post-translational level — shows protein folding, ubiquitin tagging

Interaction: - Hover over any layer → expands to show detail diagram and 2–3 sentence description - Click layer → locks the expanded view; click again to collapse - "Flow" animation button: traces a gene from chromatin through to functional protein, highlighting each regulatory checkpoint - Color-coded: activation mechanisms in green, repression mechanisms in red

Colors: Chromatin: purple band. Transcription: blue band. Post-transcription: teal band. Translation: orange band. Post-translation: brown band.

Responsive design: Layers stack and expand vertically; all text and diagrams scale with container.


Part 4: Biotechnology

Biotechnology Overview

Biotechnology is the application of biological knowledge and molecular tools to manipulate DNA for research, medicine, agriculture, and industry. The tools described below form the modern molecular biology toolkit.

Restriction Enzymes

Restriction enzymes (restriction endonucleases) are bacterial enzymes that cut DNA at specific recognition sequences (usually 4–8 base pair palindromes).

  • They produce either blunt ends or sticky ends (short single-stranded overhangs)
  • Sticky ends are especially useful because complementary sticky ends from different DNA sources can base-pair and be joined by DNA ligase → foundation of recombinant DNA technology

Example: EcoRI recognizes GAATTC and cuts between G and A on both strands, producing 4-base sticky ends.

Gel Electrophoresis

Gel electrophoresis separates DNA fragments by size using an electric field. DNA is negatively charged (due to phosphate groups) and migrates toward the positive electrode:

  • Smaller fragments move faster (farther through the gel)
  • Larger fragments move slower (stay closer to the wells)
  • Fragment sizes are determined by comparison to a DNA ladder (known size standards)

PCR (Polymerase Chain Reaction)

PCR amplifies a specific DNA segment exponentially. Starting from a tiny sample, PCR can produce billions of copies in a few hours.

Each PCR cycle has three steps:

  1. Denaturation (~95°C) — heat separates the double-stranded DNA into single strands
  2. Annealing (~55°C) — short DNA primers bind to complementary flanking sequences
  3. Extension (~72°C) — Taq polymerase (heat-stable DNA polymerase from Thermus aquaticus) synthesizes new DNA from the primers

After \(n\) cycles, the number of DNA copies = \(2^n\) (exponential amplification).

Cycle Copies
0 1
10 ~1,000
20 ~1,000,000
30 ~1,000,000,000

DNA Cloning and Recombinant DNA Technology

DNA cloning produces many identical copies of a gene or DNA segment by inserting it into a self-replicating vector (usually a bacterial plasmid).

Recombinant DNA technology combines DNA from different sources:

  1. Cut the gene of interest and the plasmid vector with the same restriction enzyme (same sticky ends)
  2. Mix → sticky ends base-pair
  3. DNA ligase seals the joins → recombinant plasmid
  4. Transform bacteria with the plasmid (heat shock or electroporation)
  5. Select transformed bacteria using an antibiotic resistance marker on the plasmid
  6. Bacteria replicate the plasmid → billions of copies of the inserted gene

CRISPR-Cas9

CRISPR-Cas9 is a revolutionary gene-editing technology adapted from a natural bacterial immune system.

Components:

  • Guide RNA (gRNA) — a ~20-nucleotide RNA sequence complementary to the target DNA
  • Cas9 — an endonuclease that creates a double-strand break at the target site

The system works by:

  1. The gRNA directs Cas9 to the complementary DNA sequence
  2. Cas9 cuts both strands of the DNA
  3. The cell's repair mechanisms fix the break:
  4. Non-homologous end joining (NHEJ) — error-prone; often introduces insertions/deletions → gene knockout
  5. Homology-directed repair (HDR) — uses a provided DNA template → precise gene editing

Gregor's Tip

Gregor's tip CRISPR is a frequent AP exam topic. Know three things: (1) the guide RNA provides specificity (targeting), (2) Cas9 provides the cutting function, and (3) the cell's own repair machinery does the actual editing. Also be prepared to discuss ethical considerations — CRISPR can edit human germline cells, raising questions about heritable genetic modifications.

Gene Therapy Concepts

Gene therapy uses genetic engineering to treat or prevent disease by:

  • Adding a functional gene to compensate for a defective one (most common approach)
  • Editing the defective gene directly using CRISPR or similar technology
  • Silencing a harmful gene that is overexpressed

Delivery methods include viral vectors (modified viruses that insert the gene) and non-viral methods (lipid nanoparticles, electroporation).

DNA Sequencing

DNA sequencing determines the exact order of nucleotides in a DNA molecule. Sanger sequencing (chain-termination method) was the first widely used technique:

  1. DNA polymerase extends from a primer using normal nucleotides plus fluorescently labeled dideoxynucleotides (ddNTPs) that terminate the chain
  2. This produces fragments of every possible length, each ending with a labeled base
  3. Gel electrophoresis separates fragments by size; a laser reads the fluorescent labels → sequence

Modern next-generation sequencing (NGS) technologies can sequence entire genomes in days at a fraction of the cost.

Genomics Overview and Bioinformatics Basics

Genomics is the study of entire genomes — all the DNA in an organism, including genes and non-coding sequences. The Human Genome Project (completed 2003) sequenced all ~3.2 billion base pairs of human DNA.

Bioinformatics applies computational tools to analyze large biological datasets:

  • Sequence alignment and comparison
  • Gene prediction and annotation
  • Phylogenetic analysis from sequence data
  • Protein structure prediction

Diagram: Biotechnology Toolkit Workflow

View Fullscreen

Biotechnology Toolkit Workflow — Specification

Type: Workflow diagram (p5.js)
sim-id: biotech-toolkit
Library: p5.js
Status: Specified

Learning objective: Students will be able to organize (Bloom's L4: Analyze) the major biotechnology tools into a logical workflow and explain (Bloom's L2: Understand) how each tool contributes to a gene cloning or gene editing experiment.

Instructional Rationale: A clickable workflow diagram connecting the tools in a logical experimental pipeline helps students understand that these are not isolated techniques but rather steps in an integrated process.

Canvas: 780 × 480 px, responsive.

Layout: Flowchart with two parallel tracks:

  • Track 1: Gene Cloning — Restriction enzymes → Gel electrophoresis → DNA cloning → Recombinant DNA → Transform bacteria → Screen colonies
  • Track 2: Gene Editing — PCR (amplify target region) → CRISPR-Cas9 design → Guide RNA + Cas9 delivery → Repair pathway → Verify edit (sequencing)
  • Shared tools (connecting both tracks): PCR, Gel electrophoresis, DNA sequencing

Each node is a rounded rectangle with tool name and icon.

Interaction: - Hover over any node: tooltip with 2–3 sentence description of the tool and its purpose in this step - Click any node: expands to show a mini-diagram of how the tool works - Arrows between nodes animate to show flow direction - Toggle: "Show Timeline" overlays approximate time for each step

Colors: Gene cloning track: blue (#3498DB). Gene editing track: green (#27AE60). Shared tools: purple (#8E44AD). Active node: bright highlight.

Responsive design: Flowchart reflows to vertical layout on narrow screens.


Key Takeaways

  1. Point mutations (single base changes) include silent (no amino acid change), missense (different amino acid), and nonsense (premature stop codon) mutations.

  2. Frameshift mutations (insertions/deletions not in multiples of 3) shift the reading frame of all downstream codons and are typically severe.

  3. Mutagens (chemicals, radiation, transposons) increase mutation rates above the spontaneous baseline.

  4. Operons organize related prokaryotic genes under shared regulatory control. The lac operon (inducible) turns ON with lactose; the trp operon (repressible) turns OFF when tryptophan is abundant.

  5. Eukaryotic gene regulation operates at multiple levels: chromatin (epigenetics), transcriptional (transcription factors, enhancers/silencers), post-transcriptional (splicing, miRNAs), translational, and post-translational.

  6. Epigenetic modifications — DNA methylation, histone acetylation/deacetylation, and chromatin remodeling — control gene accessibility without changing the DNA sequence.

  7. MicroRNAs silence gene expression post-transcriptionally by targeting mRNAs for degradation or blocking translation.

  8. Restriction enzymes cut DNA at specific palindromic sequences; gel electrophoresis separates DNA fragments by size.

  9. PCR amplifies specific DNA segments exponentially (\(2^n\) copies after \(n\) cycles).

  10. Recombinant DNA technology inserts foreign DNA into plasmid vectors using restriction enzymes and ligase, then transforms bacteria for cloning.

  11. CRISPR-Cas9 uses a guide RNA for targeting and Cas9 for cutting, enabling precise gene editing via the cell's own repair pathways.

  12. DNA sequencing (Sanger and NGS) determines nucleotide order; genomics and bioinformatics analyze entire genomes computationally.


AP Practice: Test Your Understanding

Question 1: A mutation changes the mRNA codon from UAC (tyrosine) to UAG. What type of mutation is this, and what is the likely effect on the protein?

Answer: This is a nonsense mutation — UAG is a stop codon. The protein will be truncated at this position, losing all downstream amino acids. The protein is likely nonfunctional.

Question 2: An E. coli cell is growing in a medium containing both glucose and lactose. Predict the state of the lac operon and explain your reasoning.

Answer: The lac operon is mostly OFF (low expression). Although lactose is present (so allolactose can remove the repressor from the operator), glucose is also present — which keeps cAMP levels low. Without cAMP, the CAP activator protein cannot bind the promoter region effectively, resulting in minimal transcription. The cell preferentially uses glucose.

Question 3: A researcher wants to insert a human insulin gene into a bacterial plasmid. Describe the key steps using recombinant DNA technology.

Answer: (1) Cut the human insulin gene and the plasmid with the same restriction enzyme to generate compatible sticky ends. (2) Mix the gene and plasmid → sticky ends base-pair. (3) Use DNA ligase to seal the phosphodiester backbone → recombinant plasmid. (4) Transform bacteria with the recombinant plasmid (heat shock or electroporation). (5) Select transformants using an antibiotic resistance gene on the plasmid. (6) Confirm the insert by gel electrophoresis or DNA sequencing.