Forward Propagation
Run the Forward Propagation MicroSim Fullscreen
Edit the MicroSim with the p5.js editor
About This MicroSim
This visualization shows exactly how data flows through a neural network, step by step. Watch as inputs are transformed through matrix multiplications and activation functions to produce outputs.
The Forward Propagation Algorithm
For each layer \(l\) from 1 to \(L\) (a code sketch follows this list):
- Linear Step: \(\mathbf{z}^{[l]} = W^{[l]}\mathbf{a}^{[l-1]} + \mathbf{b}^{[l]}\)
- Activation Step: \(\mathbf{a}^{[l]} = \sigma^{[l]}(\mathbf{z}^{[l]})\)
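As a concrete illustration of these two steps, here is a minimal NumPy sketch of forward propagation through a small fully connected network. It is not the MicroSim's p5.js source; the layer sizes, random weights, and input values are assumptions chosen to mirror the 2 → 3 → 2 network described later on this page.

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)          # elementwise max(0, z)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))    # elementwise logistic function

def forward(a0, weights, biases, activations):
    """Forward propagation: a[l] = sigma[l](W[l] a[l-1] + b[l]) for each layer."""
    a = a0
    for W, b, sigma in zip(weights, biases, activations):
        z = W @ a + b       # linear step
        a = sigma(z)        # activation step
    return a

# Illustrative 2 -> 3 -> 2 network with random weights (values are made up).
rng = np.random.default_rng(0)
weights = [rng.standard_normal((3, 2)), rng.standard_normal((2, 3))]
biases = [rng.standard_normal(3), rng.standard_normal(2)]
output = forward(np.array([0.5, -0.2]), weights, biases, [relu, sigmoid])
print(output)   # two values in (0, 1), because the output layer uses sigmoid
```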
Interactive Features
- Next Step: Advance one computation at a time
- Auto Run: Watch the propagation animate automatically
- Speed Control: Adjust animation speed
- Input Sliders: Change input values and restart
- Reset: Reinitialize with new random weights
What You'll See
- Yellow nodes: Currently being computed
- Colored nodes: Values already computed (green=input, blue=hidden, red=output)
- Gray nodes: Not yet computed
- Weight labels: Shown on connections during computation
- Matrix equation: Full computation displayed at bottom
Lesson Plan
Learning Objectives
Students will be able to:
- Trace data flow through a neural network layer by layer
- Perform the matrix multiplication for the linear step \(\mathbf{z} = W\mathbf{a} + \mathbf{b}\)
- Apply activation functions elementwise
- Verify dimension compatibility at each step
Suggested Activities
- Manual Verification: Pause at each step and verify the \(\mathbf{z}\) values by hand (one worked check appears after this list)
- Input Exploration: Try different input values and predict the output
- Dimension Tracking: For each step, verify that matrix dimensions are compatible
- ReLU vs Sigmoid: Notice which activation is used where (ReLU=hidden, sigmoid=output)
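For the Manual Verification and ReLU vs Sigmoid activities, students can check their hand computations against a few lines of NumPy. The weights, bias, and input below are made-up example values, not numbers taken from the simulation.

```python
import numpy as np

# Hypothetical layer-1 values for a hand check (not from the MicroSim).
W1 = np.array([[ 0.2, -0.5],
               [ 0.7,  0.1],
               [-0.3,  0.4]])
b1 = np.array([0.1, -0.2, 0.0])
a0 = np.array([1.0, 2.0])

z1 = W1 @ a0 + b1                 # linear step: z = W a + b
print(z1)                         # [-0.7  0.7  0.5]

print(np.maximum(0.0, z1))        # ReLU: negatives clipped to 0 (hidden layer)
print(1 / (1 + np.exp(-z1)))      # sigmoid: squashed into (0, 1) (output layer)
```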
Discussion Questions
- Why do we alternate between linear and nonlinear operations?
- What would happen if we removed all activation functions?
- How does the matrix multiplication \(Wa\) combine information from the previous layer?
- Why might the output use sigmoid instead of ReLU?
Mathematical Details
For the 2 → 3 → 2 network (a quick shape check follows this list):
- Layer 1: \(W^{[1]} \in \mathbb{R}^{3\times2}\), \(\mathbf{b}^{[1]} \in \mathbb{R}^{3}\)
- Layer 2: \(W^{[2]} \in \mathbb{R}^{2\times3}\), \(\mathbf{b}^{[2]} \in \mathbb{R}^{2}\)
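One way to confirm that these shapes are compatible is to build arrays of the stated sizes and inspect the result of each step. The sketch below, assuming NumPy, only checks shapes; the zero values carry no meaning.

```python
import numpy as np

# Shapes for the 2 -> 3 -> 2 network described above.
W1, b1 = np.zeros((3, 2)), np.zeros(3)   # layer 1: R^{3x2}, R^3
W2, b2 = np.zeros((2, 3)), np.zeros(2)   # layer 2: R^{2x3}, R^2
a0 = np.zeros(2)                         # input in R^2

z1 = W1 @ a0 + b1            # (3, 2) @ (2,) + (3,) -> (3,)
a1 = np.maximum(0.0, z1)     # ReLU is elementwise, so a1 keeps shape (3,)
z2 = W2 @ a1 + b2            # (2, 3) @ (3,) + (2,) -> (2,)
print(z1.shape, a1.shape, z2.shape)   # (3,) (3,) (2,)
```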
References
- Goodfellow et al. (2016). Deep Learning, Chapter 6.5: Forward Propagation
- Nielsen (2015). Neural Networks and Deep Learning, Chapter 2