Six-Step Data Science Workflow

Embed This MicroSim

Copy this iframe to embed this MicroSim in your website:

<iframe src="https://dmccreary.github.io/data-science-course/sims/data-science-workflow/main.html" height="300px" scrolling="no"></iframe>

Run the Data Science Workflow MicroSim in Fullscreen

Edit this MicroSim in the p5.js Editor

About This MicroSim

This interactive infographic illustrates the six essential steps of the data science workflow:

  1. Define the Problem - Clearly articulate the question you want to answer
  2. Collect Data - Gather relevant data from various sources
  3. Clean Data - Process and prepare data for analysis
  4. Analyze Data - Apply statistical and machine learning methods
  5. Visualize Results - Create charts and graphs to communicate findings
  6. Communicate Insights - Share conclusions with stakeholders
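The six steps above can be walked through in a few lines of Python. This is only a minimal sketch under stated assumptions: the question, the quiz scores, and the helper names `clean` and `bar_chart` are made up for illustration and are not part of the MicroSim.

```python
from statistics import mean

# 1. Define the Problem (hypothetical question, for illustration only).
question = "What is the average quiz score in our class?"

# 2. Collect Data: a made-up sample standing in for a real data source.
raw_scores = ["88", "92", "", "75", "n/a", "100"]

# 3. Clean Data: keep only entries that are valid whole numbers.
def clean(values):
    return [int(v) for v in values if v.strip().isdigit()]

scores = clean(raw_scores)

# 4. Analyze Data: compute a simple summary statistic.
average = mean(scores)

# 5. Visualize Results: a text bar chart, one '#' per 10 points.
def bar_chart(values):
    return [f"{v:>3} " + "#" * (v // 10) for v in values]

chart = bar_chart(scores)

# 6. Communicate Insights: state the finding in plain language.
summary = f"{question} About {average:.1f}, based on {len(scores)} valid scores."

print("\n".join(chart))
print(summary)
```

Each numbered comment maps to one step of the workflow, so students can see how the output of one step becomes the input of the next.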

Lesson Plan

Grade Level, Subject and Topic

Grade level: 9th-12th grade. Subject: introductory data science. Topic: understanding the data science process.

Learning Objectives

By the end of this lesson, students will be able to:

  • Identify the six steps of the data science workflow
  • Explain the purpose of each step in the process
  • Describe how the steps connect to form a complete analysis pipeline

Activities

Step Identification

Have students identify which step they are performing when working on a data project.

Real-World Examples

Discuss real-world data science projects and map them to the six-step workflow.

Process Ordering

Give students the six workflow steps in scrambled order and have them arrange the steps in the correct sequence.
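One possible way to prepare this activity is to generate the scrambled list with a short script. The sketch below is an assumption about how a teacher might do it, not something the MicroSim provides; the names `scramble` and `count_correct_positions` are hypothetical.

```python
import random

# The six workflow steps in their correct order, as listed on this page.
WORKFLOW_STEPS = [
    "Define the Problem",
    "Collect Data",
    "Clean Data",
    "Analyze Data",
    "Visualize Results",
    "Communicate Insights",
]

def scramble(steps, seed=None):
    """Return a shuffled copy that differs from the original order.

    Assumes at least two distinct steps; a seed makes the result repeatable.
    """
    rng = random.Random(seed)
    shuffled = list(steps)
    while shuffled == list(steps):
        rng.shuffle(shuffled)
    return shuffled

def count_correct_positions(answer):
    """Count how many steps a student placed in the correct position."""
    return sum(a == b for a, b in zip(answer, WORKFLOW_STEPS))

scrambled = scramble(WORKFLOW_STEPS, seed=42)
print("Arrange these steps in the correct order:")
for step in scrambled:
    print(" -", step)
```

A student's answer can then be scored with `count_correct_positions(answer)`, which returns 6 for a fully correct ordering.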

References

  1. What is Data Science? - Wikipedia - Overview of data science as a field
  2. The Data Science Process - KDnuggets - Detailed explanation of data science methodology
  3. p5.js Reference - p5.js Documentation - JavaScript library used to build this interactive simulation