Learning Graph Quality Metrics Report
Overview
- Total Concepts: 500
- Foundational Concepts (no prerequisites, other concepts depend on them): 6
- Terminal Nodes (have prerequisites, but nothing depends on them): 268
- Orphaned Nodes (completely disconnected, no edges): 0
- Concepts with Dependencies: 494
- Average Dependencies per Concept: 1.50 (averaged over the 494 concepts that have dependencies; 1.48 over all 500)
Graph Structure Validation
- Valid DAG Structure: ✅ Yes
- Self-Dependencies: None detected ✅
- Cycles Detected: 0
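The checks above can be reproduced with a short sketch. This is not the code of `analyze_graph.py`, whose internals the report does not show. It assumes the graph is held as a dictionary mapping each concept ID to its list of prerequisite IDs; the name `validate_dag` is hypothetical. It uses Kahn's algorithm: if every concept can be released in prerequisite order, the graph has no cycles.

```python
from collections import deque

def validate_dag(deps):
    """Check a prerequisite mapping {concept_id: [prerequisite_ids]}.

    Returns (is_dag, self_loops). Kahn's algorithm: repeatedly release
    concepts whose prerequisites have all been released; if some concept
    is never released, the graph contains a cycle.
    """
    nodes = set(deps) | {p for prereqs in deps.values() for p in prereqs}
    self_loops = sorted(c for c, prereqs in deps.items() if c in prereqs)

    # Number of distinct prerequisites still unmet for each concept
    unmet = {n: len(set(deps.get(n, ()))) for n in nodes}
    # Reverse adjacency: prerequisite -> concepts that depend on it
    dependents = {n: [] for n in nodes}
    for concept, prereqs in deps.items():
        for p in set(prereqs):
            dependents[p].append(concept)

    ready = deque(n for n in nodes if unmet[n] == 0)
    released = 0
    while ready:
        n = ready.popleft()
        released += 1
        for d in dependents[n]:
            unmet[d] -= 1
            if unmet[d] == 0:
                ready.append(d)
    return released == len(nodes), self_loops


# Toy graphs, not the real 500-concept data
print(validate_dag({1: [], 2: [1], 12: [2]}))  # (True, [])
print(validate_dag({1: [2], 2: [1]}))          # (False, [])
```

A self-dependency is reported separately and also fails the DAG check, since a concept that requires itself can never be released.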
Foundational Concepts
These concepts have no prerequisites:
- 1: Public Health Definition
- 70: Descriptive Statistics
- 77: Probability Fundamentals
- 258: Bioethics Principles
- 333: Systems Thinking Concepts
- 402: Python for Public Health
Dependency Chain Analysis
- Maximum Dependency Chain Length: 12 concepts (11 dependency links)
Longest Learning Path:
- Public Health Definition (ID: 1)
- Population Health (ID: 2)
- Health Indicators (ID: 12)
- Incidence Rate (ID: 21)
- Prevalence Proportion (ID: 22)
- Relative Risk (ID: 31)
- Cohort Study Design (ID: 39)
- Randomized Controlled Trial (ID: 40)
- Hierarchy of Evidence (ID: 308)
- GRADE Framework (ID: 311)
- Dietary Intervention Evidence (ID: 319)
- Obesity Prevention Strategies (ID: 321)
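One way to find such a path in a DAG is a memoized depth-first search. The sketch below is an assumption about how this could be done, not the report script's actual method. `longest_chain` is a hypothetical name, the graph is assumed to be a dictionary from concept ID to prerequisite IDs, and the toy data uses only the first four links of the path listed above.

```python
from functools import lru_cache

def longest_chain(deps):
    """Return the longest prerequisite chain, foundational concept first.

    Assumes deps is a DAG given as {concept_id: [prerequisite_ids]}.
    """
    @lru_cache(maxsize=None)
    def best(concept):
        # Longest chain that ends at `concept`, as a tuple of IDs
        options = [best(p) for p in deps.get(concept, ())]
        return max(options, key=len, default=()) + (concept,)

    return list(max((best(c) for c in deps), key=len, default=()))


# Toy graph: 1 -> 2 -> 12 -> 21 plus an unrelated root (70)
toy = {1: [], 2: [1], 12: [2], 21: [12], 70: []}
chain = longest_chain(toy)
print(chain, len(chain))  # [1, 2, 12, 21] 4
```

Length is counted in concepts here, matching the 12 entries listed for the longest path. When several chains tie, this sketch returns the first one it encounters.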
Terminal Nodes Analysis
Terminal nodes are concepts that have prerequisites but that no other concept depends on. They represent natural endpoints of learning paths: culminating or specialized concepts.
- Total Terminal Nodes: 268 (53.6% of all concepts)
- Healthy Range: 5-40% of total concepts (the current 53.6% is above this range)
Concepts at the end of learning paths:
- 10: Essential Public Health Services
- 16: Tertiary Prevention
- 17: Primordial Prevention
- 18: Public Health Workforce
- 20: One Health Framework
- 23: Cumulative Incidence
- 27: Morbidity Rate
- 29: Secondary Attack Rate
- 35: Population Attrib Fraction
- 36: Number Needed to Treat
- 41: Ecological Study Design
- 42: Natural Experiment Design
- 44: Directed Acyclic Graph
- 45: Counterfactual Model
- 47: Selection Bias
- 48: Information Bias
- 49: Effect Modification
- 52: Active Surveillance
- 54: Syndromic Surveillance
- 59: Index Case Identification
...and 248 more
Orphaned Nodes Analysis
Orphaned nodes are completely disconnected concepts with neither inbound nor outbound edges. They indicate a quality problem, because every concept should connect to the graph.
- Total Orphaned Nodes: 0
✅ No orphaned nodes detected. All concepts are connected to the graph.
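The foundational, terminal, and orphaned categories used in this report follow directly from two sets: concepts that have prerequisites and concepts that something depends on. A minimal sketch, assuming a `{concept_id: [prerequisite_ids]}` mapping and a hypothetical helper name `classify`:

```python
def classify(deps):
    """Split concepts into foundational, terminal, and orphaned groups."""
    has_dependents = {p for prereqs in deps.values() for p in prereqs}
    has_prereqs = {c for c, prereqs in deps.items() if prereqs}
    nodes = set(deps) | has_dependents
    return {
        # No prerequisites, but other concepts depend on them
        "foundational": sorted(has_dependents - has_prereqs),
        # Have prerequisites, but nothing depends on them
        "terminal": sorted(has_prereqs - has_dependents),
        # Neither inbound nor outbound edges
        "orphaned": sorted(nodes - has_prereqs - has_dependents),
    }


# Toy graph: 99 is deliberately disconnected
toy = {1: [], 2: [1], 12: [2], 99: []}
print(classify(toy))
# {'foundational': [1], 'terminal': [12], 'orphaned': [99]}

# Terminal share from the counts in this report, checked against 5-40%
terminal_pct = round(100 * 268 / 500, 1)
print(terminal_pct, 5 <= terminal_pct <= 40)  # 53.6 False
```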
Connected Components
- Number of Connected Components: 1
✅ All concepts are connected in a single graph.
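A component count like this is typically computed by ignoring edge direction, which is how six separate foundational concepts can still belong to one component. The sketch below assumes that interpretation and a `{concept_id: [prerequisite_ids]}` mapping; `count_components` is a hypothetical name.

```python
def count_components(deps):
    """Count weakly connected components (edge direction ignored)."""
    neighbors = {}
    for concept, prereqs in deps.items():
        neighbors.setdefault(concept, set())
        for p in prereqs:
            neighbors[concept].add(p)
            neighbors.setdefault(p, set()).add(concept)

    seen = set()
    components = 0
    for start in neighbors:
        if start in seen:
            continue
        # Each unseen start node opens a new component; flood-fill it
        components += 1
        seen.add(start)
        stack = [start]
        while stack:
            node = stack.pop()
            for other in neighbors[node]:
                if other not in seen:
                    seen.add(other)
                    stack.append(other)
    return components


# Toy graph: two foundational concepts joined by a shared dependent (3)
print(count_components({1: [], 2: [], 3: [1, 2]}))        # 1
print(count_components({1: [], 2: [1], 70: [], 71: [70]}))  # 2
```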
Indegree Analysis
Top 10 concepts by indegree, that is, by the number of other concepts that list them as a prerequisite:
| Rank | Concept ID | Concept Label | Indegree |
|---|---|---|---|
| 1 | 443 | MicroSim Design Principles | 21 |
| 2 | 1 | Public Health Definition | 20 |
| 3 | 70 | Descriptive Statistics | 16 |
| 4 | 50 | Surveillance System Design | 15 |
| 5 | 230 | Social Determinants of Health | 13 |
| 6 | 2 | Population Health | 12 |
| 7 | 4 | Determinants of Health | 12 |
| 8 | 138 | Health Belief Model | 12 |
| 9 | 173 | US Health System Structure | 12 |
| 10 | 357 | Stock-and-Flow Diagram | 11 |
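A ranking of this kind can be sketched with `collections.Counter`. The function name `top_prerequisites` and the `{concept_id: [prerequisite_ids]}` input shape are assumptions. Ties are broken by ascending concept ID, which agrees with the order of the tied rows in the table, although the script's actual tie-breaking rule is not stated.

```python
from collections import Counter

def top_prerequisites(deps, k=10):
    """Return the k (concept_id, indegree) pairs with the highest indegree."""
    # Count how many concepts list each ID among their prerequisites
    indegree = Counter(p for prereqs in deps.values() for p in set(prereqs))
    # Highest indegree first; ties broken by ascending concept ID
    return sorted(indegree.items(), key=lambda item: (-item[1], item[0]))[:k]


# Toy graph: concept 1 is required by 2 and 3, concept 2 by 4
toy = {1: [], 2: [1], 3: [1], 4: [2]}
print(top_prerequisites(toy, k=2))  # [(1, 2), (2, 1)]
```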
Outdegree Distribution
| Dependencies per Concept (Outdegree) | Number of Concepts |
|---|---|
| 0 | 6 |
| 1 | 264 |
| 2 | 215 |
| 3 | 15 |
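The distribution can be cross-checked against the overview figures with a few lines of arithmetic. The numbers below are copied from the table; only the variable names are introduced here.

```python
# Outdegree distribution from the table: dependencies -> number of concepts
distribution = {0: 6, 1: 264, 2: 215, 3: 15}

total_concepts = sum(distribution.values())
total_edges = sum(k * n for k, n in distribution.items())
with_deps = total_concepts - distribution[0]

mean_with_deps = total_edges / with_deps    # over concepts that have dependencies
mean_all = total_edges / total_concepts     # over every concept

print(total_concepts, total_edges, with_deps)    # 500 739 494
print(f"{mean_with_deps:.2f} {mean_all:.2f}")    # 1.50 1.48
```

The totals match the 500 concepts and 494 concepts with dependencies given in the overview, and the 1.50 average corresponds to dividing the 739 dependency edges by those 494 concepts.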
Recommendations
- ℹ️ High terminal node percentage (53.6%, above the 5-40% healthy range): Consider whether some terminal concepts should be prerequisites for advanced concepts
- ✅ DAG structure verified: Graph supports valid learning progressions
- ℹ️ Consider adding cross-dependencies: 264 of the 500 concepts have exactly one prerequisite, so more connections could create richer learning pathways
Report generated by learning-graph-reports/analyze_graph.py