Learning Graph Quality Metrics Report
Overview
- Total Concepts: 580
- Foundational Concepts (no prerequisites, other concepts depend on them): 5
- Terminal Nodes (nothing depends on them, but have prerequisites): 240
- Orphaned Nodes (completely disconnected, no edges): 0
- Concepts with Dependencies: 575
- Average Dependencies per Concept: 1.65 (averaged over the 575 concepts that have dependencies)
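The node categories above can be sketched in a few lines. This is a minimal illustration, not the code in analyze_graph.py: the function name `classify_nodes`, the `(concept, prerequisite)` edge format, and the toy graph are all assumptions made for the example.

```python
from collections import defaultdict

def classify_nodes(nodes, edges):
    """Split concepts into foundational, terminal, and orphaned groups.

    Each edge is a (concept, prerequisite) pair (an assumed format).
    """
    prereqs = defaultdict(set)     # concept -> its prerequisites
    dependents = defaultdict(set)  # concept -> concepts that require it
    for concept, prerequisite in edges:
        prereqs[concept].add(prerequisite)
        dependents[prerequisite].add(concept)

    # Foundational: no prerequisites, but something depends on it.
    foundational = [n for n in nodes if not prereqs[n] and dependents[n]]
    # Terminal: has prerequisites, but nothing depends on it.
    terminal = [n for n in nodes if prereqs[n] and not dependents[n]]
    # Orphaned: no edges in either direction.
    orphaned = [n for n in nodes if not prereqs[n] and not dependents[n]]
    return foundational, terminal, orphaned

# Hypothetical toy graph, not the 580-concept graph in this report.
nodes = ["Data", "Data Modeling", "Entity", "Unlinked"]
edges = [("Data Modeling", "Data"), ("Entity", "Data Modeling")]
foundational, terminal, orphaned = classify_nodes(nodes, edges)
```

In the toy graph, "Data" is foundational, "Entity" is terminal, and "Unlinked" is orphaned; "Data Modeling" falls into none of the three groups because it has edges in both directions.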
Graph Structure Validation
- Valid DAG Structure: ✅ Yes
- Self-Dependencies: None detected ✅
- Cycles Detected: 0
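One common way to run these checks is Kahn's algorithm: repeatedly remove concepts whose prerequisites are all satisfied, and report a cycle if any concept is never removed. The sketch below assumes `(concept, prerequisite)` edge pairs and uses hypothetical helper names; the report does not state which method the generator script uses.

```python
from collections import deque

def is_dag(nodes, edges):
    """Return True if the (concept, prerequisite) edges contain no cycle."""
    remaining = {n: 0 for n in nodes}    # unmet prerequisites per concept
    dependents = {n: [] for n in nodes}  # prerequisite -> concepts needing it
    for concept, prerequisite in edges:
        remaining[concept] += 1
        dependents[prerequisite].append(concept)

    # Start from concepts with no prerequisites (Kahn's algorithm).
    ready = deque(n for n in nodes if remaining[n] == 0)
    visited = 0
    while ready:
        node = ready.popleft()
        visited += 1
        for dependent in dependents[node]:
            remaining[dependent] -= 1
            if remaining[dependent] == 0:
                ready.append(dependent)
    # Concepts on a cycle never reach zero unmet prerequisites.
    return visited == len(nodes)

def self_dependencies(edges):
    """List concepts that name themselves as a prerequisite."""
    return [concept for concept, prerequisite in edges if concept == prerequisite]

# Hypothetical examples: a valid chain and a two-concept cycle.
chain_ok = is_dag(["A", "B", "C"], [("B", "A"), ("C", "B")])
cycle_ok = is_dag(["A", "B"], [("A", "B"), ("B", "A")])
```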
Foundational Concepts
These concepts have no prerequisites:
- 1: Information System
- 2: Data
- 6: Hardware
- 7: Software
- 11: Organization
Dependency Chain Analysis
- Maximum Dependency Chain Length: 19 concepts (18 prerequisite links)
Longest Learning Path:
- Data (ID: 2)
- Data Modeling (ID: 85)
- Entity (ID: 90)
- Attribute (ID: 91)
- Primary Key (ID: 94)
- Foreign Key (ID: 95)
- Normalization (ID: 98)
- Relational Database (ID: 115)
- Analytical Database (ID: 119)
- OLTP vs OLAP (ID: 120)
- Data Warehouse (ID: 121)
- Data Lake (ID: 143)
- Data Lakehouse (ID: 144)
- ELT Process (ID: 146)
- Data Pipeline (ID: 147)
- Data Lineage (ID: 150)
- AI Data Provenance (ID: 451)
- Training Data Documentation (ID: 452)
- Datasheet for Datasets (ID: 453)
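A longest chain like the one above can be found with a memoized depth-first search over prerequisites, which is valid because the graph is a DAG. The following is a sketch under assumptions: the `prereqs` mapping format and the function name `longest_chain` are hypothetical, and the toy data is not the report's graph.

```python
from functools import lru_cache

def longest_chain(prereqs):
    """Return the longest prerequisite chain, foundation first.

    prereqs maps each concept to a list of its prerequisites (assumed
    format). The graph must be acyclic, or the recursion never ends.
    """
    @lru_cache(maxsize=None)
    def best(node):
        # Longest chain ending at `node`: extend the longest chain
        # that ends at any one of its prerequisites.
        chains = [best(p) for p in prereqs.get(node, [])]
        return max(chains, key=len, default=()) + (node,)

    return list(max((best(n) for n in prereqs), key=len, default=()))

# Hypothetical toy graph.
prereqs = {
    "Data": [],
    "Hardware": [],
    "Data Modeling": ["Data"],
    "Entity": ["Data Modeling"],
}
path = longest_chain(prereqs)
```

Here the chain length is counted in concepts, matching the 19 entries listed above; the number of prerequisite links is one less.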
Terminal Nodes Analysis
Terminal nodes are concepts that have prerequisites but that no other concept depends on. They represent natural endpoints of learning paths: culminating or specialized concepts.
- Total Terminal Nodes: 240 (41.4% of all concepts)
- Healthy Range: 5-40% of total concepts
Concepts at the end of learning paths:
- 5: DIKW Hierarchy
- 17: Sociotechnical System
- 22: Competitive Advantage
- 23: Business Capability
- 25: COBIT Framework
- 28: CTO Role
- 30: CISO Role
- 35: Professional Responsibility
- 39: N-Tier Architecture
- 40: Presentation Tier
- 41: Application Tier
- 42: Data Tier
- 44: TOGAF Framework
- 45: Zachman Framework
- 46: Reference Architecture
- 47: Solution Architecture
- 53: Monolithic Architecture
- 56: GraphQL API
- 57: Webhooks
- 58: OpenAPI Specification
...and 220 more
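The percentage and the healthy-range comparison are simple arithmetic. The sketch below reproduces them from the counts in this report; the function name is hypothetical, and treating the 5-40% bounds as inclusive is an assumption.

```python
def terminal_share(terminal_count, total_count, healthy_range=(5.0, 40.0)):
    """Return (percentage rounded to 0.1, whether it is within the range)."""
    pct = 100.0 * terminal_count / total_count
    low, high = healthy_range  # bounds assumed inclusive
    return round(pct, 1), low <= pct <= high

# Counts taken from this report: 240 terminal nodes out of 580 concepts.
pct, healthy = terminal_share(240, 580)
```

With these inputs, `pct` is 41.4 and `healthy` is `False`, since 41.4% sits just above the upper bound.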
Orphaned Nodes Analysis
Orphaned nodes are completely disconnected concepts, with neither inbound nor outbound edges. They indicate a quality problem, because every concept should connect to the graph.
- Total Orphaned Nodes: 0
✅ No orphaned nodes detected. All concepts are connected to the graph.
Connected Components
- Number of Connected Components: 1
✅ All concepts are connected in a single graph.
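Connected components are counted on the undirected view of the graph, ignoring edge direction. A minimal sketch using an iterative traversal follows; the function name and the toy data are hypothetical, and the generator script may compute this differently.

```python
def count_components(nodes, edges):
    """Count connected components, treating every edge as undirected."""
    adjacency = {n: set() for n in nodes}
    for a, b in edges:
        adjacency[a].add(b)
        adjacency[b].add(a)

    seen = set()
    components = 0
    for start in nodes:
        if start in seen:
            continue
        # Each unvisited start node opens a new component.
        components += 1
        seen.add(start)
        stack = [start]
        while stack:
            node = stack.pop()
            for neighbor in adjacency[node]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    stack.append(neighbor)
    return components

# Hypothetical toy graph: two separate pairs, then one linking edge.
split = count_components(["A", "B", "C", "D"], [("B", "A"), ("D", "C")])
joined = count_components(["A", "B", "C", "D"], [("B", "A"), ("D", "C"), ("C", "B")])
```

An orphaned node would appear as a component of size one, so a result of 1 for a graph with more than one concept also implies there are no orphans.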
Indegree Analysis
The 10 concepts that serve as prerequisites for the most other concepts (indegree is the number of concepts that depend directly on each one):
| Rank | Concept ID | Concept Label | Indegree |
|---|---|---|---|
| 1 | 1 | Information System | 16 |
| 2 | 271 | GDPR | 15 |
| 3 | 7 | Software | 14 |
| 4 | 234 | CIA Triad | 13 |
| 5 | 405 | Artificial Intelligence | 13 |
| 6 | 11 | Organization | 12 |
| 7 | 406 | Machine Learning | 12 |
| 8 | 476 | AI Threat Landscape | 12 |
| 9 | 2 | Data | 11 |
| 10 | 212 | Cloud Computing | 11 |
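A ranking like this can be produced by counting how often each concept appears as a prerequisite. The sketch assumes `(concept, prerequisite)` edge pairs and a hypothetical function name; it is an illustration, not the script's implementation.

```python
from collections import Counter

def top_prerequisites(edges, k=10):
    """Return the k concepts with the highest indegree as (concept, count)."""
    indegree = Counter(prerequisite for _, prerequisite in edges)
    return indegree.most_common(k)

# Hypothetical toy edges: "A" is required twice, "B" once.
edges = [("B", "A"), ("C", "A"), ("C", "B")]
ranking = top_prerequisites(edges, k=2)
```

`Counter.most_common` breaks ties by first-seen order, so a report that needs a stable ranking among equal indegrees (ranks 4 and 5 above, for example) would need an explicit secondary sort key.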
Outdegree Distribution
| Dependencies per Concept (Outdegree) | Number of Concepts |
|---|---|
| 0 | 5 |
| 1 | 237 |
| 2 | 310 |
| 3 | 25 |
| 4 | 1 |
| 5 | 2 |
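The table can be cross-checked against the Overview figures with a few lines of arithmetic. The counts below are copied from the table. Reading the reported average of 1.65 as covering only the concepts that have dependencies is an inference from these numbers, not something the report states.

```python
# Counts copied from the Outdegree Distribution table: outdegree -> concepts.
distribution = {0: 5, 1: 237, 2: 310, 3: 25, 4: 1, 5: 2}

total_concepts = sum(distribution.values())                      # 580
total_edges = sum(d * count for d, count in distribution.items())  # 946
with_dependencies = total_concepts - distribution[0]             # 575

# Average over every concept vs. only those with at least one dependency.
avg_all = round(total_edges / total_concepts, 2)                 # 1.63
avg_with_dependencies = round(total_edges / with_dependencies, 2)  # 1.65
```

The row totals match the 580 concepts and 575 concepts with dependencies in the Overview, and the 1.65 average is reproduced when the 946 edges are divided by 575 rather than 580.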
Recommendations
- ℹ️ High terminal node percentage (41.4%, just above the 5-40% healthy range): Consider whether some terminal concepts should serve as prerequisites for more advanced concepts
- ✅ DAG structure verified: Graph supports valid learning progressions
- ℹ️ Long dependency chains (up to 19 concepts): Ensure students can follow extended learning paths
Report generated by learning-graph-reports/analyze_graph.py