Learning Graph Quality Metrics Report
Overview
- Total Concepts: 450
- Foundational Concepts (no prerequisites, other concepts depend on them): 11
- Terminal Nodes (nothing depends on them, but have prerequisites): 230
- Orphaned Nodes (completely disconnected, no edges): 0
- Concepts with Dependencies: 439
- Average Dependencies per Concept: 1.41
Graph Structure Validation
- Valid DAG Structure: ✅ Yes
- Self-Dependencies: None detected ✅
- Cycles Detected: 0
Foundational Concepts
These concepts have no prerequisites:
- 1: Genetic Inference
- 51: Genome Organization
- 111: Linkage
- 156: Quantitative Genetics
- 189: Population Genetics
- 211: Gene Expression
- 266: Forward Genetics
- 278: Model Organism
- 306: Genomics
- 346: Human Genetics
- 391: Genetic Ethics
Dependency Chain Analysis
- Maximum Dependency Chain Length: 10
Longest Learning Path:
- Genome Organization (ID: 51)
- Chromosome Structure (ID: 52)
- Chromatin (ID: 59)
- Nucleosome (ID: 60)
- Histone Proteins (ID: 61)
- Histone Modifications (ID: 62)
- Chromatin State (ID: 251)
- Chromatin Looping (ID: 258)
- Topologically Assoc Domain (ID: 257)
- 4D Nucleome (ID: 437)
Terminal Nodes Analysis
Terminal nodes are concepts that nothing else depends on but have prerequisites. They represent natural endpoints of learning paths — culminating or specialized concepts.
- Total Terminal Nodes: 230 (51.1% of all concepts)
- Healthy Range: 5-40% of total concepts
Concepts at the end of learning paths:
- 7: Likelihood Ratio
- 9: Autosomal Dominant Pedigree
- 12: X-Linked Recessive Pedigree
- 13: X-Linked Dominant Pedigree
- 18: Variable Expressivity
- 19: Phenocopy
- 21: Locus Heterogeneity
- 22: Allelic Heterogeneity
- 24: Duplicate Epistasis
- 25: Complementary Epistasis
- 27: Epistatic Pathway Analysis
- 30: Cis-Trans Test
- 32: Functional Allelism
- 34: Goodness of Fit Test
- 36: Reciprocal Cross
- 38: P-Value Interpretation
- 39: Modified Mendelian Ratios
- 40: Lethal Alleles
- 42: Genetic Background Effects
- 44: Anticipation
...and 210 more
Orphaned Nodes Analysis
Orphaned nodes are completely disconnected concepts with no inbound AND no outbound edges. These indicate a quality problem — every concept should connect to the graph.
- Total Orphaned Nodes: 0
✅ No orphaned nodes detected. All concepts are connected to the graph.
Connected Components
- Number of Connected Components: 1
✅ All concepts are connected in a single graph.
Indegree Analysis
Top 10 concepts that are prerequisites for the most other concepts:
| Rank | Concept ID | Concept Label | Indegree |
|---|---|---|---|
| 1 | 306 | Genomics | 15 |
| 2 | 52 | Chromosome Structure | 13 |
| 3 | 1 | Genetic Inference | 12 |
| 4 | 211 | Gene Expression | 11 |
| 5 | 346 | Human Genetics | 11 |
| 6 | 391 | Genetic Ethics | 11 |
| 7 | 23 | Epistasis | 10 |
| 8 | 189 | Population Genetics | 10 |
| 9 | 73 | Genetic Variation | 9 |
| 10 | 77 | Structural Variation | 9 |
Outdegree Distribution
| Dependencies | Number of Concepts |
|---|---|
| 0 | 11 |
| 1 | 268 |
| 2 | 164 |
| 3 | 7 |
Recommendations
- ℹ️ High terminal node percentage (51.1%): Consider if some terminal concepts should be prerequisites for advanced concepts
- ✅ DAG structure verified: Graph supports valid learning progressions
- ℹ️ Consider adding cross-dependencies: More connections could create richer learning pathways
Report generated by learning-graph-reports/analyze_graph.py