Skip to content

Learning Graph Quality Metrics Report

Overview

  • Total Concepts: 450
  • Foundational Concepts (no prerequisites, other concepts depend on them): 11
  • Terminal Nodes (nothing depends on them, but have prerequisites): 230
  • Orphaned Nodes (completely disconnected, no edges): 0
  • Concepts with Dependencies: 439
  • Average Dependencies per Concept: 1.41

Graph Structure Validation

  • Valid DAG Structure: ✅ Yes
  • Self-Dependencies: None detected ✅
  • Cycles Detected: 0

Foundational Concepts

These concepts have no prerequisites:

  • 1: Genetic Inference
  • 51: Genome Organization
  • 111: Linkage
  • 156: Quantitative Genetics
  • 189: Population Genetics
  • 211: Gene Expression
  • 266: Forward Genetics
  • 278: Model Organism
  • 306: Genomics
  • 346: Human Genetics
  • 391: Genetic Ethics

Dependency Chain Analysis

  • Maximum Dependency Chain Length: 10

Longest Learning Path:

  1. Genome Organization (ID: 51)
  2. Chromosome Structure (ID: 52)
  3. Chromatin (ID: 59)
  4. Nucleosome (ID: 60)
  5. Histone Proteins (ID: 61)
  6. Histone Modifications (ID: 62)
  7. Chromatin State (ID: 251)
  8. Chromatin Looping (ID: 258)
  9. Topologically Assoc Domain (ID: 257)
  10. 4D Nucleome (ID: 437)

Terminal Nodes Analysis

Terminal nodes are concepts that nothing else depends on but have prerequisites. They represent natural endpoints of learning paths — culminating or specialized concepts.

  • Total Terminal Nodes: 230 (51.1% of all concepts)
  • Healthy Range: 5-40% of total concepts

Concepts at the end of learning paths:

  • 7: Likelihood Ratio
  • 9: Autosomal Dominant Pedigree
  • 12: X-Linked Recessive Pedigree
  • 13: X-Linked Dominant Pedigree
  • 18: Variable Expressivity
  • 19: Phenocopy
  • 21: Locus Heterogeneity
  • 22: Allelic Heterogeneity
  • 24: Duplicate Epistasis
  • 25: Complementary Epistasis
  • 27: Epistatic Pathway Analysis
  • 30: Cis-Trans Test
  • 32: Functional Allelism
  • 34: Goodness of Fit Test
  • 36: Reciprocal Cross
  • 38: P-Value Interpretation
  • 39: Modified Mendelian Ratios
  • 40: Lethal Alleles
  • 42: Genetic Background Effects
  • 44: Anticipation

...and 210 more

Orphaned Nodes Analysis

Orphaned nodes are completely disconnected concepts with no inbound AND no outbound edges. These indicate a quality problem — every concept should connect to the graph.

  • Total Orphaned Nodes: 0

✅ No orphaned nodes detected. All concepts are connected to the graph.

Connected Components

  • Number of Connected Components: 1

✅ All concepts are connected in a single graph.

Indegree Analysis

Top 10 concepts that are prerequisites for the most other concepts:

Rank Concept ID Concept Label Indegree
1 306 Genomics 15
2 52 Chromosome Structure 13
3 1 Genetic Inference 12
4 211 Gene Expression 11
5 346 Human Genetics 11
6 391 Genetic Ethics 11
7 23 Epistasis 10
8 189 Population Genetics 10
9 73 Genetic Variation 9
10 77 Structural Variation 9

Outdegree Distribution

Dependencies Number of Concepts
0 11
1 268
2 164
3 7

Recommendations

  • ℹ️ High terminal node percentage (51.1%): Consider if some terminal concepts should be prerequisites for advanced concepts
  • DAG structure verified: Graph supports valid learning progressions
  • ℹ️ Consider adding cross-dependencies: More connections could create richer learning pathways

Report generated by learning-graph-reports/analyze_graph.py