Taxonomy Distribution Report
Overview
- Total Concepts: 450
- Number of Taxonomies: 14
- Average Concepts per Taxonomy: 32.1
Distribution Summary
| Category | TaxonomyID | Count | Percentage | Status |
|---|---|---|---|---|
| MAP | MAP | 46 | 10.2% | ✅ |
| REG | REG | 45 | 10.0% | ✅ |
| CLIN | CLIN | 45 | 10.0% | ✅ |
| PED | PED | 40 | 8.9% | ✅ |
| BIOINFO | BIOINFO | 40 | 8.9% | ✅ |
| EXP | EXP | 39 | 8.7% | ✅ |
| GVAR | GVAR | 38 | 8.4% | ✅ |
| QUANT | QUANT | 32 | 7.1% | ✅ |
| GSTR | GSTR | 28 | 6.2% | ✅ |
| ETHICS | ETHICS | 24 | 5.3% | ✅ |
| Foundation Concepts - Prerequisites | FOUND | 22 | 4.9% | ✅ |
| POP | POP | 21 | 4.7% | ✅ |
| FRONT | FRONT | 20 | 4.4% | ✅ |
| PROB | PROB | 10 | 2.2% | ℹ️ Under |
Visual Distribution
1 2 3 4 5 6 7 8 9 10 11 12 13 14 | |
Balance Analysis
✅ No Over-Represented Categories
All categories are under the 30% threshold. Good balance!
ℹ️ Under-Represented Categories (<3%)
- PROB (PROB): 10 concepts (2.2%)
- Note: Small categories are acceptable for specialized topics
Category Details
MAP (MAP)
Count: 46 concepts (10.2%)
Concepts:
-
- Genetic Linkage
-
- Recombination
-
- Crossing Over
-
- Recombination Frequency
-
- Genetic Map
-
- Map Distance
-
- Centimorgan
-
- Two-Point Cross
-
- Three-Point Cross
-
- Interference
-
- Coefficient of Coincidence
-
- Gene Order Determination
-
- Genetic Markers
-
- Molecular Markers
-
- Restriction Fragment Length
- ...and 31 more
REG (REG)
Count: 45 concepts (10.0%)
Concepts:
-
- Transcription Regulation
-
- Promoter
-
- TATA Box
-
- Transcription Factor
-
- General Transcription Factor
-
- Specific Transcription Factor
-
- Activator
-
- Repressor
-
- Enhancer
-
- Silencer
-
- Insulator
-
- Cis-Regulatory Element
-
- Trans-Acting Factor
-
- Transcriptional Logic
-
- Combinatorial Control
- ...and 30 more
CLIN (CLIN)
Count: 45 concepts (10.0%)
Concepts:
-
- Mendelian Disease
-
- Complex Disease
-
- Genetic Counseling
-
- Risk Assessment
-
- Carrier Screening
-
- Newborn Screening
-
- Prenatal Genetic Testing
-
- Preimplantation Diagnosis
-
- Family History Assessment
-
- Genetic Testing Types
-
- Diagnostic Testing
-
- Predictive Testing
-
- Presymptomatic Testing
-
- Drug Metabolism Variation
-
- CYP450 Polymorphisms
- ...and 30 more
PED (PED)
Count: 40 concepts (8.9%)
Concepts:
-
- Pedigree Analysis
-
- Autosomal Dominant Pedigree
-
- Autosomal Recessive Pedigree
-
- X-Linked Inheritance
-
- X-Linked Recessive Pedigree
-
- X-Linked Dominant Pedigree
-
- Carrier Probability
-
- Penetrance
-
- Incomplete Penetrance
-
- Expressivity
-
- Variable Expressivity
-
- Phenocopy
-
- Genetic Heterogeneity
-
- Locus Heterogeneity
-
- Allelic Heterogeneity
- ...and 25 more
BIOINFO (BIOINFO)
Count: 40 concepts (8.9%)
Concepts:
-
- Genome Sequencing
-
- Sanger Sequencing
-
- Next-Gen Sequencing
-
- Illumina Sequencing
-
- Long-Read Sequencing
-
- Whole Genome Sequencing
-
- Whole Exome Sequencing
-
- Targeted Sequencing
-
- Sequence Alignment
-
- BLAST Algorithm
-
- Pairwise Alignment
-
- Multiple Sequence Alignment
-
- Genome Annotation
-
- Gene Prediction
-
- Variant Calling
- ...and 25 more
EXP (EXP)
Count: 39 concepts (8.7%)
Concepts:
-
- Reverse Genetics
-
- Mutagenesis Screen
-
- Chemical Mutagenesis
-
- EMS Mutagenesis
-
- Insertional Mutagenesis
-
- Saturation Mutagenesis
-
- Enhancer Trap
-
- Suppressor Screen
-
- Modifier Screen
-
- Genetic Mosaic Analysis
-
- Clonal Analysis
-
- Drosophila Genetics
-
- Yeast Genetics
-
- Mouse Genetics
-
- C. Elegans Genetics
- ...and 24 more
GVAR (GVAR)
Count: 38 concepts (8.4%)
Concepts:
-
- Genetic Variation
-
- Single Nucleotide Polymorphism
-
- Insertion Deletion Variant
-
- Copy Number Variation
-
- Structural Variation
-
- Chromosomal Inversion
-
- Chromosomal Translocation
-
- Chromosomal Deletion
-
- Chromosomal Duplication
-
- Tandem Repeat
-
- Short Tandem Repeat
-
- Microsatellite
-
- Minisatellite
-
- Variable Number Tandem Repeat
-
- Haplotype
- ...and 23 more
QUANT (QUANT)
Count: 32 concepts (7.1%)
Concepts:
-
- Quantitative Trait
-
- Continuous Variation
-
- Polygenic Inheritance
-
- Multifactorial Trait
-
- Threshold Trait
-
- Heritability
-
- Broad Sense Heritability
-
- Narrow Sense Heritability
-
- Additive Genetic Variance
-
- Dominance Variance
-
- Epistatic Variance
-
- Environmental Variance
-
- Phenotypic Variance
-
- Twin Studies
-
- Monozygotic Twins
- ...and 17 more
GSTR (GSTR)
Count: 28 concepts (6.2%)
Concepts:
-
- Chromosome Structure
-
- Euchromatin
-
- Heterochromatin
-
- Constitutive Heterochromatin
-
- Facultative Heterochromatin
-
- Centromere Structure
-
- Telomere Structure
-
- Chromatin
-
- Nucleosome
-
- Histone Proteins
-
- Histone Modifications
-
- Histone Acetylation
-
- Histone Methylation
-
- Chromatin Remodeling
-
- Epigenetics
- ...and 13 more
ETHICS (ETHICS)
Count: 24 concepts (5.3%)
Concepts:
-
- Informed Consent
-
- Genetic Privacy
-
- Genetic Discrimination
-
- GINA Legislation
-
- Data Ownership
-
- Biobank Ethics
-
- Return of Results
-
- Incidental Findings
-
- Duty to Warn
-
- Equity in Genomic Medicine
-
- Health Disparities
-
- Reference Genome Bias
-
- Ancestry and Identity
-
- Gene Editing Ethics
-
- Germline Editing Debate
- ...and 9 more
Foundation Concepts - Prerequisites (FOUND)
Count: 22 concepts (4.9%)
Concepts:
-
- Genetic Inference
-
- Genome Organization
-
- Linkage
-
- Quantitative Genetics
-
- Population Genetics
-
- Gene Expression
-
- Stem Cell Gene Expression
-
- Forward Genetics
-
- Model Organism
-
- Functional Genomics
-
- Genomics
-
- Version Control in Genomics
-
- Human Genetics
-
- Pharmacogenomics
-
- Genetic Ethics
- ...and 7 more
POP (POP)
Count: 21 concepts (4.7%)
Concepts:
-
- Allele Frequency
-
- Genotype Frequency
-
- Hardy-Weinberg Equilibrium
-
- Hardy-Weinberg Assumptions
-
- Chi-Square HWE Test
-
- Natural Selection
-
- Fitness
-
- Selection Coefficient
-
- Directional Selection
-
- Stabilizing Selection
-
- Disruptive Selection
-
- Balancing Selection
-
- Heterozygote Advantage
-
- Genetic Drift
-
- Bottleneck Effect
- ...and 6 more
FRONT (FRONT)
Count: 20 concepts (4.4%)
Concepts:
-
- CRISPR Advancements
-
- CRISPR Therapeutics
-
- In Vivo Gene Editing
-
- Epigenome Editing
-
- Single-Cell RNA Sequencing
-
- Spatial Transcriptomics
-
- Cell Atlas Projects
-
- Machine Learning Variants
-
- Large Language Models Bio
-
- Protein Structure AI
-
- Pangenome
-
- Pangenome Reference
-
- Structural Variant Calling
-
- Telomere-to-Telomere
-
- Microbiome Genetics
- ...and 5 more
PROB (PROB)
Count: 10 concepts (2.2%)
Concepts:
-
- Probability in Genetics
-
- Conditional Probability
-
- Bayesian Reasoning
-
- Prior Probability
-
- Posterior Probability
-
- Likelihood Ratio
-
- Chi-Square Test
-
- Goodness of Fit Test
-
- Null Hypothesis in Genetics
-
- P-Value Interpretation
Recommendations
- ✅ Excellent balance: Categories are evenly distributed (spread: 8.0%)
- ✅ MISC category minimal: Good categorization specificity
Educational Use Recommendations
- Use taxonomy categories for color-coding in graph visualizations
- Design curriculum modules based on taxonomy groupings
- Create filtered views for focused learning paths
- Use categories for assessment organization
- Enable navigation by topic area in interactive tools
Report generated by learning-graph-reports/taxonomy_distribution.py