Taxonomy Distribution Report
This report shows how the 200 concepts are distributed across taxonomy categories.
Summary Statistics
- Total Concepts: 200
- Number of Categories: 13
- Average Concepts per Category: 15.4
- Largest Category: Patient Data (25 concepts, 12.5%)
- Smallest Category: Capstone & Career (5 concepts, 2.5%)
Distribution by Category
| Rank | Taxonomy ID | Category Name | Count | Percentage | Visualization |
|---|---|---|---|---|---|
| 1 | PAT | Patient Data | 25 | 12.5% | ██████ |
| 2 | PROV | Provider Operations | 25 | 12.5% | ██████ |
| 3 | HCARE | Healthcare Domain | 20 | 10.0% | █████ |
| 4 | PAYER | Payer & Insurance | 20 | 10.0% | █████ |
| 5 | FOUND | Foundation Concepts | 15 | 7.5% | ███ |
| 6 | FIN | Financial & Business | 15 | 7.5% | ███ |
| 7 | FRAUD | Fraud & Compliance | 15 | 7.5% | ███ |
| 8 | ANAL | Graph Analytics | 15 | 7.5% | ███ |
| 9 | AI | AI & Machine Learning | 15 | 7.5% | ███ |
| 10 | GTECH | Graph Technologies | 10 | 5.0% | ██ |
| 11 | SEC | Security & Privacy | 10 | 5.0% | ██ |
| 12 | GOV | Data Governance | 10 | 5.0% | ██ |
| 13 | CAP | Capstone & Career | 5 | 2.5% | █ |
Balance Assessment
⚠ Under-represented categories (<3% of total):
- Capstone & Career: 5 concepts (2.5%)
Recommendations
- Consider merging under-represented categories with related topics or expanding content.
Report generated by taxonomy-distribution.py