Taxonomy Distribution Report
Overview
- Total Concepts: 254
- Number of Taxonomies: 16
- Average Concepts per Taxonomy: 15.9
Distribution Summary
| Category | TaxonomyID | Count | Percentage | Status |
|---|---|---|---|---|
| ATAM | ATAM | 22 | 8.7% | ✅ |
| REL | REL | 20 | 7.9% | ✅ |
| DIST | DIST | 20 | 7.9% | ✅ |
| Foundation Concepts - Prerequisites | FOUND | 18 | 7.1% | ✅ |
| ACID | ACID | 18 | 7.1% | ✅ |
| SCALE | SCALE | 18 | 7.1% | ✅ |
| NACID | NACID | 16 | 6.3% | ✅ |
| ANAL | ANAL | 15 | 5.9% | ✅ |
| GRAPH | GRAPH | 15 | 5.9% | ✅ |
| HA | HA | 15 | 5.9% | ✅ |
| LLM | LLM | 15 | 5.9% | ✅ |
| VEC | VEC | 14 | 5.5% | ✅ |
| KV | KV | 12 | 4.7% | ✅ |
| COL | COL | 12 | 4.7% | ✅ |
| DOC | DOC | 12 | 4.7% | ✅ |
| SEL | SEL | 12 | 4.7% | ✅ |
Visual Distribution
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 | |
Balance Analysis
✅ No Over-Represented Categories
All categories are under the 30% threshold. Good balance!
Category Details
ATAM (ATAM)
Count: 22 concepts (8.7%)
Concepts:
-
- Architecture Tradeoff Analysis
-
- Quality Attribute Workshop
-
- Utility Tree
-
- Quality Attribute Scenario
-
- Architectural Driver
-
- Sensitivity Point
-
- Tradeoff Point
-
- Architectural Risk
-
- Non-Risk
-
- Risk Theme
-
- Utility Tree Prioritization
-
- ATAM Stakeholder Roles
-
- Business Driver
-
- Architectural Approach
-
- ATAM Evaluation Team
- ...and 7 more
REL (REL)
Count: 20 concepts (7.9%)
Concepts:
-
- Relational Data Model
-
- SQL
-
- Primary Key
-
- Foreign Key
-
- Normalization
-
- First Normal Form
-
- Third Normal Form
-
- Join Operation
-
- B-Tree Index
-
- Query Execution Plan
-
- Stored Procedure
-
- Database View
-
- Transaction Log
-
- Write-Ahead Log
-
- Multi-Version Concurrency
- ...and 5 more
DIST (DIST)
Count: 20 concepts (7.9%)
Concepts:
-
- CAP Theorem
-
- Consistency (CAP)
-
- Availability (CAP)
-
- Partition Tolerance
-
- CP Database System
-
- AP Database System
-
- PACELC Model
-
- Latency-Consistency Tradeoff
-
- Eventual Consistency
-
- Strong Consistency
-
- Read-Your-Writes Consistency
-
- Monotonic Read Consistency
-
- Causal Consistency
-
- Session Consistency
-
- Linearizability
- ...and 5 more
Foundation Concepts - Prerequisites (FOUND)
Count: 18 concepts (7.1%)
Concepts:
-
- Database Management System
-
- Data Model
-
- Schema
-
- Query Language
-
- Database Index
-
- Query Optimizer
-
- Storage Engine
-
- Data Serialization
-
- Workload Characterization
-
- Read/Write Ratio
-
- Latency vs Throughput
-
- OLTP Workload
-
- OLAP Workload
-
- HTAP Workload
-
- Data Volume Scaling
- ...and 3 more
ACID (ACID)
Count: 18 concepts (7.1%)
Concepts:
-
- Atomicity
-
- Consistency (ACID)
-
- Isolation
-
- Durability
-
- Transaction Isolation Level
-
- Read Uncommitted
-
- Read Committed
-
- Repeatable Read
-
- Serializable Isolation
-
- Dirty Read
-
- Phantom Read
-
- Non-Repeatable Read
-
- Two-Phase Locking
-
- Optimistic Concurrency Control
-
- Savepoint
- ...and 3 more
SCALE (SCALE)
Count: 18 concepts (7.1%)
Concepts:
-
- Horizontal Scaling
-
- Vertical Scaling
-
- Data Sharding
-
- Range-Based Sharding
-
- Hash-Based Sharding
-
- Directory-Based Sharding
-
- Geographic Sharding
-
- Database Replication
-
- Single-Leader Replication
-
- Multi-Leader Replication
-
- Leaderless Replication
-
- Quorum Read/Write
-
- Read Replica
-
- Replication Lag
-
- Split-Brain Problem
- ...and 3 more
NACID (NACID)
Count: 16 concepts (6.3%)
Concepts:
-
- Two-Phase Commit
-
- Three-Phase Commit
-
- Distributed Transaction Coordinator
-
- Saga Orchestration
-
- Saga Choreography
-
- Compensating Transaction
-
- NewSQL Database
-
- Google Spanner
-
- CockroachDB
-
- YugabyteDB
-
- TrueTime API
-
- Global Transaction ID
-
- Cross-Shard Transaction
-
- Paxos Protocol
-
- Raft Consensus Protocol
- ...and 1 more
ANAL (ANAL)
Count: 15 concepts (5.9%)
Concepts:
-
- Columnar Storage
-
- Massively Parallel Processing
-
- Data Warehouse
-
- Star Schema
-
- Snowflake Schema
-
- OLAP Cube
-
- ETL Pipeline
-
- Inmon Architecture
-
- Kimball Architecture
-
- Bitmap Index
-
- Materialized View
-
- Query Pushdown
-
- Snowflake Database
-
- BigQuery
-
- Apache Parquet Format
GRAPH (GRAPH)
Count: 15 concepts (5.9%)
Concepts:
-
- Property Graph Model
-
- RDF Graph Model
-
- SPARQL
-
- Cypher Query Language
-
- Graph Traversal Algorithm
-
- Depth-First Search
-
- Breadth-First Search
-
- Shortest Path Algorithm
-
- Neo4j
-
- TigerGraph
-
- Amazon Neptune
-
- Distributed Graph Database
-
- Graph Partitioning
-
- GSQL
-
- Knowledge Graph
HA (HA)
Count: 15 concepts (5.9%)
Concepts:
-
- High Availability
-
- Five-Nines Availability
-
- SLA Decomposition
-
- Failure Domain
-
- Single Point of Failure
-
- Active-Active Clustering
-
- Active-Passive Clustering
-
- Failover
-
- Heartbeat Monitoring
-
- Chaos Engineering
-
- Mean Time Between Failures
-
- Mean Time to Recovery
-
- Geographic Redundancy
-
- Multi-Region Deployment
-
- Circuit Breaker Pattern
LLM (LLM)
Count: 15 concepts (5.9%)
Concepts:
-
- Large Language Model
-
- Transformer Architecture
-
- Tokenization
-
- Attention Mechanism
-
- CLS Token Pooling
-
- Mean Pooling
-
- Embedding Model Selection
-
- OpenAI Embeddings API
-
- Sentence Transformers
-
- Self-Hosted Embedding Model
-
- Embedding Cost at Scale
-
- Re-Embedding Migration
-
- Multimodal Embedding
-
- Embedding Pipeline Architecture
-
- Embedding Model Versioning
VEC (VEC)
Count: 14 concepts (5.5%)
Concepts:
-
- Vector Embedding
-
- Embedding Dimensionality
-
- Cosine Similarity
-
- Dot Product Similarity
-
- Euclidean Distance
-
- Approximate Nearest Neighbor
-
- HNSW Index
-
- IVF Index
-
- Flat Vector Index
-
- pgvector Extension
-
- Semantic Search
-
- Hybrid Search
-
- Native Vector Search Feature
-
- ANN Recall vs Speed
KV (KV)
Count: 12 concepts (4.7%)
Concepts:
-
- Key-Value Data Model
-
- Hash Table Storage
-
- Time-to-Live
-
- Cache Eviction Policy
-
- Redis
-
- DynamoDB
-
- Read-Through Cache
-
- Write-Through Cache
-
- Write-Behind Cache
-
- Cache Stampede
-
- Hot Key Problem
-
- Key Expiration
COL (COL)
Count: 12 concepts (4.7%)
Concepts:
-
- Column-Family Data Model
-
- Wide Row
-
- LSM Tree
-
- Compaction Strategy
-
- Bloom Filter
-
- Apache Cassandra
-
- Apache HBase
-
- Partition Key
-
- Clustering Column
-
- Write-Optimized Storage
-
- Read Amplification
-
- Write Amplification
DOC (DOC)
Count: 12 concepts (4.7%)
Concepts:
-
- Document Data Model
-
- JSON Document Storage
-
- BSON Format
-
- Embedded Document
-
- Document Reference
-
- Aggregation Pipeline
-
- Schema Flexibility
-
- MongoDB
-
- Couchbase
-
- Compound Index
-
- Full-Text Search Index
-
- Change Stream
SEL (SEL)
Count: 12 concepts (4.7%)
Concepts:
-
- Polyglot Persistence
-
- Database Selection Framework
-
- Scoring Matrix
-
- Total Cost of Ownership
-
- Vendor Lock-In Risk
-
- Database Migration Plan
-
- Schema Migration
-
- Multi-Model Database
-
- Operational Runbook
-
- Team Expertise Factor
-
- Database Deprecation Risk
-
- Data Access Pattern Analysis
Recommendations
- ✅ Excellent balance: Categories are evenly distributed (spread: 3.9%)
- ✅ MISC category minimal: Good categorization specificity
Educational Use Recommendations
- Use taxonomy categories for color-coding in graph visualizations
- Design curriculum modules based on taxonomy groupings
- Create filtered views for focused learning paths
- Use categories for assessment organization
- Enable navigation by topic area in interactive tools
Report generated by learning-graph-reports/taxonomy_distribution.py