Skip to content

Taxonomy Distribution Report

Overview

  • Total Concepts: 254
  • Number of Taxonomies: 16
  • Average Concepts per Taxonomy: 15.9

Distribution Summary

Category TaxonomyID Count Percentage Status
ATAM ATAM 22 8.7%
REL REL 20 7.9%
DIST DIST 20 7.9%
Foundation Concepts - Prerequisites FOUND 18 7.1%
ACID ACID 18 7.1%
SCALE SCALE 18 7.1%
NACID NACID 16 6.3%
ANAL ANAL 15 5.9%
GRAPH GRAPH 15 5.9%
HA HA 15 5.9%
LLM LLM 15 5.9%
VEC VEC 14 5.5%
KV KV 12 4.7%
COL COL 12 4.7%
DOC DOC 12 4.7%
SEL SEL 12 4.7%

Visual Distribution

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
ATAM                      ████  22 (  8.7%)
REL                       ███  20 (  7.9%)
DIST                      ███  20 (  7.9%)
Foundation Concepts - Pre ███  18 (  7.1%)
ACID                      ███  18 (  7.1%)
SCALE                     ███  18 (  7.1%)
NACID                     ███  16 (  6.3%)
ANAL                      ██  15 (  5.9%)
GRAPH                     ██  15 (  5.9%)
HA                        ██  15 (  5.9%)
LLM                       ██  15 (  5.9%)
VEC                       ██  14 (  5.5%)
KV                        ██  12 (  4.7%)
COL                       ██  12 (  4.7%)
DOC                       ██  12 (  4.7%)
SEL                       ██  12 (  4.7%)

Balance Analysis

✅ No Over-Represented Categories

All categories are under the 30% threshold. Good balance!

Category Details

ATAM (ATAM)

Count: 22 concepts (8.7%)

Concepts:

    1. Architecture Tradeoff Analysis
    1. Quality Attribute Workshop
    1. Utility Tree
    1. Quality Attribute Scenario
    1. Architectural Driver
    1. Sensitivity Point
    1. Tradeoff Point
    1. Architectural Risk
    1. Non-Risk
    1. Risk Theme
    1. Utility Tree Prioritization
    1. ATAM Stakeholder Roles
    1. Business Driver
    1. Architectural Approach
    1. ATAM Evaluation Team
  • ...and 7 more

REL (REL)

Count: 20 concepts (7.9%)

Concepts:

    1. Relational Data Model
    1. SQL
    1. Primary Key
    1. Foreign Key
    1. Normalization
    1. First Normal Form
    1. Third Normal Form
    1. Join Operation
    1. B-Tree Index
    1. Query Execution Plan
    1. Stored Procedure
    1. Database View
    1. Transaction Log
    1. Write-Ahead Log
    1. Multi-Version Concurrency
  • ...and 5 more

DIST (DIST)

Count: 20 concepts (7.9%)

Concepts:

    1. CAP Theorem
    1. Consistency (CAP)
    1. Availability (CAP)
    1. Partition Tolerance
    1. CP Database System
    1. AP Database System
    1. PACELC Model
    1. Latency-Consistency Tradeoff
    1. Eventual Consistency
    1. Strong Consistency
    1. Read-Your-Writes Consistency
    1. Monotonic Read Consistency
    1. Causal Consistency
    1. Session Consistency
    1. Linearizability
  • ...and 5 more

Foundation Concepts - Prerequisites (FOUND)

Count: 18 concepts (7.1%)

Concepts:

    1. Database Management System
    1. Data Model
    1. Schema
    1. Query Language
    1. Database Index
    1. Query Optimizer
    1. Storage Engine
    1. Data Serialization
    1. Workload Characterization
    1. Read/Write Ratio
    1. Latency vs Throughput
    1. OLTP Workload
    1. OLAP Workload
    1. HTAP Workload
    1. Data Volume Scaling
  • ...and 3 more

ACID (ACID)

Count: 18 concepts (7.1%)

Concepts:

    1. Atomicity
    1. Consistency (ACID)
    1. Isolation
    1. Durability
    1. Transaction Isolation Level
    1. Read Uncommitted
    1. Read Committed
    1. Repeatable Read
    1. Serializable Isolation
    1. Dirty Read
    1. Phantom Read
    1. Non-Repeatable Read
    1. Two-Phase Locking
    1. Optimistic Concurrency Control
    1. Savepoint
  • ...and 3 more

SCALE (SCALE)

Count: 18 concepts (7.1%)

Concepts:

    1. Horizontal Scaling
    1. Vertical Scaling
    1. Data Sharding
    1. Range-Based Sharding
    1. Hash-Based Sharding
    1. Directory-Based Sharding
    1. Geographic Sharding
    1. Database Replication
    1. Single-Leader Replication
    1. Multi-Leader Replication
    1. Leaderless Replication
    1. Quorum Read/Write
    1. Read Replica
    1. Replication Lag
    1. Split-Brain Problem
  • ...and 3 more

NACID (NACID)

Count: 16 concepts (6.3%)

Concepts:

    1. Two-Phase Commit
    1. Three-Phase Commit
    1. Distributed Transaction Coordinator
    1. Saga Orchestration
    1. Saga Choreography
    1. Compensating Transaction
    1. NewSQL Database
    1. Google Spanner
    1. CockroachDB
    1. YugabyteDB
    1. TrueTime API
    1. Global Transaction ID
    1. Cross-Shard Transaction
    1. Paxos Protocol
    1. Raft Consensus Protocol
  • ...and 1 more

ANAL (ANAL)

Count: 15 concepts (5.9%)

Concepts:

    1. Columnar Storage
    1. Massively Parallel Processing
    1. Data Warehouse
    1. Star Schema
    1. Snowflake Schema
    1. OLAP Cube
    1. ETL Pipeline
    1. Inmon Architecture
    1. Kimball Architecture
    1. Bitmap Index
    1. Materialized View
    1. Query Pushdown
    1. Snowflake Database
    1. BigQuery
    1. Apache Parquet Format

GRAPH (GRAPH)

Count: 15 concepts (5.9%)

Concepts:

    1. Property Graph Model
    1. RDF Graph Model
    1. SPARQL
    1. Cypher Query Language
    1. Graph Traversal Algorithm
    1. Depth-First Search
    1. Breadth-First Search
    1. Shortest Path Algorithm
    1. Neo4j
    1. TigerGraph
    1. Amazon Neptune
    1. Distributed Graph Database
    1. Graph Partitioning
    1. GSQL
    1. Knowledge Graph

HA (HA)

Count: 15 concepts (5.9%)

Concepts:

    1. High Availability
    1. Five-Nines Availability
    1. SLA Decomposition
    1. Failure Domain
    1. Single Point of Failure
    1. Active-Active Clustering
    1. Active-Passive Clustering
    1. Failover
    1. Heartbeat Monitoring
    1. Chaos Engineering
    1. Mean Time Between Failures
    1. Mean Time to Recovery
    1. Geographic Redundancy
    1. Multi-Region Deployment
    1. Circuit Breaker Pattern

LLM (LLM)

Count: 15 concepts (5.9%)

Concepts:

    1. Large Language Model
    1. Transformer Architecture
    1. Tokenization
    1. Attention Mechanism
    1. CLS Token Pooling
    1. Mean Pooling
    1. Embedding Model Selection
    1. OpenAI Embeddings API
    1. Sentence Transformers
    1. Self-Hosted Embedding Model
    1. Embedding Cost at Scale
    1. Re-Embedding Migration
    1. Multimodal Embedding
    1. Embedding Pipeline Architecture
    1. Embedding Model Versioning

VEC (VEC)

Count: 14 concepts (5.5%)

Concepts:

    1. Vector Embedding
    1. Embedding Dimensionality
    1. Cosine Similarity
    1. Dot Product Similarity
    1. Euclidean Distance
    1. Approximate Nearest Neighbor
    1. HNSW Index
    1. IVF Index
    1. Flat Vector Index
    1. pgvector Extension
    1. Semantic Search
    1. Hybrid Search
    1. Native Vector Search Feature
    1. ANN Recall vs Speed

KV (KV)

Count: 12 concepts (4.7%)

Concepts:

    1. Key-Value Data Model
    1. Hash Table Storage
    1. Time-to-Live
    1. Cache Eviction Policy
    1. Redis
    1. DynamoDB
    1. Read-Through Cache
    1. Write-Through Cache
    1. Write-Behind Cache
    1. Cache Stampede
    1. Hot Key Problem
    1. Key Expiration

COL (COL)

Count: 12 concepts (4.7%)

Concepts:

    1. Column-Family Data Model
    1. Wide Row
    1. LSM Tree
    1. Compaction Strategy
    1. Bloom Filter
    1. Apache Cassandra
    1. Apache HBase
    1. Partition Key
    1. Clustering Column
    1. Write-Optimized Storage
    1. Read Amplification
    1. Write Amplification

DOC (DOC)

Count: 12 concepts (4.7%)

Concepts:

    1. Document Data Model
    1. JSON Document Storage
    1. BSON Format
    1. Embedded Document
    1. Document Reference
    1. Aggregation Pipeline
    1. Schema Flexibility
    1. MongoDB
    1. Couchbase
    1. Compound Index
    1. Full-Text Search Index
    1. Change Stream

SEL (SEL)

Count: 12 concepts (4.7%)

Concepts:

    1. Polyglot Persistence
    1. Database Selection Framework
    1. Scoring Matrix
    1. Total Cost of Ownership
    1. Vendor Lock-In Risk
    1. Database Migration Plan
    1. Schema Migration
    1. Multi-Model Database
    1. Operational Runbook
    1. Team Expertise Factor
    1. Database Deprecation Risk
    1. Data Access Pattern Analysis

Recommendations

  • Excellent balance: Categories are evenly distributed (spread: 3.9%)
  • MISC category minimal: Good categorization specificity

Educational Use Recommendations

  • Use taxonomy categories for color-coding in graph visualizations
  • Design curriculum modules based on taxonomy groupings
  • Create filtered views for focused learning paths
  • Use categories for assessment organization
  • Enable navigation by topic area in interactive tools

Report generated by learning-graph-reports/taxonomy_distribution.py