Skip to content

Taxonomy Distribution Report

Overview

  • Total Concepts: 200
  • Number of Taxonomies: 13
  • Average Concepts per Taxonomy: 15.4

Distribution Summary

Category TaxonomyID Count Percentage Status
CHAT CHAT 46 23.0%
SEARCH SEARCH 28 14.0%
RAG RAG 18 9.0%
EMBED EMBED 17 8.5%
SEC SEC 16 8.0%
GRAPH GRAPH 15 7.5%
EVAL EVAL 15 7.5%
QUERY QUERY 11 5.5%
Foundation Concepts - Prerequisites FOUND 9 4.5%
NLP NLP 8 4.0%
METRIC METRIC 7 3.5%
LLM LLM 7 3.5%
TOOL TOOL 3 1.5% ℹ️ Under

Visual Distribution

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
CHAT   ███████████  46 ( 23.0%)
SEARCH ███████  28 ( 14.0%)
RAG    ████  18 (  9.0%)
EMBED  ████  17 (  8.5%)
SEC    ████  16 (  8.0%)
GRAPH  ███  15 (  7.5%)
EVAL   ███  15 (  7.5%)
QUERY  ██  11 (  5.5%)
FOUND  ██   9 (  4.5%)
NLP    ██   8 (  4.0%)
METRIC █   7 (  3.5%)
LLM    █   7 (  3.5%)
TOOL      3 (  1.5%)

Balance Analysis

✅ No Over-Represented Categories

All categories are under the 30% threshold. Good balance!

ℹ️ Under-Represented Categories (<3%)

  • TOOL (TOOL): 3 concepts (1.5%)
  • Note: Small categories are acceptable for specialized topics

Category Details

CHAT (CHAT)

Count: 46 concepts (23.0%)

Concepts:

    1. Chatbot
    1. Conversational Agent
    1. Dialog System
    1. Intent Recognition
    1. Intent Modeling
    1. Intent Classification
    1. Entity Extraction
    1. Named Entity Recognition
    1. Entity Type
    1. Entity Linking
    1. FAQ
    1. FAQ Analysis
    1. Question-Answer Pair
    1. User Query
    1. User Intent
  • ...and 31 more

Count: 28 concepts (14.0%)

Concepts:

    1. Keyword Search
    1. Search Index
    1. Inverted Index
    1. Reverse Index
    1. Full-Text Search
    1. Boolean Search
    1. Search Query
    1. Query Parser
    1. Synonym Expansion
    1. Thesaurus
    1. Ontology
    1. Taxonomy
    1. Controlled Vocabulary
    1. Metadata
    1. Metadata Tagging
  • ...and 13 more

RAG (RAG)

Count: 18 concepts (9.0%)

Concepts:

    1. External Knowledge
    1. Public Knowledge Base
    1. Internal Knowledge
    1. Private Documents
    1. Document Corpus
    1. RAG Pattern
    1. Retrieval Augmented Generation
    1. Retrieval Step
    1. Augmentation Step
    1. Generation Step
    1. Context Window
    1. Prompt Engineering
    1. System Prompt
    1. User Prompt
    1. RAG Limitations
  • ...and 3 more

EMBED (EMBED)

Count: 17 concepts (8.5%)

Concepts:

    1. Word Embedding
    1. Embedding Vector
    1. Vector Space Model
    1. Vector Dimension
    1. Embedding Model
    1. Word2Vec
    1. GloVe
    1. FastText
    1. Sentence Embedding
    1. Contextual Embedding
    1. Vector Database
    1. Vector Store
    1. Vector Index
    1. Approximate Nearest Neighbor
    1. FAISS
  • ...and 2 more

SEC (SEC)

Count: 16 concepts (8.0%)

Concepts:

    1. Security
    1. Authentication
    1. Authorization
    1. User Permission
    1. Role-Based Access Control
    1. RBAC
    1. Access Policy
    1. Data Privacy
    1. PII
    1. Personally Identifiable Info
    1. GDPR
    1. Data Retention
    1. Log Storage
    1. Chat Log
    1. Logging System
  • ...and 1 more

GRAPH (GRAPH)

Count: 15 concepts (7.5%)

Concepts:

    1. GraphRAG Pattern
    1. Knowledge Graph
    1. Graph Database
    1. Node
    1. Edge
    1. Triple
    1. Subject-Predicate-Object
    1. RDF
    1. Graph Query
    1. OpenCypher
    1. Cypher Query Language
    1. Neo4j
    1. Corporate Nervous System
    1. Organizational Knowledge
    1. Knowledge Management

EVAL (EVAL)

Count: 15 concepts (7.5%)

Concepts:

    1. Query Frequency
    1. Frequency Analysis
    1. Pareto Analysis
    1. 80/20 Rule
    1. Chatbot Metrics
    1. KPI
    1. Key Performance Indicator
    1. Chatbot Dashboard
    1. Acceptance Rate
    1. User Satisfaction
    1. Response Accuracy
    1. Chatbot Evaluation
    1. A/B Testing
    1. Performance Tuning
    1. Optimization

QUERY (QUERY)

Count: 11 concepts (5.5%)

Concepts:

    1. Database Query
    1. SQL Query
    1. Query Parameter
    1. Parameter Extraction
    1. Query Template
    1. Parameterized Query
    1. Query Execution
    1. Query Description
    1. Natural Language to SQL
    1. Question to Query Mapping
    1. Slot Filling

Foundation Concepts - Prerequisites (FOUND)

Count: 9 concepts (4.5%)

Concepts:

    1. Artificial Intelligence
    1. AI Timeline
    1. AI Doubling Rate
    1. Moore's Law
    1. Natural Language Processing
    1. Text Processing
    1. String Matching
    1. Regular Expressions
    1. Grep Command

NLP (NLP)

Count: 8 concepts (4.0%)

Concepts:

    1. NLP Pipeline
    1. Text Preprocessing
    1. Text Normalization
    1. Stemming
    1. Lemmatization
    1. Part-of-Speech Tagging
    1. Dependency Parsing
    1. Coreference Resolution

METRIC (METRIC)

Count: 7 concepts (3.5%)

Concepts:

    1. Search Precision
    1. Search Recall
    1. F-Measure
    1. F1 Score
    1. Confusion Matrix
    1. True Positive
    1. False Positive

LLM (LLM)

Count: 7 concepts (3.5%)

Concepts:

    1. Large Language Model
    1. Transformer Architecture
    1. Attention Mechanism
    1. Token
    1. Tokenization
    1. Subword Tokenization
    1. Byte Pair Encoding

TOOL (TOOL)

Count: 3 concepts (1.5%)

Concepts:

    1. Team Project
    1. Capstone Project
    1. Chatbot Career

Recommendations

  • Good balance: Categories are reasonably distributed (spread: 21.5%)
  • MISC category minimal: Good categorization specificity

Educational Use Recommendations

  • Use taxonomy categories for color-coding in graph visualizations
  • Design curriculum modules based on taxonomy groupings
  • Create filtered views for focused learning paths
  • Use categories for assessment organization
  • Enable navigation by topic area in interactive tools

Report generated by learning-graph-reports/taxonomy_distribution.py