Taxonomy Distribution Report
Overview
- Total Concepts: 200
- Number of Taxonomies: 13
- Average Concepts per Taxonomy: 15.4
Distribution Summary
| Category | TaxonomyID | Count | Percentage | Status |
|---|---|---|---|---|
| CHAT | CHAT | 46 | 23.0% | ✅ |
| SEARCH | SEARCH | 28 | 14.0% | ✅ |
| RAG | RAG | 18 | 9.0% | ✅ |
| EMBED | EMBED | 17 | 8.5% | ✅ |
| SEC | SEC | 16 | 8.0% | ✅ |
| GRAPH | GRAPH | 15 | 7.5% | ✅ |
| EVAL | EVAL | 15 | 7.5% | ✅ |
| QUERY | QUERY | 11 | 5.5% | ✅ |
| Foundation Concepts - Prerequisites | FOUND | 9 | 4.5% | ✅ |
| NLP | NLP | 8 | 4.0% | ✅ |
| METRIC | METRIC | 7 | 3.5% | ✅ |
| LLM | LLM | 7 | 3.5% | ✅ |
| TOOL | TOOL | 3 | 1.5% | ℹ️ Under |
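The Percentage column and the Overview figures can be reproduced from the Count column alone. The following is a minimal sketch, not the actual logic of `taxonomy_distribution.py`; the `counts` dictionary and the `percentages` helper are illustrative names, and rounding to one decimal place is assumed from the table's formatting.

```python
# Counts copied from the Distribution Summary table, keyed by TaxonomyID.
counts = {
    "CHAT": 46, "SEARCH": 28, "RAG": 18, "EMBED": 17, "SEC": 16,
    "GRAPH": 15, "EVAL": 15, "QUERY": 11, "FOUND": 9, "NLP": 8,
    "METRIC": 7, "LLM": 7, "TOOL": 3,
}


def percentages(counts):
    """Return each taxonomy's share of all concepts, in percent (one decimal)."""
    total = sum(counts.values())
    return {tid: round(100 * n / total, 1) for tid, n in counts.items()}


total = sum(counts.values())      # Total Concepts
average = total / len(counts)     # Average Concepts per Taxonomy
shares = percentages(counts)

print(total, len(counts), round(average, 1))  # 200 13 15.4
print(shares["CHAT"], shares["TOOL"])         # 23.0 1.5
```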
Visual Distribution
[Chart: distribution of concepts across the 13 taxonomies]
Balance Analysis
✅ No Over-Represented Categories
All categories are under the 30% threshold; the largest, CHAT, holds 23.0%. Good balance!
ℹ️ Under-Represented Categories (<3%)
- TOOL (TOOL): 3 concepts (1.5%)
- Note: Small categories are acceptable for specialized topics
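The balance check above can be sketched as a simple threshold rule. The 30% and 3% limits come from this report; the `classify` function, the status labels, and the treatment of values exactly on a threshold are assumptions made for illustration, not the generator's confirmed behavior.

```python
OVER_THRESHOLD = 30.0   # percent; "under the 30% threshold"
UNDER_THRESHOLD = 3.0   # percent; "Under-Represented Categories (<3%)"

# A few shares taken from the Distribution Summary table.
shares = {"CHAT": 23.0, "SEARCH": 14.0, "LLM": 3.5, "TOOL": 1.5}


def classify(share):
    """Label a percentage share as 'over', 'under', or 'ok'.

    Assumption: both comparisons are strict, so a share of exactly
    30.0 or 3.0 counts as 'ok'.
    """
    if share > OVER_THRESHOLD:
        return "over"
    if share < UNDER_THRESHOLD:
        return "under"
    return "ok"


statuses = {tid: classify(share) for tid, share in shares.items()}
under = [tid for tid, status in statuses.items() if status == "under"]
over = [tid for tid, status in statuses.items() if status == "over"]

print(under)  # ['TOOL']
print(over)   # []
```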
Category Details
CHAT (CHAT)
Count: 46 concepts (23.0%)
Concepts:
- Chatbot
- Conversational Agent
- Dialog System
- Intent Recognition
- Intent Modeling
- Intent Classification
- Entity Extraction
- Named Entity Recognition
- Entity Type
- Entity Linking
- FAQ
- FAQ Analysis
- Question-Answer Pair
- User Query
- User Intent
- ...and 31 more
SEARCH (SEARCH)
Count: 28 concepts (14.0%)
Concepts:
- Keyword Search
- Search Index
- Inverted Index
- Reverse Index
- Full-Text Search
- Boolean Search
- Search Query
- Query Parser
- Synonym Expansion
- Thesaurus
- Ontology
- Taxonomy
- Controlled Vocabulary
- Metadata
- Metadata Tagging
- ...and 13 more
RAG (RAG)
Count: 18 concepts (9.0%)
Concepts:
- External Knowledge
- Public Knowledge Base
- Internal Knowledge
- Private Documents
- Document Corpus
- RAG Pattern
- Retrieval Augmented Generation
- Retrieval Step
- Augmentation Step
- Generation Step
- Context Window
- Prompt Engineering
- System Prompt
- User Prompt
- RAG Limitations
- ...and 3 more
EMBED (EMBED)
Count: 17 concepts (8.5%)
Concepts:
- Word Embedding
- Embedding Vector
- Vector Space Model
- Vector Dimension
- Embedding Model
- Word2Vec
- GloVe
- FastText
- Sentence Embedding
- Contextual Embedding
- Vector Database
- Vector Store
- Vector Index
- Approximate Nearest Neighbor
- FAISS
- ...and 2 more
SEC (SEC)
Count: 16 concepts (8.0%)
Concepts:
- Security
- Authentication
- Authorization
- User Permission
- Role-Based Access Control
- RBAC
- Access Policy
- Data Privacy
- PII
- Personally Identifiable Info
- GDPR
- Data Retention
- Log Storage
- Chat Log
- Logging System
- ...and 1 more
GRAPH (GRAPH)
Count: 15 concepts (7.5%)
Concepts:
- GraphRAG Pattern
- Knowledge Graph
- Graph Database
- Node
- Edge
- Triple
- Subject-Predicate-Object
- RDF
- Graph Query
- OpenCypher
- Cypher Query Language
- Neo4j
- Corporate Nervous System
- Organizational Knowledge
- Knowledge Management
EVAL (EVAL)
Count: 15 concepts (7.5%)
Concepts:
- Query Frequency
- Frequency Analysis
- Pareto Analysis
- 80/20 Rule
- Chatbot Metrics
- KPI
- Key Performance Indicator
- Chatbot Dashboard
- Acceptance Rate
- User Satisfaction
- Response Accuracy
- Chatbot Evaluation
- A/B Testing
- Performance Tuning
- Optimization
QUERY (QUERY)
Count: 11 concepts (5.5%)
Concepts:
- Database Query
- SQL Query
- Query Parameter
- Parameter Extraction
- Query Template
- Parameterized Query
- Query Execution
- Query Description
- Natural Language to SQL
- Question to Query Mapping
- Slot Filling
Foundation Concepts - Prerequisites (FOUND)
Count: 9 concepts (4.5%)
Concepts:
- Artificial Intelligence
- AI Timeline
- AI Doubling Rate
- Moore's Law
- Natural Language Processing
- Text Processing
- String Matching
- Regular Expressions
- Grep Command
NLP (NLP)
Count: 8 concepts (4.0%)
Concepts:
- NLP Pipeline
- Text Preprocessing
- Text Normalization
- Stemming
- Lemmatization
- Part-of-Speech Tagging
- Dependency Parsing
- Coreference Resolution
METRIC (METRIC)
Count: 7 concepts (3.5%)
Concepts:
- Search Precision
- Search Recall
- F-Measure
- F1 Score
- Confusion Matrix
- True Positive
- False Positive
LLM (LLM)
Count: 7 concepts (3.5%)
Concepts:
- Large Language Model
- Transformer Architecture
- Attention Mechanism
- Token
- Tokenization
- Subword Tokenization
- Byte Pair Encoding
TOOL (TOOL)
Count: 3 concepts (1.5%)
Concepts:
- Team Project
- Capstone Project
- Chatbot Career
Recommendations
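- ✅ Good balance: Categories are reasonably distributed (spread between the largest and smallest category: 21.5 percentage points, 23.0% - 1.5%)
- ✅ No MISC category: every concept is assigned to a specific taxonomy, indicating good categorization specificity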
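The spread figure can be checked with a few lines. This sketch assumes that "spread" means the largest percentage share minus the smallest, which reproduces the reported 21.5; the variable names are illustrative and the actual definition used by the generator script is not confirmed here.

```python
# Percentage shares copied from the Distribution Summary table.
shares = {
    "CHAT": 23.0, "SEARCH": 14.0, "RAG": 9.0, "EMBED": 8.5, "SEC": 8.0,
    "GRAPH": 7.5, "EVAL": 7.5, "QUERY": 5.5, "FOUND": 4.5, "NLP": 4.0,
    "METRIC": 3.5, "LLM": 3.5, "TOOL": 1.5,
}

largest = max(shares, key=shares.get)    # taxonomy with the highest share
smallest = min(shares, key=shares.get)   # taxonomy with the lowest share

# Assumed definition: spread = largest share - smallest share.
spread = round(shares[largest] - shares[smallest], 1)

print(largest, smallest, spread)  # CHAT TOOL 21.5
```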
Educational Use Recommendations
- Use taxonomy categories for color-coding in graph visualizations
- Design curriculum modules based on taxonomy groupings
- Create filtered views for focused learning paths
- Use categories for assessment organization
- Enable navigation by topic area in interactive tools
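As one possible starting point for the color-coding and filtered-view suggestions above, the sketch below assigns a color to each TaxonomyID and filters concepts by taxonomy. The palette, the record layout (`label` and `taxonomy` keys), and the `filtered_view` helper are hypothetical; only the TaxonomyIDs and the concept names come from this report.

```python
# TaxonomyIDs from the Distribution Summary table.
TAXONOMY_IDS = [
    "CHAT", "SEARCH", "RAG", "EMBED", "SEC", "GRAPH", "EVAL",
    "QUERY", "FOUND", "NLP", "METRIC", "LLM", "TOOL",
]

# Hypothetical palette: any 13 distinguishable colors would do.
PALETTE = [
    "#1f77b4", "#ff7f0e", "#2ca02c", "#d62728", "#9467bd", "#8c564b",
    "#e377c2", "#7f7f7f", "#bcbd22", "#17becf", "#aec7e8", "#ffbb78",
    "#98df8a",
]

color_by_taxonomy = dict(zip(TAXONOMY_IDS, PALETTE))

# Hypothetical concept records; the labels appear in Category Details.
concepts = [
    {"label": "Chatbot", "taxonomy": "CHAT"},
    {"label": "Knowledge Graph", "taxonomy": "GRAPH"},
    {"label": "FAISS", "taxonomy": "EMBED"},
    {"label": "Neo4j", "taxonomy": "GRAPH"},
]


def filtered_view(concepts, taxonomy_id):
    """Return the labels of the concepts that belong to one taxonomy."""
    return [c["label"] for c in concepts if c["taxonomy"] == taxonomy_id]


graph_labels = filtered_view(concepts, "GRAPH")
print(graph_labels)                # ['Knowledge Graph', 'Neo4j']
print(color_by_taxonomy["GRAPH"])  # #8c564b
```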
Report generated by learning-graph-reports/taxonomy_distribution.py