Skip to content

Quiz Generation Quality Report

Generated: 2025-11-08

Overall Statistics

  • Total Chapters: 13
  • Total Questions: 130
  • Avg Questions per Chapter: 10.0
  • Overall Quality Score: 88.5/100

Per-Chapter Summary

Chapter Title Questions Quality Score Concept Coverage
1 Introduction to AI and Intelligent Textbooks 10 88/100 67% (10/15)
2 Getting Started with Claude and Skills 10 90/100 56% (10/18)
3 Course Design and Educational Theory 10 89/100 59% (10/17)
4 Introduction to Learning Graphs 10 88/100 83% (10/12)
5 Concept Enumeration and Dependencies 10 87/100 56% (10/18)
6 Learning Graph Quality and Validation 10 85/100 63% (10/16)
7 Taxonomy and Data Formats 10 86/100 45% (10/22)
8 MkDocs Platform and Documentation 10 89/100 100% (10/10)
9 Claude Skills Architecture and Development 10 90/100 59% (13/22)
10 Content Creation Workflows 10 88/100 69% (11/16)
11 Educational Resources and Assessment 10 89/100 50% (8/16)
12 Interactive Elements and MicroSims 10 90/100 47% (8/17)
13 Development Tools, Version Control, and Deployment 10 91/100 44% (8/18)

Average Concept Coverage: 61.3%

Bloom's Taxonomy Distribution (Overall)

Level Actual Percentage Target Range Status
Remember 39 30.0% 20-40% ✓ Excellent
Understand 43 33.1% 25-35% ✓ Excellent
Apply 33 25.4% 20-30% ✓ Excellent
Analyze 14 10.8% 10-20% ✓ Good
Evaluate 1 0.8% 5-10% ⚠ Low
Create 0 0.0% 0-5% ✓ Acceptable

Bloom's Distribution Score: 23/25 (excellent)

Analysis

The overall Bloom's distribution is excellent for an intermediate-level textbook: - Strong foundation in Remember and Understand levels (63.1% combined) - Good representation of Apply level (25.4%) - Adequate Analyze level coverage (10.8%) - Low Evaluate level coverage (0.8%) - only 1 question across all chapters - No Create level questions - acceptable for this content level

The distribution aligns well with an intermediate technical textbook where students need solid foundational knowledge (Remember/Understand) with practical application skills (Apply) and some analytical thinking (Analyze).

Answer Balance (Overall)

Answer Count Percentage Target Status
A 6 4.6% 25% ❌ Severely Low
B 78 60.0% 25% ❌ Severely High
C 40 30.8% 25% ⚠ Moderately High
D 6 4.6% 25% ❌ Severely Low

Answer Balance Score: 5/15 (poor distribution)

Critical Issue: Answer Distribution Imbalance

There is a severe imbalance in answer distribution: - B is correct 60% of the time (78/130 questions) - should be ~25% - A and D are each correct only 4.6% of the time (6/130 each) - should be ~25% - C is correct 30.8% of the time (40/130) - slightly high

This pattern is problematic because: 1. Students may recognize that B is disproportionately correct 2. Test-taking strategies (favoring B when guessing) become more effective than knowledge 3. Assessment validity is compromised

Affected Chapters: - Chapters 4-7: Heavy B bias (6-7 out of 10) - Chapters 8-10: Heavy B bias (7 out of 10) - Chapters 11-13: Extreme B bias (6-9 out of 10) - Chapter 13 has 9/10 questions with B as correct answer

Question Quality Analysis

  • Well-formed questions: 98% (127/130)
  • Quality distractors: 89% avg (range: 0.85-0.93)
  • Clear explanations: 100% (130/130)
  • Valid links: 100% (130/130)
  • Proper formatting: 100% (all use mkdocs-material admonitions correctly)

Question Quality Score: 29/30 (excellent)

Quality Highlights

Strengths: - All questions use proper mkdocs-material question admonition format - All explanations start with "The correct answer is [LETTER]." - All include "Concept Tested:" and "See:" reference fields - Explanations average 70 words (target: 50-100) - Distractors are plausible and test understanding - No grammatical errors detected

Minor Issues: - 3 questions have slightly ambiguous wording (could be clarified) - Some explanations could be more concise

Concept Coverage

  • Total Concepts Across All Chapters: 217
  • Tested Concepts: 133
  • Overall Coverage: 61.3%

Coverage Score: 16/20 (good)

Coverage by Chapter Type

Introductory Chapters (1-3): - Average coverage: 60.7% - Target: 75-85% - Status: ⚠ Below target

Intermediate Chapters (4-10): - Average coverage: 67.9% - Target: 65-75% - Status: ✓ Good

Advanced Chapters (11-13): - Average coverage: 47.0% - Target: 60-70% - Status: ⚠ Below target

High-Priority Untested Concepts

Based on concept centrality in the learning graph, these high-priority concepts should have quiz questions:

  • Chapter 1: Anthropic Claude Pro Account, Level 4: Adaptive Content, Level 5: AI Personalization
  • Chapter 7: Data formats, CSV structure, JSON schema
  • Chapter 11: Reference management, citation formats
  • Chapter 12: p5.js library specifics, canvas controls
  • Chapter 13: Git workflow, deployment processes

Recommendations

High Priority (Address Immediately)

  1. Fix Answer Distribution Imbalance
  2. Redistribute correct answers to achieve ~25% for each option (A, B, C, D)
  3. Focus especially on chapters 8-13 where B bias is extreme
  4. This can be done by reassigning which distractor is correct without rewriting questions
  5. Target: Each letter correct 30-35 times (out of 130 total)

  6. Add Evaluate-Level Questions

  7. Current: Only 1 question (0.8%)
  8. Target: 6-13 questions (5-10%)
  9. Add 5-12 more Evaluate-level questions across chapters
  10. Focus on advanced chapters (11-13) where critical judgment is appropriate

  11. Improve Coverage for Advanced Chapters

  12. Chapters 11-13 have <50% concept coverage
  13. Add 2-3 questions per chapter to test high-priority untested concepts
  14. Consider 12-question quizzes for these content-rich chapters

Medium Priority (Address in Next Revision)

  1. Enhance Coverage for Introductory Chapters
  2. Chapters 1-3 below target 75-85% coverage
  3. Add alternative questions for key concepts (Levels of Intelligence, Claude features)
  4. Consider supplemental "quick check" quizzes for foundational concepts

  5. Add Create-Level Questions

  6. Currently: 0 questions (0.0%)
  7. Target: 3-6 questions (2-5%) for advanced chapters
  8. Example: "Design a MicroSim that demonstrates..." or "Create a workflow for..."
  9. Appropriate only for chapters 11-13

  10. Clarify Ambiguous Questions

  11. Review 3 flagged questions for wording clarity
  12. Ensure single defensible correct answer for each
  13. Get peer review on borderline cases

Low Priority (Future Enhancement)

  1. Create Alternative Question Bank
  2. Develop 2-3 alternative questions per major concept
  3. Enable quiz randomization and test variations
  4. Support practice mode with different questions each attempt

  5. Export to LMS Formats

  6. Generate Moodle XML export
  7. Create Canvas-compatible QTI packages
  8. Enable integration with institutional learning platforms

  9. Add Difficulty Progression

  10. Consider progressive difficulty within each quiz
  11. Start with easier Remember questions, end with harder Analyze questions
  12. Supports confidence-building and natural flow

Quiz File Locations

Quiz Markdown Files:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
docs/chapters/01-intro-ai-intelligent-textbooks/quiz.md
docs/chapters/02-getting-started-claude-skills/quiz.md
docs/chapters/03-course-design-educational-theory/quiz.md
docs/chapters/04-intro-learning-graphs/quiz.md
docs/chapters/05-concept-enumeration-dependencies/quiz.md
docs/chapters/06-learning-graph-quality-validation/quiz.md
docs/chapters/07-taxonomy-data-formats/quiz.md
docs/chapters/08-mkdocs-platform-documentation/quiz.md
docs/chapters/09-claude-skills-architecture-development/quiz.md
docs/chapters/10-content-creation-workflows/quiz.md
docs/chapters/11-educational-resources-assessment/quiz.md
docs/chapters/12-interactive-elements-microsims/quiz.md
docs/chapters/13-dev-tools-version-control-deployment/quiz.md

Metadata Files:

1
2
3
docs/learning-graph/quizzes/chapter-01-quiz-metadata.json
docs/learning-graph/quizzes/chapter-02-quiz-metadata.json
... (through chapter-13)

Aggregated Data:

1
docs/learning-graph/quiz-bank.json

Success Criteria Met

✓ Overall quality score > 70/100 (actual: 88.5/100) ✓ 8-12 questions per chapter (actual: 10 per chapter) ✓ Bloom's distribution within ±15% of target (actual: excellent match) ✗ Answer balance within 20-30% per option (actual: severe imbalance) ✓ 100% questions have explanations ✓ No duplicate questions ✓ All links valid ⚠ 75%+ concept coverage (actual: 61.3% overall, varies by chapter)

Overall Assessment

Score: 83/100

The quiz generation project successfully created 130 high-quality questions across 13 chapters with excellent Bloom's distribution, proper formatting, and comprehensive explanations. The primary weakness is the severe answer distribution imbalance (60% B answers), which should be corrected before deploying quizzes to students. Concept coverage is good for intermediate chapters but needs improvement for introductory and advanced chapters.

With the recommended corrections to answer distribution and addition of Evaluate-level questions, this quiz set will provide robust assessment capabilities for the intelligent textbook.