References: Synthetic Data Generation and AI-Assisted LRS Testing¶
<<<<<<< HEAD 1. Synthetic Data - Wikipedia - Overview of synthetic data generation methods, statistical fidelity criteria, and the privacy-vs-utility trade-offs that motivate LLM-driven xAPI corpus generation.
-
Load Testing - Wikipedia - Background on load-testing methodology, including the distinction between smoke, load, stress, and soak tests — all relevant patterns for stressing an LRS.
-
Large Language Model - Wikipedia - Background on LLM capabilities, with sections on structured-output generation that map onto using Claude or similar tools to produce valid xAPI statements.
-
The Art of Software Testing (3rd Edition) - Glenford J. Myers, Corey Sandler & Tom Badgett - Wiley - Classic text on test design; the equivalence-class and boundary-value chapters frame what "realistic synthetic data" actually means.
-
Software Performance Engineering - Connie U. Smith & Lloyd G. Williams - Addison-Wesley - Methodology for capacity planning and load characterization; particularly useful for designing the cohort-mix specifications that drive realistic test runs.
-
k6 Documentation - Grafana Labs - Open-source load-testing tool that scripts xAPI emit traffic in JavaScript; the constant-arrival-rate and ramping-arrival-rate executors map onto common LRS test scenarios.
-
Locust Documentation - Locust project - Python-based load testing framework with a clean user-modeling abstraction that maps naturally onto learner archetypes.
-
Faker.js - Faker.js community - Library for generating fake names, emails, and identifiers, useful for the deterministic-pseudonym side of xAPI synthetic data.
-
JSON Schema - JSON Schema Organization - Schema authoring tutorial — critical for validating LLM-generated statements before they hit the LRS so you don't bake in malformed data.
10. Anthropic API Documentation - Anthropic - Reference for Claude API capabilities including structured-output generation, tool use, and the patterns most useful for producing schema-conformant xAPI statements at scale.¶
-
Synthetic Data - Wikipedia - Comprehensive overview of synthetic data techniques, distributional fidelity, and the privacy advantages that make synthetic xAPI cohorts a better choice than scrubbed real data for most testing.
-
Large language model - Wikipedia - The technology behind Claude Code; explains how LLMs convert prompts into structured JSON output and the constraints that govern statement-generation quality.
-
Load testing - Wikipedia - Covers ramp-up profiles, sustained throughput tests, and breakpoint testing — the patterns this chapter applies to LRS endpoints with synthetic statement streams.
-
The Art of Application Performance Testing (2nd Edition) - Ian Molyneaux - O'Reilly Media - Practitioner-focused treatment of load-test design, ramp profiles, and result interpretation that grounds the synthetic-cohort strategy this chapter teaches.
-
Statistical Rethinking (2nd Edition) - Richard McElreath - CRC Press - The statistical foundation for evaluating whether a synthetic learner cohort is "representative" — distribution comparison, posterior predictive checks, and the discipline that prevents synthetic data from quietly lying.
-
Anthropic API Documentation - Anthropic - The reference for the Claude API used throughout this chapter, including prompt-caching, structured tool use for JSON output, and rate-limit considerations for batch generation runs.
-
Claude Code Documentation - Anthropic - The CLI tool this chapter uses to author synthetic-statement scripts; covers the agent-loop model, tool permissions, and the workflows the synthetic-data examples are built on.
-
Faker Library - joke2k - The Python library for generating realistic names, emails, and timestamps; commonly used as a deterministic component alongside an LLM that handles the verb/object choices.
-
Locust Load Testing Tool - Locust Project - Python-based distributed load tester that pairs naturally with synthetic-statement generators for stress-testing an LRS at production scale.
-
k6 Load Testing - Grafana Labs - JavaScript-scriptable load tester whose statement-generation scripts can share JSON fixtures with the textbook's xAPI client. Excellent for end-to-end load tests of the full pipeline.
d2ecc9b (iframe updates)