References

This textbook draws upon the following high-quality resources curated for college-level students studying Conversational AI:

AI Capability Increases

Measuring AI Ability to Complete Long Tasks - March 19 2025 - METR Blog. This chart and paper give convincing evidence that the length of tasks that AI can complete with 50% probability of correctness is doubling every seven months. This has been consistent since GPT-2 in 2019. In a few years, you will be able to generate an entire college-level textbook complete with complex charts, diagrams and simulations in a few hours. This is a wake call for educators around the world!

Foundational AI and Large Language Models

Attention Is All You Need - 2017-06-12 - arXiv - Seminal paper by Vaswani et al. introducing the Transformer architecture that revolutionized natural language processing and forms the foundation for modern large language models like GPT and BERT. If you had to pick one paper that transformed the field of text generation, this is the paper to read.
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding - 2018-10-11 - arXiv - Google AI's breakthrough paper introducing bidirectional pre-training for language understanding, demonstrating how models can jointly condition on both left and right context in all layers. BERT was very influential in my career. I spent over two years at a Fortune 50 healthcare company building custom BERT LLMs that were fine-tuned on clinical terminology. Although BERT is not used in most commercial systems today, it was very influential is guiding today's industry.
Language Models are Few-Shot Learners - 2020-05-28 - arXiv - The GPT-3 paper by OpenAI demonstrating that scaling language models to 175 billion parameters enables few-shot learning without gradient updates or fine-tuning.
Deep Learning - 2016-11-18 - MIT Press - Comprehensive textbook by Goodfellow, Bengio, and Courville covering mathematical foundations, deep learning techniques, and research perspectives essential for understanding modern AI systems.
Efficient Estimation of Word Representations in Vector Space - 2013-01-16 - arXiv - Mikolov et al.'s Word2Vec paper introducing continuous vector representations of words that capture semantic relationships and laid groundwork for modern embeddings.
The Illustrated Transformer - 2018-06-27 - Jay Alammar's Blog - Visual guide breaking down the Transformer architecture through intuitive diagrams, used in ML courses at Stanford, MIT, Harvard, and other top universities.
A Survey of Large Language Models - 2023-03-31 - arXiv - Comprehensive survey by Zhao et al. reviewing LLM evolution from statistical to neural language models, covering pre-training techniques, model families, and evaluation methods.
LoRA: Low-Rank Adaptation of Large Language Models - 2021-06-17 - arXiv - Paper introducing parameter-efficient fine-tuning that reduces trainable parameters by 10,000x while maintaining performance, enabling practical customization of large models.

Search Technologies and Natural Language Processing

Introduction to Information Retrieval - 2009-04-01 - Cambridge University Press - Authoritative textbook by Manning, Raghavan, and Schütze covering search fundamentals including TF-IDF, Boolean retrieval, vector space models, and PageRank algorithms essential for understanding chatbot search.
Speech and Language Processing - 2025-08-24 - Stanford University - Comprehensive NLP textbook by Jurafsky and Martin, now in its 3rd edition with extensive coverage of language models, tokenization, intent modeling, and neural approaches used in conversational AI.
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks - 2019-08-27 - arXiv - Reimers and Gurevych's modification of BERT using siamese networks to derive semantically meaningful sentence embeddings, reducing similarity search time from 65 hours to 5 seconds.
Faiss: A Library for Efficient Similarity Search - 2017-03-29 - Meta Engineering Blog - Introduction to Facebook AI's open-source library for billion-scale vector similarity search, demonstrating practical implementation of efficient nearest-neighbor algorithms for embeddings.
Dense Passage Retrieval for Open-Domain Question Answering - 2020-04-10 - arXiv - Facebook AI paper showing that dense vector representations outperform traditional BM25 search by 9-19% for passage retrieval, establishing foundations for modern RAG systems.
The PageRank Citation Ranking: Bringing Order to the Web - 1998-01-29 - Stanford InfoLab - Original Stanford paper by Page and Brin introducing the PageRank algorithm that made Google-scale search possible, demonstrating graph-based ranking essential for understanding search performance.
Efficient Estimation of Word Representations in Vector Space - 2013-01-16 - arXiv - Foundational Word2Vec paper introducing continuous bag-of-words and skip-gram models for learning distributed word representations efficiently from large datasets.

RAG, GraphRAG, and Knowledge Graphs

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks - 2020-05-22 - arXiv - Lewis et al.'s paper introducing RAG architecture combining parametric (LLM) and non-parametric (vector store) memory, the foundation for most production chatbots today.
GraphRAG: Unlocking LLM Discovery on Narrative Private Data - 2024-02-13 - Microsoft Research Blog - Introduction to GraphRAG approach using LLM-generated knowledge graphs with hierarchical clustering to improve retrieval beyond traditional vector search for complex queries.
Knowledge Graph Refinement: A Survey of Approaches and Evaluation Methods - 2017-01-01 - Semantic Web Journal - Paulheim's comprehensive survey of knowledge graph construction and refinement methods, covering entity extraction, relationship inference, and error detection techniques.
Neo4j Cypher Manual - 2024-01-01 - Neo4j Documentation - Official documentation for Cypher, the declarative graph query language used to interact with Neo4j databases, essential for implementing GraphRAG patterns.
Vector Database Comparison: Pinecone vs Weaviate vs Milvus vs Chroma - 2025-01-15 - Medium - Comprehensive comparison of vector database platforms covering scalability, performance, deployment options, and cost considerations for production chatbot systems.

Evaluation, Metrics, and Quality Assessment

SQuAD: 100,000+ Questions for Machine Comprehension of Text - 2016-06-16 - arXiv - Stanford's reading comprehension dataset with 100,000+ question-answer pairs serving as a benchmark for evaluating question-answering systems and chatbot comprehension abilities.
Classification: Accuracy, Recall, Precision, and Related Metrics - 2024-01-01 - Google Machine Learning - Comprehensive guide explaining precision, recall, F1 score, and their applications in evaluating search quality and chatbot response accuracy.
Metrics of Success: Evaluating User Satisfaction in AI Chatbots - 2024-01-01 - ACM Digital Library - Recent research examining chatbot evaluation metrics including CSAT, NPS, and multi-dimensional satisfaction frameworks for production systems.
Survey on Evaluation Methods for Dialogue Systems - 2021-01-01 - Artificial Intelligence Review - Comprehensive survey of dialogue system evaluation methods covering both automatic metrics and human evaluation approaches for conversational AI.

Production Systems, Security, and Ethics

ACM Code of Ethics and Professional Conduct - 2018-06-22 - ACM - Professional ethics guidelines covering privacy, security, transparency, and responsible AI development essential for building production chatbot systems.
Prompt Engineering Guide - 2024-01-01 - OpenAI Documentation - Official best practices for crafting effective prompts including clear instructions, few-shot learning, and systematic testing strategies for LLM-based applications.
Build an LLM RAG Chatbot with LangChain - 2024-01-01 - Real Python - Comprehensive tutorial demonstrating practical implementation of RAG chatbots using LangChain framework, covering indexing, retrieval, and generation patterns.
Named Entity Recognition using spaCy - 2024-01-01 - spaCy Documentation - Guide to implementing NER for entity extraction from conversations, essential for building knowledge graphs and understanding user queries.
Software-Based Dialogue Systems: Survey, Taxonomy, and Challenges - 2022-01-01 - ACM Computing Surveys - Systematic literature review covering evolution of dialogue systems from rule-based approaches to modern neural architectures, including task-oriented and open-domain systems.
Chatbot Frameworks Comparison: Rasa vs Dialogflow vs Microsoft Bot Framework - 2024-01-01 - Rootstack Blog - Industry comparison of major chatbot frameworks covering features, deployment options, customization capabilities, and use case recommendations for production systems.

Textbooks on Intelligent Textbooks

Here is an intelligent textbooks on the topic of intelligent textbooks:

Intelligent Textbooks

You can find examples of over 40 other related Intelligent Textbooks here:

Intelligent Textbook Case Studies

The Claude Skills for Building Intelligent Textbooks and the textbook on Learning Graphs are especially useful for anyone that would like to customize this textbook for your own classroom.

For readers not familiar with Graphs, these two intelligent textbooks are relevant.

References for the Book Building Tools

mkdocs - https://www.mkdocs.org/ - this is our tool for building the website. It converts Markdown into HTML in the site directory.
mkdocs material theme - https://squidfunk.github.io/mkdocs-material/ - this is the theme for our site. The theme adds the user interface elements that give our site the look and feel. It also has the features such as social cards.
GitHub Pages - https://pages.github.com/ - this is the free tool for hosting public websites created by mkdocs
Markdown - https://www.mkdocs.org/user-guide/writing-your-docs/#writing-with-markdown - this is the format we use for text. It allows us to have headers, lists, tables, links and images without learning HTML.
Deploy Mkdocs GitHub Action - https://github.com/marketplace/actions/deploy-mkdocs - this is the tool we use to automatically build our site after edits are checked in with Git.
Git Book - https://git-scm.com/book/en/v2 - a useful book on Git. Just read the first two chapters to learn how to check in new code.
Conda - https://conda.io/ - this is a command line tool that keeps our Python libraries organized for each project.
VS Code - https://code.visualstudio.com/ - this is the integrated development environment we use to mange the files on our website.
Markdown Paste - https://marketplace.visualstudio.com/items?itemName=telesoho.vscode-markdown-paste-image - this is the VS code extension we use to make sure we keep the markdown format generated by ChatGPT.

Dan's Personal Favorites

Five Levels of Intelligent Textbooks - Nov 19, 2024 - Dan McCreary's Personal Blog on Medium. This is the article I wrote when I first started to use the five-level model for intelligent textbooks. It is critical because it shows the great divide between 2.99 level books and 3.0 level books where student data needs to be protected and costs go up dramatically. This is why this textbook does not store any student-specific data.
Using GenAI to Create Learning Graphs: Data Structures for Hyper-Personalized Lesson Plans - Oct 16, 2024 - Dan McCreary's Personal Blog on Medium. This is a good five minute intro to Learning Graphs.
Micro-Simulations for Education: An interview with Valerie Lockhart of Code Savvy - Nov 4, 2023 - Dan McCreary's Personal Blog on Medium. This was the first time we used the term "MicroSim". All credit to Valerie for her insight that combining processing with generative AI would transform education!
ChatGPT Brings us Closer to the Diamond Age: Personalized Learning Agents are Just Years Away - Jan 30, 2023 - Dan McCreary's Personal Blog on Medium. This is the story behind what inspired me to try to visualize how an intelligent textbook would work. This is something I have been thinking about since 1995 when Diamond Age was published.
Showing Bias in BERT: How we can easily show occupation-gender bias using the Hugging Face website - Feb 25, 2022 - Dan McCreary's Personal Blog on Medium. Are LLMs biased? You bet they are! This blog was the basis for a class I taught on detecting and correcting for bias in LLMs.
The Learning-Knowledge-Language Innovation Hot Zone - Aug 23, 2021 - Dan McCreary's Personal Blog on Medium. A good summary of why the LKL Innovation Hot Zone needs focus.
Grading GPT-3 For STEM Lesson Plan Content Generation - Jan 24, 2021 - Dan McCreary's Personal Blog on Medium. My first attempt to score LLMs on their ability to generate educational content. Generating good multiple choice questions was REALLY hard back in early 2021. Now we have a Claude Skill that totally rocks and generating high-quality multiple-choice questions.
Grounding LLMs: The Knowledge Graph foundation every AI project needs - Nov 6, 2025 - by Alessandro Negro. If you wonder why all our intelligent textbooks are built around a graph, this is a good place to start. Alessandro is the Chief Scientist at GraphAware and author of the books Graph-Powered Machine Learning (Manning, 2021) and Knowledge Graphs and LLMs in Action (Manning, 2025). He and I are both Manning authors.

Additional Resources

For readers interested in deeper exploration of conversational AI topics, the following supplementary resources are recommended:

Online Courses and Tutorials

LangChain Documentation - Comprehensive guides for building production RAG systems
Neo4j GraphAcademy - Free courses on graph databases and Cypher query language
Hugging Face Transformers - Tutorials and pre-trained models for NLP tasks

Communities and Forums

r/LanguageTechnology - Reddit community for NLP discussions
Papers With Code - Benchmark datasets and model implementations
AI Alignment Forum - Discussions on responsible AI development

Research Venues

NeurIPS (Neural Information Processing Systems) - Premier AI/ML conference
ACL (Association for Computational Linguistics) - Leading NLP conference
EMNLP (Empirical Methods in Natural Language Processing) - Applied NLP research

References last updated: 2025-11-16

Note: This reference list includes 30 college-level resources mixing peer-reviewed papers (60%), authoritative textbooks, official documentation, and high-quality technical articles. URLs were verified where possible, though some sites restrict automated access while remaining publicly available through web browsers.