Chatbot Response Latency Breakdown: Simple vs. Complex Queries

Intent Classification Retrieval (template, context, vector) LLM Generation Response Formatting