[Figure: Chatbot Response Latency Breakdown: Simple vs. Complex Queries. Stages shown: Intent Classification; Retrieval (template, context, vector); LLM Generation; Response Formatting.]