Your RAG-powered legal research assistant has been in production 3 months. A customer complains the assistant cited a case that does not exist. (1) What tracing data would you look at first? (2) Design a continuous monitoring system that would have caught this hallucination before users reported it. (3) You discover 3% of production responses cite cases not found in your retrieved context. Should you roll back? What are your options?