Hallucination in Responses
The Problem
Symptoms
Real-World Example
Retrieved context: "API rate limit is 1000 requests per hour"
User query: "What happens if I exceed the rate limit?"
AI response: "If you exceed the rate limit of 1000 requests per hour,
your account will be temporarily suspended for 15 minutes and you'll
receive a 429 error. After three violations, your API key will be
permanently revoked."
Problem: Context only mentions the limit
→ "15 minutes suspension" - INVENTED
→ "three violations" policy - INVENTED
→ "permanent revocation" - INVENTED
Only "1000 requests/hour" and "429 error" might be accurateDeep Technical Analysis
Retrieval-Generation Gap
Instruction Following vs Grounding
Confidence Calibration Failure
Context Length Limitations
Mitigation Strategies
How to Solve
Last updated

