Response Inconsistency
The Problem
Symptoms
Real-World Example
Query: "What's the API rate limit?"
Response 1: "The rate limit is 1000 requests per hour"
Response 2: "You can make up to 100 requests per minute"
Response 3: "Rate limits vary by plan tier"
All from same knowledge base!
Causes: Different retrieved chunks, different model sampling, different timestampDeep Technical Analysis
Sources of Inconsistency
How to Solve
Last updated

