Conflicting Sources in Context

The Problem

Retrieved chunks contain contradictory information from different documents, causing AI to provide inconsistent or confused answers.

Symptoms

  • ❌ AI hedges: "Some sources say X, others say Y"

  • ❌ Contradictory statements in response

  • ❌ Unclear which source is authoritative

  • ❌ Old vs new docs both retrieved

  • ❌ Different product tiers mixed up

Real-World Example

Retrieved chunks:
→ Chunk A (2022 docs): "API rate limit is 100 req/hour"
→ Chunk B (2024 docs): "API rate limit is 1000 req/hour"

Both retrieved with similar scores

AI response: "The API rate limit is 100 requests per hour, though
some documentation indicates it may be 1000 requests per hour."

Confused answer - which is correct?

Deep Technical Analysis

Version Conflicts

Document Versioning:

Timestamp Metadata:

Multi-Source Inconsistencies

Different Systems of Record:

Source Trust Levels:

Reconciliation Strategies

LLM Arbitration:

Pre-Retrieval Filtering:

Citation Clarity

Source Attribution:


How to Solve

Tag chunks with version/timestamp + filter to most recent docs + assign source authority levels (official > community) + implement conflict detection + prefer highest-priority source + add "last updated" in responses + archive old docs with metadata flag + prompt LLM to prefer recent over old when conflict detected. See Source Conflicts.

Last updated