Memory & Context

Learn how Twig AI agents maintain conversation context and use memory to provide coherent, contextual responses across multiple interactions.

What is Memory?

Memory in AI agents refers to the ability to:

  • Remember previous messages in a conversation

  • Maintain context across multiple turns

  • Reference earlier parts of the dialogue

  • Provide contextually relevant responses

Without memory, each query is treated independently. With memory, agents understand the conversation flow.

How Memory Works

Conversation Flow

Without Memory (Stateless):

User: "What are your pricing plans?"
Agent: "We have Free, Pro, and Enterprise plans..."

User: "What's included in the middle one?"
Agent: "I'm not sure what you're referring to." ❌

With Memory (Stateful):

Memory Storage

Session-Based Memory:

  • Stored for duration of conversation

  • Cleared when session ends

  • Tied to sessionId

  • Best for: Live conversations, chat widgets

User-Level Memory:

  • Persists across sessions

  • Stored per user

  • Can span days/weeks

  • Best for: Returning users, ongoing projects

Ephemeral Memory:

  • Exists only during single request

  • No persistence

  • Used for: Single-query endpoints

Memory Components

1. Conversation History

Stores the actual messages:

2. Context Summary

Compressed representation of conversation:

3. Entity Tracking

Tracks mentioned entities:

Memory Configuration

Basic Settings

Advanced Settings

Memory Strategies

1. Full History (Small Conversations)

Best for: Short conversations (< 10 turns)

Configuration:

Pros:

  • Complete context

  • No information loss

  • Best accuracy

Cons:

  • Token usage grows

  • Slower with long conversations

2. Windowed History (Medium Conversations)

Best for: Moderate conversations (10-20 turns)

Configuration:

Example:

Pros:

  • Balanced approach

  • Recent context preserved

  • Manageable token usage

Cons:

  • Some detail lost in summary

  • Slight complexity

3. Summarized History (Long Conversations)

Best for: Extended conversations (20+ turns)

Configuration:

Example:

Pros:

  • Handles very long conversations

  • Low token usage

  • Maintains key context

Cons:

  • Loss of detail

  • Summary quality matters

Context Enhancement

Memory-Enhanced Queries

Cedar and Cypress strategies use memory to improve retrieval:

Original Query:

With Memory Context:

Impact:

Entity Resolution

Memory helps resolve ambiguous references:

Session Management

Creating Sessions

Continuing Sessions

Ending Sessions

Sessions auto-expire based on TTL.

Memory in Different RAG Strategies

Redwood (Standard RAG)

Memory Usage:

  • Basic memory storage

  • Not used for query rewriting

  • Available in context

Impact: Minimal

Cedar (Context-Aware)

Memory Usage:

  • Active memory analysis

  • Used for query rewriting

  • Enhanced context

Example:

Impact: High - Memory directly improves query quality

Cypress (Advanced)

Memory Usage:

  • Comprehensive memory integration

  • Query expansion considers history

  • Context-aware reranking

Impact: Highest - Memory enhances all stages

Privacy & Security

PII Detection

Automatically detect and handle sensitive information:

Example:

Encryption

User Control

Best Practices

1. Choose Appropriate Memory Type

Session: Chat widgets, customer support ✅ User: Returning customers, ongoing projects ✅ Ephemeral: Public APIs, privacy-sensitive ❌ Don't use user memory for anonymous users

2. Set Reasonable Limits

✅ maxTurns: 10-20 for most cases ✅ Summarize after 10-15 turns ✅ TTL: 1 hour for sessions, 24 hours for user ❌ Don't store unlimited history

3. Handle Long Conversations

✅ Implement summarization ✅ Keep recent turns in full ✅ Monitor token usage ❌ Don't truncate abruptly

4. Respect Privacy

✅ Detect and mask PII ✅ Encrypt sensitive data ✅ Allow user deletion ✅ Clear expired sessions ❌ Don't store sensitive data unnecessarily

5. Test Memory Behavior

✅ Test multi-turn conversations ✅ Verify entity resolution ✅ Check summary quality ✅ Monitor token usage ❌ Don't assume it works

Troubleshooting

Memory Not Working

Symptoms:

  • Agent doesn't remember previous turns

  • Follow-up questions fail

  • No context continuity

Diagnosis:

Solutions:

  1. Verify memory is enabled

  2. Use same sessionId for follow-ups

  3. Check TTL hasn't expired

  4. Verify maxTurns not exceeded

Token Limit Exceeded

Symptoms:

  • Errors about context length

  • Truncated responses

  • Missing information

Solutions:

Poor Summarization

Symptoms:

  • Lost important context

  • Agent forgets key details

  • Irrelevant responses

Solutions:

Advanced Topics

Custom Memory Backends

Memory Analytics

Metrics to Monitor:

  • Average turns per session

  • Token usage per turn

  • Summary quality scores

  • Memory hit rate

Cross-Device Sync

Next Steps

Last updated