Overview

Twig AI offers three distinct RAG (Retrieval-Augmented Generation) strategies, each optimized for different use cases and performance requirements.

Strategy Comparison

Strategy

Speed

Prompt Rewriting

Retrieval Method

Reranking

Best For

Redwood

~1-2 sec

❌ No

Direct vector search

❌ No

Clear, simple questions

Cedar

~2-3 sec

✅ Context-aware

Memory-enhanced search

❌ No

Conversational queries

Cypress

~3-4 sec

✅ Advanced

Tier-based + expansion

✅ Yes

Complex, high-accuracy needs

Feature Matrix

Feature

Redwood

Cedar

Cypress

Vector Search

✅

Chunking

✅

Memory

✅

Privacy Controls

✅

Memory-Enhanced Prompt

❌

✅

Context-Aware Query Rewriting

❌

✅

Vector Retrieval Optimization

❌

✅

Tier-Based Source Retrieval

❌

✅

Automatic Reranking

❌

✅

Higher Retrieval Volume

❌

✅

Query Expansion

❌

✅

Redwood Strategy

Overview

The simplest and fastest RAG approach. Uses the original user query directly for vector search without any prompt rewriting.

How It Works

User asks a question
Original query is converted to embedding
Vector database returns top matching documents
Context is built from retrieved documents
LLM generates response with context

Performance

Speed: ~1-2 seconds
Token Usage: Minimal (single LLM call)
Cost: Lowest

When to Use Redwood

✅ Use when:

Questions are clear and well-formed
No ambiguity in user queries
Speed is the top priority
Simple, direct questions
High query volume with cost sensitivity

❌ Avoid when:

Questions are ambiguous or context-dependent
Follow-up questions that reference previous context
Complex or multi-part queries
Highest accuracy is critical

Example Use Cases

FAQ chatbots
Simple help desk queries
Product information lookup
Quick reference tools

Learn more about Redwood →

Cedar Strategy

Overview

Enhances retrieval by rewriting the user's query based on conversation context and memory before searching the vector database.

How It Works

User asks a question
System analyzes conversation history (memory)
Query is rewritten to be more explicit and searchable
Rewritten query is used for vector search
Context is built from retrieved documents
LLM generates response with full context

Performance

Speed: ~2-3 seconds
Token Usage: Moderate (additional rewriting call)
Cost: Medium

When to Use Cedar

✅ Use when:

Conversational queries are common
Questions reference previous context
Users ask follow-up questions
Ambiguous phrasing is frequent
Balance of speed and accuracy needed

❌ Avoid when:

Maximum speed is required
Queries are always self-contained
Budget is extremely tight
Ultra-high accuracy is critical

Example Use Cases

Customer support chatbots
Interactive help systems
Multi-turn conversations
General Q&A assistants

Learn more about Cedar →

Cypress Strategy

Overview

The most sophisticated RAG strategy combining query expansion, tier-based retrieval, and automatic reranking for maximum accuracy.

How It Works

User asks a question
Query is enhanced with memory (if available)
Query Expansion: Prompt is rewritten to include synonyms, related terms, and alternative phrasings
Tier 1 Retrieval: Search high-priority data sources (topK=50)
Tier 2 Retrieval: Search supplementary data sources (topK=50)
Reranking: All results are reranked using bge-reranker-v2-m3 model
Top 10 most relevant documents are selected
Context is built with highest quality results
Final query rewriting for LLM (context-aware)
LLM generates response with optimized context

Unique Features

Query Expansion for Retrieval:

Original: "reset password"
Expanded: "reset password, change password, recover account, 
          password reset process, account recovery, reset credentials"

Tier-Based Retrieval:

Tier 1: Official documentation, primary knowledge bases
Tier 2: Community content, secondary sources
Both tiers treated equally in reranking

Automatic Reranking:

Cross-encoder model (more accurate than vector similarity)
Considers full query-document relationship
Improves precision significantly

Performance

Speed: ~3-4 seconds
Token Usage: Higher (multiple rewriting + reranking)
Cost: Highest

When to Use Cypress

✅ Use when:

Accuracy is the top priority
Questions involve diverse terminology
Multiple data source tiers exist
Query ambiguity is common
Latency trade-off is acceptable
High-stakes decisions depend on answers

❌ Avoid when:

Speed is critical
Budget is constrained
Simple, clear questions only
Low query volume

Example Use Cases

Medical or legal Q&A (high accuracy required)
Complex technical documentation
Multi-domain knowledge bases
Enterprise knowledge management
Compliance-sensitive applications

Learn more about Cypress →

Performance Comparison

Latency

Redwood:  ▓░░░░░░░░░ 1-2 seconds
Cedar:    ▓▓▓░░░░░░░ 2-3 seconds
Cypress:  ▓▓▓▓▓░░░░░ 3-4 seconds

Accuracy

Redwood:  ▓▓▓▓▓▓░░░░ Good
Cedar:    ▓▓▓▓▓▓▓▓░░ Better
Cypress:  ▓▓▓▓▓▓▓▓▓▓ Best

Cost

Redwood:  ▓░░░░░░░░░ Lowest
Cedar:    ▓▓▓▓░░░░░░ Medium
Cypress:  ▓▓▓▓▓▓▓░░░ Highest

Choosing the Right Strategy

Decision Tree

Is speed the top priority?
├─ Yes → Use Redwood
└─ No
    └─ Are questions conversational/ambiguous?
        ├─ Sometimes → Use Cedar
        └─ Often
            └─ Is highest accuracy critical?
                ├─ Yes → Use Cypress
                └─ No → Use Cedar

By Use Case

Use Case

Recommended Strategy

Reason

FAQ Bot

Redwood

Clear questions, speed matters

Customer Support Chat

Cedar

Conversational, follow-ups common

Medical Q&A

Cypress

Accuracy is critical

Legal Research

Cypress

High-stakes, must be accurate

Product Documentation

Cedar

Balance of speed and accuracy

Internal Wiki

Cedar

Conversational queries

API Reference

Redwood

Technical, clear queries

Troubleshooting Guide

Cedar

Multi-step, contextual

Compliance Questions

Cypress

Cannot afford mistakes

Switching Strategies

You can change an agent's strategy at any time:

Open agent settings
Navigate to RAG Strategy
Select new strategy
Save changes
Test in Playground

Note: Changes take effect immediately. Test thoroughly before deploying to production.

A/B Testing Strategies

To compare strategies objectively:

Duplicate your agent
Assign different strategies to each copy
Use the same test questions
Compare responses, speed, and citations
Check analytics for quality metrics

Next Steps

PreviousMemory & Context NextRedwood - Standard RAG

Last updated 7 hours ago

hashtagStrategy Comparison

hashtagFeature Matrix

hashtagRedwood Strategy

hashtagOverview

hashtagHow It Works

hashtagPerformance

hashtagWhen to Use Redwood

hashtagExample Use Cases

hashtagCedar Strategy

hashtagOverview

hashtagHow It Works

hashtagPerformance

hashtagWhen to Use Cedar

hashtagExample Use Cases

hashtagCypress Strategy

hashtagOverview

hashtagHow It Works

hashtagUnique Features

hashtagPerformance

hashtagWhen to Use Cypress

hashtagExample Use Cases

hashtagPerformance Comparison

hashtagLatency

hashtagAccuracy

hashtagCost

hashtagChoosing the Right Strategy

hashtagDecision Tree

hashtagBy Use Case

hashtagSwitching Strategies

hashtagA/B Testing Strategies

hashtagNext Steps

Strategy Comparison

Feature Matrix

Redwood Strategy

Overview

How It Works

Performance

When to Use Redwood

Example Use Cases

Cedar Strategy

Overview

How It Works

Performance

When to Use Cedar

Example Use Cases

Cypress Strategy

Overview

How It Works

Unique Features

Performance

When to Use Cypress

Example Use Cases

Performance Comparison

Latency

Accuracy

Cost

Choosing the Right Strategy

Decision Tree

By Use Case

Switching Strategies

A/B Testing Strategies

Next Steps