AI Labs | CertLabz

Lab 16: RAG System Configuration

RAG Architecture / Expert

Scenario: Enterprise Knowledge Base RAG

LegalTech Corp needs a Retrieval-Augmented Generation system for their 50,000 legal documents. Configure all components including embeddings, vector database, chunking strategy, retrieval parameters, and LLM settings. The system must handle 1,000 queries/hour with 95%+ relevance.

Learning Objectives:

Embedding Models: Select appropriate embedding dimensions and models
Chunking Strategy: Configure optimal chunk sizes and overlap
Vector Database: Set up indexing and search parameters
Retrieval Tuning: Configure top-k, similarity thresholds, reranking

RAG System Configuration

Configure all parameters

System Requirements

• Document corpus: 50,000 legal docs

• Avg document length: 15,000 tokens

• Query throughput: 1,000 queries/hour

• Target relevance: ≥95%

• Max response latency: 3 seconds

• Monthly budget: $5,000

Section 1: Embedding Configuration

Embedding Model

Embedding Dimensions

Distance Metric

Normalization

Section 2: Chunking Strategy

Chunk Size (tokens)

Chunk Overlap (tokens)

Chunking Method

Section 3: Vector Database Configuration

Vector Database

Index Type

HNSW M Parameter

EF Construction

Section 4: Retrieval Parameters

Top-K Results

Similarity Threshold

Reranker Model

Hybrid Search Weight (BM25)

Query Expansion

Context Window

Section 5: LLM Generation Settings

LLM Model

Temperature

Max Output Tokens

Top-P (Nucleus Sampling)

Frequency Penalty

Presence Penalty

Section 6: Cost & Performance Calculations

Based on your configuration, calculate the following metrics:

Total Chunks in Index

50,000 docs × 15,000 tokens avg / chunk_size

Monthly Embedding Cost ($)

total_tokens / 1M × embed_price

Monthly Query Cost ($)

1000 queries/hr × 24 × 30 × token costs

Estimated Retrieval Latency (ms)

HNSW: ~50ms, IVF: ~100ms, Flat: ~500ms

Progress: 0/26 fields configured

Score: 0/100

0%

Lab Completed!

Excellent RAG configuration!

Lab 17: LLM Security Red Team

Security / Critical

Scenario: AI Security Assessment

BankSecure AI deployed a customer service chatbot that handles sensitive financial queries. Conduct a red team assessment to identify vulnerabilities, craft attack vectors, and design defensive measures to harden the system.

Learning Objectives:

Attack Taxonomy: Understand prompt injection, jailbreaks, data exfiltration
Vulnerability Testing: Craft and test attack payloads
Defense Strategies: Implement input sanitization, output filtering
Security Hardening: Design defense-in-depth measures

Red Team Workbench

Identify vulnerabilities

📋 Task: Security Red Team Assessment

Identify 3 attack vectors, craft test payloads for each, and design corresponding defense mechanisms. Each attack must include the vulnerability type, sample payload, and mitigation strategy.

Known Attack Categories

• Direct Prompt Injection

• Indirect Prompt Injection

• Jailbreak Attempts

• Data Exfiltration

• Model Extraction

• Denial of Service

• PII Extraction

• System Prompt Leakage

Attack Vectors (0/3 required)

No attack vectors defined. Add attacks to begin red team assessment.

Defense Configuration

Configure defenses for each identified vulnerability. Each defense must address the specific attack vector.

Add attack vectors first to configure defenses.

Progress: 0/5 tasks completed

Score: 0/100

0%

Lab Completed!

Excellent security assessment!

Lab 18: LLM Token & Cost Calculator

Cost Analysis / Expert

Scenario: Production Chatbot Cost Estimation

TechSupport Inc. is launching an AI chatbot and needs to estimate operational costs. Using the provided traffic data and model pricing, calculate token usage and select the most cost-effective model that stays within budget.

Learning Objectives:

Token Calculation: Compute input/output token volumes
Cost Estimation: Apply pricing per million tokens
Model Selection: Choose optimal model within budget
System Prompts: Account for per-conversation overhead

Token Cost Calculator

Calculate token costs

📋 Task: Calculate LLM Operational Costs

Using the scenario data and model pricing below, calculate total daily tokens, select the most cost-effective model under budget, and compute the daily cost. All answers have exact correct values.

Scenario Data

Your chatbot receives the following daily traffic:

• Daily conversations: 5,000

• Avg messages per conversation: 6

• Avg input tokens per message: 150

• Avg output tokens per message: 200

• System prompt tokens: 500

• Budget limit: $800/day

Model Pricing (per 1M tokens)

Model

Input

Output

Context

GPT-4 Turbo

$10.00

$30.00

128K

GPT-4o

$2.50

$10.00

128K

GPT-3.5 Turbo

$0.50

$1.50

16K

Claude 3 Sonnet

$3.00

$15.00

200K

Claude 3 Haiku

$0.25

$1.25

200K

Task 1: Calculate Total Daily Messages

How many total messages does the system process per day?

Formula: conversations × messages_per_conversation

Task 2: Calculate Daily Input Tokens

Total input tokens per day (including system prompt sent with each conversation)?

Formula: (total_messages × input_tokens_per_msg) + (conversations × system_prompt_tokens)

Task 3: Calculate Daily Output Tokens

Total output tokens generated per day?

Formula: total_messages × output_tokens_per_msg

Task 4: Select Most Cost-Effective Model

Which model stays under budget ($800/day) at the lowest cost?

Task 5: Calculate Daily Cost for Selected Model

What is the total daily cost using your selected model? (to nearest dollar)

Formula: (input_tokens/1M × input_price) + (output_tokens/1M × output_price)

Progress: 0/5 tasks completed

Score: 0/100

0%

Lab Completed!

Excellent cost analysis!

AI & Machine Learning Labs

GenAI Expert Labs - Module 6

Learning Objectives:

RAG System Configuration

Lab Completed!

Learning Objectives:

Red Team Workbench

Lab Completed!

Learning Objectives:

Token Cost Calculator

Lab Completed!

Lab 16: RAG Configuration Instructions

Objective

Configuration Sections

Pro Tips

Common Values

Lab 17: Security Red Team Instructions

Objective

Assessment Steps

Attack Examples

Requirements

Lab 18: Cost Calculator Instructions

Objective

Calculation Tasks

Cost Formula

Budget Hint