Optimizer class finds Pareto-optimal RAG configurations using Multi-Objective Bayesian Optimization, balancing cost, latency, and quality.
Quick Start
How It Works
- Setup: Loads search space and initializes components
- Bootstrap: Generates initial training data (10 samples)
- Optimize: Runs Bayesian Optimization loop proposing and evaluating configurations
- Return: Best configurations balancing multiple objectives
Configuration
Basic Usage
Advanced Options
Performance Tips
- Start small: Test with
n_trials=5, then scale to 50-100 for production - Eager loading: For small search spaces, use
eager_load=Truein RAGPipelineManager - Parallel evaluation: Adjust
max_workersin RAGPipelineManager for faster optimization - Hugginface: Using Hugging Face models will slow down the optimization process.