BoLT
A benchmark suite for Bayesian optimization of expensive LLM tasks. Each problem is backed by a pretrained neural-network surrogate or tabular data from real LLM experiments, so evaluations are fast and reproducible without running real LLM training.
Installation
Quick Start
import torch
from bolt import HPO
# 7-dim HPO problem: returns a scalar surrogate of eval score
prob = HPO(noise_std=0.001, negate=False)
X = torch.Tensor([[0, 2, 2, 2, 0.5, 30, 2]]) # one candidate configuration
y = prob(X) # shape: (1,)
Problems
| Family | Classes | Notes |
|---|---|---|
| Hyperparameter optimization | HPO, HPOMultiFidelityToken, HPOMultiFidelityModel |
mixed continuous/discrete/categorical; multi-fidelity variants available |
| Data mixture | DMCurriculum, DMCurriculumMO, DMCurriculumHet |
simplex-constrained inputs; multi-objective and heteroscedastic variants |
| Prompt optimization | PO128, PO256, PO512, PO768 |
high-dimensional continuous embedding search (128–768 dims) |
See Problems for full details on inputs, constraints, and fidelity parameters.
Next Steps
- Problems — detailed problem descriptions and parameter tables
- API Reference — full class and method documentation