BoLT

A benchmark suite for Bayesian optimization of expensive LLM tasks. Each problem is backed by a pretrained neural-network surrogate or tabular data from real LLM experiments, so evaluations are fast and reproducible without running real LLM training.

Installation

pip install bolt-bench

Quick Start

import torch
from bolt import HPO

# 7-dim HPO problem: returns a scalar surrogate of eval score
prob = HPO(noise_std=0.001, negate=False)

X = torch.Tensor([[0, 2, 2, 2, 0.5, 30, 2]])  # one candidate configuration
y = prob(X)  # shape: (1,)

Problems

Family	Classes	Notes
Hyperparameter optimization	`HPO`, `HPOMultiFidelityToken`, `HPOMultiFidelityModel`	mixed continuous/discrete/categorical; multi-fidelity variants available
Data mixture	`DMCurriculum`, `DMCurriculumMO`, `DMCurriculumHet`	simplex-constrained inputs; multi-objective and heteroscedastic variants
Prompt optimization	`PO128`, `PO256`, `PO512`, `PO768`	high-dimensional continuous embedding search (128–768 dims)

See Problems for full details on inputs, constraints, and fidelity parameters.

Next Steps

Problems — detailed problem descriptions and parameter tables
API Reference — full class and method documentation