Skip to content

BoLT

A benchmark suite for Bayesian optimization of expensive LLM tasks. Each problem is backed by a pretrained neural-network surrogate or tabular data from real LLM experiments, so evaluations are fast and reproducible without running real LLM training.

Installation

pip install bolt-bench

Quick Start

import torch
from bolt import HPO

# 7-dim HPO problem: returns a scalar surrogate of eval score
prob = HPO(noise_std=0.001, negate=False)

X = torch.Tensor([[0, 2, 2, 2, 0.5, 30, 2]])  # one candidate configuration
y = prob(X)  # shape: (1,)

Problems

Family Classes Notes
Hyperparameter optimization HPO, HPOMultiFidelityToken, HPOMultiFidelityModel mixed continuous/discrete/categorical; multi-fidelity variants available
Data mixture DMCurriculum, DMCurriculumMO, DMCurriculumHet simplex-constrained inputs; multi-objective and heteroscedastic variants
Prompt optimization PO128, PO256, PO512, PO768 high-dimensional continuous embedding search (128–768 dims)

See Problems for full details on inputs, constraints, and fidelity parameters.

Next Steps

  • Problems — detailed problem descriptions and parameter tables
  • API Reference — full class and method documentation