Multi-objective hyperparameter optimization
In LLM development, we often need to balance multiple objectives, such as model performance, inference speed, and model size. Let’s implement multi-objective optimization using Optuna:
- Add the
import
statement and set up the hyperparameters:import optuna def objective(trial): hp = { "num_layers": trial.suggest_int("num_layers", 6, 24), "hidden_size": trial.suggest_categorical( "hidden_size", [512, 768, 1024]), "num_heads": trial.suggest_categorical( "num_heads", [8, 12, 16]), "ff_dim": trial.suggest_categorical...