The Hyperparameter Optimization (HPO) extension provides automatic hyperparameter tuning for TabPFN models using Bayesian optimization (via Hyperopt). It searches for the best configuration of both TabPFN model parameters and inference settings, improving predictive performance across classification and regression tasks.

Traditional TabPFN models are zero-shot: they don't require tuning for strong performance. However, for specific datasets, model variants, or evaluation goals, tuning hyperparameters can further improve accuracy, calibration, and robustness. The HPO module automates this process using a Bayesian search strategy that intelligently explores the parameter space to find the best-performing configuration.

Key features:
  • Optimized search spaces for classification and regression tasks
  • Support for multiple evaluation metrics - accuracy, ROC-AUC, F1, RMSE, MSE, MAE
  • Proper handling of categorical features via automatic encoding
  • Compatible with both TabPFN and TabPFN-client backends
  • Implements scikit-learn’s estimator interface for seamless pipeline integration
  • Built-in validation and stratification for reliable performance estimation
  • Configurable search algorithms - TPE (Bayesian) or Random Search

Getting Started

Install the hpo extension:
pip install "tabpfn-extensions[hpo]"
You can then automatically tune a TabPFN classifier using Bayesian optimization with just a few lines of code:
from tabpfn_extensions.hpo import TunedTabPFNClassifier
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split

# Load example dataset
X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Create a tuned classifier with 50 optimization trials
tuned_clf = TunedTabPFNClassifier(
    n_trials=50,                       # Number of configurations to explore
    metric="accuracy",                 # Metric to optimize
    categorical_feature_indices=[0, 2],  # Columns to treat as categorical (illustrative; this dataset is all-numeric)
    random_state=42                    # Ensures reproducibility
)

# Fit automatically searches for the best hyperparameters
tuned_clf.fit(X_train, y_train)

# Use like any sklearn model
y_pred = tuned_clf.predict(X_test)
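
Because the tuned estimator implements the scikit-learn interface, standard evaluation helpers work directly on it. A short follow-up to the snippet above (assuming predict_proba is available, as it is on TabPFN classifiers):

from sklearn.metrics import accuracy_score, roc_auc_score

# Score the held-out test set; predict_proba behaves as on any sklearn classifier
y_proba = tuned_clf.predict_proba(X_test)[:, 1]  # probability of the positive class
print(f"Accuracy: {accuracy_score(y_test, y_pred):.3f}")
print(f"ROC-AUC:  {roc_auc_score(y_test, y_proba):.3f}")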

Supported Metrics

Metric   | Description
-------- | -----------
accuracy | Classification accuracy (proportion of correct predictions)
roc_auc  | Area under the ROC curve (binary or multiclass)
f1       | F1 score (harmonic mean of precision and recall)
rmse     | Root mean squared error (regression)
mse      | Mean squared error (regression)
mae      | Mean absolute error (regression)

Supported Models

Model                 | Description
--------------------- | -----------
TunedTabPFNClassifier | TabPFN classifier with automatic hyperparameter tuning and categorical handling.
TunedTabPFNRegressor  | TabPFN regressor with automatic tuning for continuous prediction tasks.
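
For regression, the workflow is identical: swap in TunedTabPFNRegressor and pick a regression metric from the table above. A minimal sketch on a toy dataset (the constructor arguments mirror the classifier example; treat them as assumptions if your version differs):

from tabpfn_extensions.hpo import TunedTabPFNRegressor
from sklearn.datasets import load_diabetes
from sklearn.model_selection import train_test_split

# Load a small regression dataset
X, y = load_diabetes(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Tune against RMSE instead of a classification metric
tuned_reg = TunedTabPFNRegressor(
    n_trials=50,        # Number of configurations to explore
    metric="rmse",      # Regression metric from the table above
    random_state=42     # Ensures reproducibility
)
tuned_reg.fit(X_train, y_train)
y_pred = tuned_reg.predict(X_test)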

How it Works

Under the hood, the HPO system:
  • Splits your data into train and validation sets with optional stratification.
  • Samples a candidate configuration from the TabPFN hyperparameter space.
  • Trains a TabPFN model with those parameters.
  • Evaluates it using the chosen metric.
  • Updates its belief model via TPE (Tree-structured Parzen Estimator).
  • Repeats this process for n_trials, selecting the configuration with the best score.
Each run is fully reproducible, with built-in logging and random seed control.
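
The following is a minimal, self-contained sketch of that loop written directly against Hyperopt, the library the extension builds on. The search space here is illustrative (n_estimators and softmax_temperature are assumed TabPFN constructor parameters; the extension's actual space is broader and also covers inference settings):

import numpy as np
from hyperopt import STATUS_OK, Trials, fmin, hp, space_eval, tpe
from sklearn.datasets import load_breast_cancer
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from tabpfn import TabPFNClassifier

X, y = load_breast_cancer(return_X_y=True)
# Step 1: hold out a stratified validation split for scoring candidates
X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.2, stratify=y, random_state=42)

# Illustrative search space (assumed parameter names)
space = {
    "n_estimators": hp.choice("n_estimators", [1, 2, 4, 8]),
    "softmax_temperature": hp.uniform("softmax_temperature", 0.5, 1.5),
}

def objective(params):
    # Steps 2-4: sample a configuration, train TabPFN, score on validation data
    clf = TabPFNClassifier(**params)
    clf.fit(X_tr, y_tr)
    acc = accuracy_score(y_val, clf.predict(X_val))
    # Hyperopt minimizes, so return the negated metric as the loss
    return {"loss": -acc, "status": STATUS_OK}

# Steps 5-6: TPE refines its surrogate model after each trial and proposes the next
trials = Trials()
best = fmin(
    fn=objective,
    space=space,
    algo=tpe.suggest,  # use hyperopt.rand.suggest for plain random search
    max_evals=50,
    trials=trials,
    rstate=np.random.default_rng(42),  # seed for reproducibility
)
print("Best configuration:", space_eval(space, best))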