RF-PFN

Uses scikit-learn tree splits, but fits TabPFN models at each node or leaf.
Optionally prunes or refits nodes using validation-based performance checks.
Works with the typical parameters for tree-based methods.

Getting Started

To install the extension, include the rf_pfn extra:

pip install "tabpfn-extensions[rf_pfn]"

Once installed, you can easily combine TabPFN with decision trees or random forests:

Random Forest

Train a Random Forest where each tree uses TabPFN models at its leaves for adaptive predictions via RandomForestTabPFNRegressor and RandomForestTabPFNClassifier.

from tabpfn import TabPFNRegressor
from tabpfn_extensions.rf_pfn import RandomForestTabPFNRegressor

reg = TabPFNRegressor()

# Simply wrap the base TabPFN estimator with the RF extension
rf_reg = RandomForestTabPFNRegressor(tabpfn=reg)

rf_reg.fit(X_train, y_train)
y_hat = rf_reg.predict(X_test)

Decision Trees

Train standalone Decision Trees that delegate predictions at each leaf (or node) to TabPFN with DecisionTreeTabPFNRegressor and DecisionTreeTabPFNClassifier.

from tabpfn import TabPFNRegressor
from tabpfn_extensions.rf_pfn import DecisionTreeTabPFNRegressor

reg = TabPFNRegressor()

# Simply wrap the base TabPFN estimator with the RF extension
dt_reg = DecisionTreeTabPFNRegressor(tabpfn=reg)

dt_reg.fit(X_train, y_train)
y_hat = dt_reg.predict(X_test)

Core Parameters

The following table lists the core parameters. For more details see the rf_pfn extension on GitHub.

Random Forest

Param	Meaning (per code)	Default
`tabpfn`	Required TabPFN model	-
`max_depth`	Tree depth (per base DT)	cls: `5`, reg: `5`
`bootstrap`	Bootstrap samples when fitting trees	`True`
`rf_average_logits`	Classifier: average logits across trees	cls: `True`, reg: `False`
`dt_average_logits`	Classifier: average logits within trees	`True`
`max_predict_time`	Stop averaging once time exceeded (seconds)	cls: `60`, reg: `-1`
`min_samples_split`	Split threshold per DT	cls: `1000`, reg: `300`
`min_samples_leaf`	Min samples per leaf	`5`
`max_features`	Features considered at split	`"sqrt"`
`fit_nodes`	Fit TabPFN at internal nodes	`True`

Decision Trees

Param	Meaning (per code)	Default
`tabpfn`	Required TabPFN model	-
`max_depth`	Max depth	`None` (unlimited)
`min_samples_split`	Split threshold per DT	`1000`
`min_samples_leaf`	Min samples per leaf	`1`
`max_features`	Features considered at split	`None` (all features)
`fit_nodes`	Fit TabPFN at internal nodes	`True`

Getting Started

Capabilities

Extensions

Integrations

Use Cases

Getting Started

Random Forest

Decision Trees

Core Parameters

Random Forest

Decision Trees

Getting Started

Capabilities

Extensions

Integrations

Use Cases

​Getting Started

​Random Forest

​Decision Trees

​Core Parameters

​Random Forest

​Decision Trees

Getting Started

Random Forest

Decision Trees

Core Parameters

Random Forest

Decision Trees