Documentation Index
Fetch the complete documentation index at: https://docs.priorlabs.ai/llms.txt
Use this file to discover all available pages before exploring further.
Thinking mode is only available through the TabPFN API (via
tabpfn-client or the REST endpoints). It is not part of the open-source tabpfn package.When to use it
| Situation | Guidance |
|---|---|
| High-ROI use cases where small accuracy gains matter (finance, healthcare) | Use thinking mode — the one-time fit cost fuels recurring predictions |
| Pipelines with highly unstable ground truth data | Leave thinking off; use default TabPFN fit |
Quickstart
Thinking mode is available only in the API with TabPFN-3-Plus. Local execution is not supported.
- Classification
- Regression
Choosing effort and metric
thinking_effort controls how much compute is spent during fitting. thinking_metric sets the target.
Effort levels:
| Use case | Recommended effort |
|---|---|
| You need maximum accuracy and can trade off fit time | Enable Thinking mode and set thinking_effort="high" |
| You want a balance between quality and speed | Start with default Thinking mode (thinking_effort="medium") |
| Task | Metrics |
|---|---|
| Classification | accuracy, log_loss, roc_auc |
| Regression | rmse, mae |
thinking_timeout_s to cap the wall-clock time spent on optimization.
REST API
CallPOST /tabpfn/fit with the thinking parameters in the JSON body:
Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
thinking_mode | bool | False | Enable thinking mode. Setting thinking_effort also enables it. |
thinking_effort | "medium" or "high" | None | Controls the effort & compute spent at fit time. Higher tends to give better results. |
thinking_timeout_s | float | None | Wall-clock time budget in seconds |
thinking_metric | str | None | Target metric to optimize (see supported metrics above) |
On the REST API, the metric parameter is called
thinking_effort_metric.Limits
Thinking fits have a separate monthly quota from prediction tokens. The default is 20 thinking fits per month. When the quota is exhausted,POST /tabpfn/fit with thinking enabled returns HTTP 429.
If you need higher limits, see API metering for details or contact Prior Labs.
TabPFN-3 changelog
Full release notes including thinking mode.
API metering
Token budgets, thinking fit limits, and usage tracking.
Classification
Binary and multi-class classification guide.
Regression
Point estimates, quantiles, and full distributions.