What GPUs does TabPFN require at minimum?
What GPUs does TabPFN require at minimum?
device parameter of TabPFNClassifier and TabPFNRegressor.What are TabPFN's practical limits for context size and features?
What are TabPFN's practical limits for context size and features?
How are text features handled?
How are text features handled?
How are date features handled?
How are date features handled?
How reproducible are TabPFN results across runs and devices?
How reproducible are TabPFN results across runs and devices?
Does TabPFN handle missing values natively?
Does TabPFN handle missing values natively?
pd.NA, without requiring manual imputation.How should I handle imbalanced datasets?
How should I handle imbalanced datasets?
- Use
balance_probabilities=Trueto optimize for balanced accuracy, balanced loss or other evaluation metrics weighting each class equally regardless of its frequency. - Use
eval_metric="f1"(or other supported metrics) for specific precision-recall tradeoffs.
My run is 'slow' on thousands of rows - what controls the fit/predict trade-off?
My run is 'slow' on thousands of rows - what controls the fit/predict trade-off?
predict() step. That means latency scales roughly with the number of training rows. For fast inference we can use KV-caching (TabPFNClassifier(fit_mode="fit_with_cache")). We can also create a tree- or small MLP based model that yields almost the same accuracy as TabPFN. Contact sales@priorlabs.ai to access this solution.How can I speed up predictions on large test sets?
How can I speed up predictions on large test sets?
.predict() is called. It is much faster to make a prediction for all your test points in a single .predict() call rather than calling it repeatedly. If you run out of memory, split the test points into batches of 1,000 to 10,000 and call .predict() for each batch.You can also tune the memory_saving_mode and n_preprocessing_jobs parameters of TabPFNClassifier and TabPFNRegressor for additional speed improvements. See the code documentation for details.What is the fitted-model cache and when should I use it?
What is the fitted-model cache and when should I use it?
.fit(), making subsequent .predict() calls fast by using a KV-Cache. Enable it by setting the fit_mode parameter of TabPFNClassifier or TabPFNRegressor to fit_with_cache.For TabPFN-3, this setting consumes about 7GB of GPU memory per estimator (so 56GB for our default 8 estimators) for 1M rows (and roughly half for 500K rows). For previous model versions, this setting consumes a lot of memory and is not recommended
except for very small datasets.What PyTorch version should I use?
What PyTorch version should I use?
What are the API rate limits?
What are the API rate limits?
How do I know which TabPFN model version am I using in the TabPFN API Client?
How do I know which TabPFN model version am I using in the TabPFN API Client?