hub Canonical reference

TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second

· 2022 · cs.LG · arXiv 2207.01848

Canonical reference. 88% of citing Pith papers cite this work as background.

56 Pith papers citing it

Background 88% of classified citations

open full Pith review browse 56 citing papers arXiv PDF

abstract

We present TabPFN, a trained Transformer that can do supervised classification for small tabular datasets in less than a second, needs no hyperparameter tuning and is competitive with state-of-the-art classification methods. TabPFN performs in-context learning (ICL), it learns to make predictions using sequences of labeled examples (x, f(x)) given in the input, without requiring further parameter updates. TabPFN is fully entailed in the weights of our network, which accepts training and test samples as a set-valued input and yields predictions for the entire test set in a single forward pass. TabPFN is a Prior-Data Fitted Network (PFN) and is trained offline once, to approximate Bayesian inference on synthetic datasets drawn from our prior. This prior incorporates ideas from causal reasoning: It entails a large space of structural causal models with a preference for simple structures. On the 18 datasets in the OpenML-CC18 suite that contain up to 1 000 training data points, up to 100 purely numerical features without missing values, and up to 10 classes, we show that our method clearly outperforms boosted trees and performs on par with complex state-of-the-art AutoML systems with up to 230$\times$ speedup. This increases to a 5 700$\times$ speedup when using a GPU. We also validate these results on an additional 67 small numerical datasets from OpenML. We provide all our code, the trained TabPFN, an interactive browser demo and a Colab notebook at https://github.com/automl/TabPFN.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 7 method 1

citation-polarity summary

background 7 use method 1

representative citing papers

Privacy Auditing with Zero (0) Training Run

cs.CR · 2026-05-14 · unverdicted · novelty 8.0

Zero-Run auditing supplies valid lower bounds on differential privacy parameters from fixed member and non-member datasets by modeling and correcting distribution-shift confounding via causal-inference techniques.

What learning algorithm is in-context learning? Investigations with linear models

cs.LG · 2022-11-28 · accept · novelty 8.0

Transformers performing in-context learning implicitly implement gradient descent, ridge regression, and least-squares predictors for linear models, with behavior shifting based on model depth, width, and data noise.

$\alpha$-PFN: Fast Entropy Search via In-Context Learning

cs.LG · 2026-06-05 · unverdicted · novelty 7.0

α-PFN trains two PFNs in sequence to predict expected information gain for entropy search, delivering over 50x speedups while remaining competitive on synthetic and real-world benchmarks.

Toward Calibrated, Fair, and accurate Deepfake Detection

cs.LG · 2026-06-03 · unverdicted · novelty 7.0

Face-Feature Tuning is a label-free logit remapping method that reduces FPR/TPR gaps across groups in deepfake detection while preserving overall accuracy.

TabPrep: Closing the Feature Engineering Gap in Tabular Benchmarks

cs.LG · 2026-06-01 · unverdicted · novelty 7.0

TabPrep is a new feature engineering pipeline that targets three data patterns and improves performance of tree-based, neural, linear, and foundation models on tabular benchmarks, often more than model architecture changes.

TabQL: In-Context Q-Learning with Tabular Foundation Models

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

TabQL is a reinforcement learning framework that substitutes a tabular foundation model with in-context capabilities for the parametric Q-network in DQN, with a warm-up phase and theoretical analysis claiming improved sample efficiency.

Rethinking Side-Channel Analysis: Automated Discovery and Analysis of Side-Channel Leakage with LLM-Assisted Agents

cs.CR · 2026-05-17 · unverdicted · novelty 7.0

SCAgent automates side-channel leakage discovery via LLM agents for target identification and few-shot foundation models for scalable analysis on iOS.

SurvivalPFN: Amortizing Survival Prediction via In-Context Bayesian Inference

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

SurvivalPFN amortizes Bayesian survival analysis for right-censored data by pretraining a prior-data fitted network on synthetic identifiable DGPs and then performing in-context inference, achieving competitive results on 61 real datasets.

FORGE: Fragment-Oriented Ranking and Generation for Context-Aware Molecular Optimization

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

FORGE reformulates molecular optimization as context-aware fragment ranking and replacement using mined low-to-high edit pairs, outperforming larger language models and graph methods on standard benchmarks.

Quantifying the Risk-Return Tradeoff in Forecasting

econ.EM · 2026-05-10 · unverdicted · novelty 7.0

Forecast loss differentials are reframed as returns and assessed with risk-adjusted finance metrics, showing professional forecasters are harder to beat on risk-adjusted performance than on raw accuracy in US macro forecasting.

Data Language Models: A New Foundation Model Class for Tabular Data

cs.AI · 2026-05-07 · unverdicted · novelty 7.0

Schema-1 is the first Data Language Model that natively understands raw tabular data and outperforms gradient-boosted ensembles, AutoML, and prior tabular foundation models on row-level prediction and imputation tasks.

TFM-Retouche: A Lightweight Input-Space Adapter for Tabular Foundation Models

cs.LG · 2026-05-07 · unverdicted · novelty 7.0 · 2 refs

TFM-Retouche is an architecture-agnostic input-space residual adapter that improves tabular foundation model accuracy on 51 datasets by learning input corrections through the frozen backbone, with an identity guard to fall back to the original model.

PHBench: A Benchmark for Predicting Startup Series A Funding from Product Hunt Launch Signals

q-fin.PR · 2026-05-03 · unverdicted · novelty 7.0

PHBench shows Product Hunt launch signals predict Series A funding with an ensemble model reaching AP 0.037 and F0.5 0.097 on blind test data, outperforming logistic regression and zero-shot LLMs.

Explainable Load Forecasting with Covariate-Informed Time Series Foundation Models

cs.LG · 2026-04-30 · unverdicted · novelty 7.0

Time series foundation models match the performance of specialized models for day-ahead load forecasting while providing explanations that match domain knowledge on weather and calendar effects.

Selecting Feature Interactions for Generalized Additive Models by Distilling Foundation Models

cs.LG · 2026-04-14 · unverdicted · novelty 7.0

TabDistill distills feature interactions from tabular foundation models via post-hoc attribution and inserts them into GAMs, yielding consistent predictive gains.

Environmental, Social and Governance Sentiment Analysis on Slovene News: A Novel Dataset and Models

cs.CL · 2026-04-08 · unverdicted · novelty 7.0

The authors release the first Slovene ESG sentiment dataset from news and report that large language models lead on environmental and social classification while fine-tuned SloBERTa performs best on governance.

Talking Trees: Reasoning-Assisted Induction of Decision Trees for Tabular Data

cs.LG · 2025-09-25 · unverdicted · novelty 7.0

Reasoning LLMs with minimal tools for tree construction and analysis induce decision trees that outperform CART, compete with ensembles on low-resource tabular data, and provide human-readable reasoning traces.

Probabilistic Low-Voltage Peak Load Forecasting with Time Series Foundation Models Evaluated on Application-Oriented Metrics

cs.LG · 2026-07-02 · unverdicted · novelty 6.0

Compares foundation models for probabilistic low-voltage load forecasting on 200 real feeders and introduces a grid-planning metric that scores peak prediction by its effect on asset cost-risk decisions.

Relational and Sequential Conformal Inference for Energy Time Series over Graphs via Foundation Models

cs.LG · 2026-06-30 · unverdicted · novelty 6.0

STOIC integrates STGNN point forecasting with tabular foundation model in-context learning for conformal prediction to quantify uncertainty in graph-structured energy time series.

Trio: Learning Time-Series Forecasting with Temporal-Spatial-Sample Attention and Structural Causal Priors

cs.LG · 2026-06-05 · unverdicted · novelty 6.0

Trio proposes Temporal-Spatial-Sample attention and a TS-SCM synthetic data generator to improve multivariate time-series forecasting by reusing historical patterns and structural priors.

Disentangled Fine-Grained Prototype Learning for Incomplete Image-Tabular Classification

cs.CV · 2026-06-03 · unverdicted · novelty 6.0

DFPL introduces prototype-based disentanglement and alignment modules to preserve fine-grained consistency across heterogeneous modalities for better performance under missing data conditions.

LimiX-2M: Mitigating Low-Rank Collapse and Attention Bottlenecks in Tabular Foundation Models

cs.LG · 2026-06-03 · unverdicted · novelty 6.0

LimiX-2M outperforms larger TabPFN-v2 and TabICL models on tabular benchmarks by expanding scalars into RBF features and using a reordered S->N->F attention block.

LLMTabBench: Evaluating LLMs on Binary Tabular Classification From Zero to Few Shots

cs.LG · 2026-05-23 · unverdicted · novelty 6.0

LLMTabBench evaluates LLMs on zero- and few-shot binary tabular classification and reports that zero-shot can outperform few-shot due to example conflicts with model priors while performance drops beyond a complexity threshold.

FLUXtrapolation: A benchmark on extrapolating ecosystem fluxes

cs.LG · 2026-05-19 · unverdicted · novelty 6.0

FLUXtrapolation is a benchmark for domain generalization in ecosystem flux upscaling using temporal, spatial, and temperature-based extrapolation scenarios, with pilot results showing model separation on tail and multi-scale metrics.

citing papers explorer

Showing 1 of 1 citing paper after filters.

From Uniform to Learned Knots: A Study of Spline-Based Numerical Encodings for Tabular Deep Learning cs.LG · 2026-04-07 · unverdicted · none · ref 12 · internal anchor
Spline encodings for numerical features show task-dependent performance in tabular deep learning, with piecewise-linear encoding robust for classification and variable results for regression depending on spline family, knot strategy, and backbone.

TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer