Why tabular foundation models should be a research priority

Boris van Breugel, Mihaela van der Schaar · 2022 · arXiv 2405.01147

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

read on arXiv browse 9 citing papers

citation-role summary

other 1

citation-polarity summary

unclear 1

representative citing papers

Data Language Models: A New Foundation Model Class for Tabular Data

cs.AI · 2026-05-07 · unverdicted · novelty 7.0

Schema-1 is the first Data Language Model that natively understands raw tabular data and outperforms gradient-boosted ensembles, AutoML, and prior tabular foundation models on row-level prediction and imputation tasks.

TFM-Retouche: A Lightweight Input-Space Adapter for Tabular Foundation Models

cs.LG · 2026-05-07 · unverdicted · novelty 7.0 · 2 refs

TFM-Retouche is an architecture-agnostic input-space residual adapter that improves tabular foundation model accuracy on 51 datasets by learning input corrections through the frozen backbone, with an identity guard to fall back to the original model.

Tables Guide Vision: Learning to See the Heart through Tabular Data

cs.CV · 2025-03-19 · unverdicted · novelty 7.0

Tabular clinical data guides contrastive learning on cardiac MR images to build better visual representations by identifying patient similarities, outperforming image-only augmentation on downstream disease prediction tasks.

LLM-TabLogic: Preserving Inter-Column Logical Relationships in Synthetic Tabular Data via Prompt-Guided Latent Diffusion

cs.LG · 2025-03-04 · unverdicted · novelty 7.0

LLM-TabLogic extracts inter-column logical constraints using LLMs and conditions a score-based latent diffusion model on them to generate synthetic tabular data that preserves those relationships.

SQuARE: Structured Query & Adaptive Retrieval Engine For Tabular Formats

cs.CL · 2025-12-03 · unverdicted · novelty 6.0

SQuARE is a hybrid retrieval system that uses a complexity score to route tabular queries between chunk-based and SQL-based paths, outperforming single-strategy baselines and GPT-4o on precision and accuracy for complex spreadsheets.

Self-Improving Tabular Language Models via Iterative Reward-Guided Post-Training

cs.LG · 2026-04-21 · unverdicted · novelty 5.0

TabGRAA applies group-relative advantage alignment in an iterative reward-guided post-training loop to improve tabular language model generators on fidelity, utility, and privacy trade-offs across five benchmarks.

TREASURE: The Visa Payment Foundation Model for High-Volume Transaction Understanding

cs.LG · 2025-11-24 · unverdicted · novelty 5.0

TREASURE is a transformer model for payment transactions that boosts abnormal behavior detection performance by 111% over production systems and improves recommendation models by 104% when used as an embedding provider.

Noise Immunity in In-Context Tabular Learning: An Empirical Robustness Analysis of TabPFN's Attention Mechanisms

cs.LG · 2026-04-06 · unverdicted · novelty 4.0

TabPFN maintains high ROC-AUC and structured attention under controlled additions of irrelevant features, nonlinear correlations, and mislabeled targets in binary classification.

Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation

cs.LG · 2025-01-03 · unverdicted · novelty 3.0

CTGAN and LLMs generate synthetic student data that passes statistical and predictive utility checks for learning analytics.

citing papers explorer

Showing 9 of 9 citing papers.

Data Language Models: A New Foundation Model Class for Tabular Data cs.AI · 2026-05-07 · unverdicted · none · ref 18
Schema-1 is the first Data Language Model that natively understands raw tabular data and outperforms gradient-boosted ensembles, AutoML, and prior tabular foundation models on row-level prediction and imputation tasks.
TFM-Retouche: A Lightweight Input-Space Adapter for Tabular Foundation Models cs.LG · 2026-05-07 · unverdicted · none · ref 4 · 2 links
TFM-Retouche is an architecture-agnostic input-space residual adapter that improves tabular foundation model accuracy on 51 datasets by learning input corrections through the frozen backbone, with an identity guard to fall back to the original model.
Tables Guide Vision: Learning to See the Heart through Tabular Data cs.CV · 2025-03-19 · unverdicted · none · ref 36
Tabular clinical data guides contrastive learning on cardiac MR images to build better visual representations by identifying patient similarities, outperforming image-only augmentation on downstream disease prediction tasks.
LLM-TabLogic: Preserving Inter-Column Logical Relationships in Synthetic Tabular Data via Prompt-Guided Latent Diffusion cs.LG · 2025-03-04 · unverdicted · none · ref 15
LLM-TabLogic extracts inter-column logical constraints using LLMs and conditions a score-based latent diffusion model on them to generate synthetic tabular data that preserves those relationships.
SQuARE: Structured Query & Adaptive Retrieval Engine For Tabular Formats cs.CL · 2025-12-03 · unverdicted · none · ref 17
SQuARE is a hybrid retrieval system that uses a complexity score to route tabular queries between chunk-based and SQL-based paths, outperforming single-strategy baselines and GPT-4o on precision and accuracy for complex spreadsheets.
Self-Improving Tabular Language Models via Iterative Reward-Guided Post-Training cs.LG · 2026-04-21 · unverdicted · none · ref 73
TabGRAA applies group-relative advantage alignment in an iterative reward-guided post-training loop to improve tabular language model generators on fidelity, utility, and privacy trade-offs across five benchmarks.
TREASURE: The Visa Payment Foundation Model for High-Volume Transaction Understanding cs.LG · 2025-11-24 · unverdicted · none · ref 25
TREASURE is a transformer model for payment transactions that boosts abnormal behavior detection performance by 111% over production systems and improves recommendation models by 104% when used as an embedding provider.
Noise Immunity in In-Context Tabular Learning: An Empirical Robustness Analysis of TabPFN's Attention Mechanisms cs.LG · 2026-04-06 · unverdicted · none · ref 23
TabPFN maintains high ROC-AUC and structured attention under controlled additions of irrelevant features, nonlinear correlations, and mislabeled targets in binary classification.
Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation cs.LG · 2025-01-03 · unverdicted · none · ref 9
CTGAN and LLMs generate synthetic student data that passes statistical and predictive utility checks for learning analytics.

Why tabular foundation models should be a research priority

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer