Valid Feature-Level Inference for Tabular Foundation Models via the Conditional Randomization Test

Dataemia
arXiv:2603.06609v1 Announce Type: new
Abstract: Modern machine learning models are highly expressive but notoriously difficult to analyze statistically. In particular, while black-box predictors can achieve strong empirical performance, they rarely provide valid hypothesis tests or p-values for assessing whether individual features contain information about a target variable. This article presents a practical approach to feature-level hypothesis testing that combines the Conditional Randomization Test (CRT) with TabPFN, a probabilistic foundation model for tabular data. The resulting procedure yields finite-sample valid p-values for conditional feature relevance, even in nonlinear and correlated settings, without requiring model retraining or parametric assumptions.
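The core idea in the abstract, testing whether feature X_j carries information about y given the remaining features, can be sketched with a generic Conditional Randomization Test. The sketch below is illustrative, not the paper's implementation: the `sample_conditional` and `statistic` callables are hypothetical placeholders (the paper plugs in TabPFN's predictive distributions; here a known Gaussian conditional and an absolute-correlation statistic stand in). The p-value formula `(1 + #exceedances) / (K + 1)` is what gives the CRT its finite-sample validity under the null.

```python
import numpy as np

def crt_pvalue(X, y, j, sample_conditional, statistic, K=500, rng=None):
    """Conditional Randomization Test for H0: X_j independent of y given X_{-j}.

    sample_conditional(X_minus_j, rng) draws a fresh copy of X_j from its
    conditional distribution given the other features (the model-X assumption).
    statistic(xj, X_minus_j, y) scores how informative xj is about y.
    Larger values of the statistic indicate more evidence against H0.
    """
    rng = np.random.default_rng(rng)
    X_minus_j = np.delete(X, j, axis=1)
    t_obs = statistic(X[:, j], X_minus_j, y)
    exceed = 0
    for _ in range(K):
        xj_tilde = sample_conditional(X_minus_j, rng)  # null copy of feature j
        if statistic(xj_tilde, X_minus_j, y) >= t_obs:
            exceed += 1
    # Finite-sample valid p-value: under H0 it is (super-)uniform on (0, 1].
    return (1 + exceed) / (K + 1)

# Toy demonstration (hypothetical data, not from the paper): three i.i.d.
# standard normal features, so the conditional law of any one feature given
# the others is simply N(0, 1).
rng = np.random.default_rng(0)
n = 300
X = rng.standard_normal((n, 3))
y = 2.0 * X[:, 0] + rng.standard_normal(n)  # only feature 0 is relevant

def gaussian_sampler(X_minus_j, rng):
    return rng.standard_normal(X_minus_j.shape[0])

def abs_corr(xj, X_minus_j, y):
    return abs(np.corrcoef(xj, y)[0, 1])

p_relevant = crt_pvalue(X, y, j=0, sample_conditional=gaussian_sampler,
                        statistic=abs_corr, K=500, rng=1)
p_null = crt_pvalue(X, y, j=1, sample_conditional=gaussian_sampler,
                    statistic=abs_corr, K=500, rng=2)
print(f"p-value, relevant feature: {p_relevant:.4f}")
print(f"p-value, null feature:     {p_null:.4f}")
```

In the paper's setting the Gaussian sampler and correlation statistic would be replaced by TabPFN-based conditional sampling and predictive loss, but the validity of the p-value comes from the resampling scheme itself, not from the quality of the statistic.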


