MulTaBench benchmark evaluates multimodal tabular learning strategies
MulTaBench pairs tabular records with free text and images across clinical, real-estate, and product domains, drawing its datasets from Kaggle and OpenML. It compares four modeling strategies: structured-only, unstructured-only, joint modeling over frozen embeddings, and target-aware representation learning. A paper posted on Hugging Face reports that target-aware representations outperform frozen LLM and VLM embeddings, and concludes that future progress depends on models that jointly process tabular and unstructured modalities while aligning their representations to the prediction target.
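To illustrate the frozen-versus-target-aware distinction, here is a minimal, hypothetical sketch (not the MulTaBench code or the paper's method). It stands in for the unstructured modality with a high-dimensional feature vector: the "frozen" route compresses it through a fixed, target-agnostic projection, as a generic pretrained encoder would, while the "target-aware" route learns the projection from the labels before fitting the predictor. All names and dimensions are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in for an unstructured modality (e.g. text or image content):
# 300 records, 50 raw features, with the label driven by a sparse signal.
n, d_raw = 300, 50
X_raw = rng.normal(size=(n, d_raw))
w_true = np.zeros(d_raw)
w_true[:3] = [2.0, -2.0, 1.5]                      # only 3 features matter
y = (X_raw @ w_true + 0.1 * rng.normal(size=n) > 0).astype(float)

def fit_logreg(X, y, lr=0.5, steps=800):
    """Plain logistic regression by full-batch gradient descent."""
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-np.clip(X @ w, -30, 30)))
        w -= lr * X.T @ (p - y) / len(y)
    return w

def acc(X, w):
    return float(((X @ w > 0) == (y > 0.5)).mean())

# Frozen route: a fixed, target-agnostic 2-d projection of the raw
# features (a generic encoder that never saw the labels).
P = rng.normal(size=(d_raw, 2))
X_frozen = X_raw @ P
acc_frozen = acc(X_frozen, fit_logreg(X_frozen, y))

# Target-aware route: learn a direction from the labels first, then use
# that supervised projection as the representation.
w_dir = fit_logreg(X_raw, y)
X_aware = (X_raw @ w_dir)[:, None]
acc_aware = acc(X_aware, fit_logreg(X_aware, y))

print(f"frozen: {acc_frozen:.2f}  target-aware: {acc_aware:.2f}")
```

Because the random projection is unlikely to preserve the sparse signal direction, the frozen representation loses most of the predictive information, while the supervised projection keeps it, which mirrors the benchmark's reported gap between frozen and target-aware representations.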