Foundation VAE for CT Reconstruction, Augmentation, and Generation

Qi Chen*1, Shuhan Ding*2, Yu Gu3, Nan Liu2, Jiang Bian3, Alan Yuille1, Zongwei Zhou1, Jingjing Fu3
1Johns Hopkins University, 2Duke-NUS Medical School, 3Microsoft Research
* Equal contribution
Preprint (February 18, 2026)

Abstract

We show that a single Foundation VAE, pretrained on natural images and videos, can serve as a unified training-free interface for CT reconstruction, augmentation, and generation. With frozen encoder and decoder, reconstructions preserve anatomy while suppressing acquisition noise, and training segmentation models on reconstructed CT improves boundary quality. In the same latent space, a conditional latent diffusion model generates anatomically consistent healthy and abnormal CT with explicit anatomy and report conditioning.

Method Overview

The framework has three connected parts: (1) CT reconstruction using frozen Foundation VAE, x~ = D(E(x)); (2) CT augmentation by using reconstructed volumes as a boundary-stable training view; (3) CT generation by conditional latent diffusion in the same fixed latent space with organ masks, disease masks, and radiology reports.

Qualitative Results

MSD Lung reconstruction and segmentation comparison
MSD Task06 Lung: reconstruction and segmentation comparison.
MSD Pancreas reconstruction and segmentation comparison
MSD Task07 Pancreas: reconstruction and segmentation comparison.

Controllable CT Generation

Three-view CT generation with anatomy and report conditioning
Three-view (axial/coronal/sagittal) generated CT with report conditioning.

Anatomical and Pathological Grounding

Anatomical and pathological grounding comparison
Spatial grounding under organ and disease mask constraints.

BibTeX

@article{chen2026foundationvae,
  title   = {Foundation VAE for CT Reconstruction, Augmentation, and Generation},
  author  = {Chen, Qi and Ding, Shuhan and Gu, Yu and Liu, Nan and Bian, Jiang and Yuille, Alan and Zhou, Zongwei and Fu, Jingjing},
  journal = {ICML 2026 (Preprint)},
  year    = {2026}
}
Local project page demo for ICML 2026 Foundation VAE.