Tech · 4h ago

GenWildSplat Enables Generalizable 3D Reconstruction from Unconstrained Images


Original post

Jia-Bin Huang @jbhuang0604 · #321 in Tech

GenWildSplat

Feedforward 3D models are awesome, but they can't handle in-the-wild images with varying illumination.

Check out GenWildSplat, which fills this gap! https://genwildsplat.github.io/

Jia-Bin Huang @jbhuang0604

CVPR 2026 was a blast! 🥳

It was great meeting old and new friends and presenting our work.

Summarized below if you missed it! 🧵

10:35 AM · Jun 11, 2026 · 1.3K Views

Cluster Engagement

Posts from X

Most Activity

Views: 1K · Bookmarks: 7 · Likes: 12 · Retweets: 1 · Replies: 1

Jia-Bin Huang @jbhuang0604

Edit by Track

Video editing requires precise spatial control. Edit by Track produces intuitive, compelling edits using 3D point tracks.

https://edit-by-track.github.io/

Jia-Bin Huang @jbhuang0604

TraceGen

Video world models help robot learning.

But why do we need to predict all these pixels?

Let's model the world using 3D traces!
👉 computationally efficient
👉 embodiment-agnostic
👉 task-relevant

https://tracegen.github.io/

Jia-Bin Huang @jbhuang0604

Coupled Diffusion

So many awesome 2D diffusion models for a wide range of tasks. BUT, how can we get the best of both 2D and multi-view diffusion models?

Coupled diffusion shows a simple training-free approach to do so!

https://coupled-diffusion.github.io/

Jia-Bin Huang @jbhuang0604

UniVerse

Most existing customization models either require costly per-concept optimization or clean reference images.

UniVerse enables decomposing and composing multiple visual concepts from unsegmented images.

https://universe-personalization.github.io/

Jia-Bin Huang @jbhuang0604

SIMPACT

VLMs have common-sense and semantic reasoning capabilities.

But, they suck at predicting physical consequences.

We equip VLMs with physical reasoning through simulation-in-the-loop world modeling!

https://simpact-bot.github.io/
