3D content creation with touch: TactileDreamFusion exploits high-resolution tactile sensing to enhance geometric details for text- or image-to-3D generation.
FlashTex textures an input 3D mesh given a user-provided text prompt. Notably, our generated texture can be relit properly in different lighting environments.
IEEE Robotics and Automation Letters (RA-L), 2024
github
We propose a new optimization formulation to infer the structural stability of block stacking assembly. We also provide StableLego: a comprehensive Lego assembly dataset of more than 50k Lego structures with their stability inferences.
We propose pix2pix3D, a 3D-aware conditional generative model for controllable
photorealistic image synthesis. Given a 2D label map, such as a segmentation or edge map, our model
learns to synthesize a corresponding image from different viewpoints.
Proposed DS-NeRF (Depth-supervised Neural Radiance Fields), a model for learning neural radiance
fields that takes advantage of depth supervised by 3D point clouds.
Defined and addressed a new question of unsupervised audiovisual synthesis -- input the audio of a
random individual and then output the talking-head video with audio in the style of another target
speaker.