MAG: Multi-Modal Aligned Autoregressive Co-Speech Gesture Generation without Vector Quantization.
CoRR, March, 2025
VRsketch2Gaussian: 3D VR Sketch Guided 3D Object Generation with Gaussian Splatting.
CoRR, March, 2025
GaussianGrasper: 3D Language Gaussian Splatting for Open-Vocabulary Robotic Grasping.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
IEEE Robotics Autom. Lett., September, 2024
DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model.
CoRR, 2024
HE-Drive: Human-Like End-to-End Driving with Vision Language Models.
CoRR, 2024
Text2Street: Controllable Text-to-image Generation for Street Views.
CoRR, 2024
Text2Street: Controllable Text-to-Image Generation for Street Views.
Proceedings of the Pattern Recognition - 27th International Conference, 2024
Structured-NeRF: Hierarchical Scene Graph with Neural Representation.
Proceedings of the Computer Vision - ECCV 2024, 2024
ASSIST: Interactive Scene Nodes for Scalable and Realistic Indoor Simulation.
CoRR, 2023
A Multi-Granularity Information-Based Method for Learning High-Dimensional Bayesian Network Structures.
Cogn. Comput., 2022