2025
MAG: Multi-Modal Aligned Autoregressive Co-Speech Gesture Generation without Vector Quantization.
CoRR, March, 2025

VRsketch2Gaussian: 3D VR Sketch Guided 3D Object Generation with Gaussian Splatting.
CoRR, March, 2025

2024
GaussianGrasper: 3D Language Gaussian Splatting for Open-Vocabulary Robotic Grasping.
IEEE Robotics Autom. Lett., September, 2024

DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model.
CoRR, 2024

HE-Drive: Human-Like End-to-End Driving with Vision Language Models.
CoRR, 2024

Text2Street: Controllable Text-to-Image Generation for Street Views.
CoRR, 2024

Text2Street: Controllable Text-to-Image Generation for Street Views.
Proceedings of the 27th International Conference on Pattern Recognition, 2024

Structured-NeRF: Hierarchical Scene Graph with Neural Representation.
Proceedings of the European Conference on Computer Vision (ECCV 2024), 2024

2023
ASSIST: Interactive Scene Nodes for Scalable and Realistic Indoor Simulation.
CoRR, 2023

2022
A Multi-Granularity Information-Based Method for Learning High-Dimensional Bayesian Network Structures.
Cogn. Comput., 2022