Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation.
CoRR, March, 2025
Exploring Representation-Aligned Latent Space for Better Generation.
CoRR, February, 2025
Logit Normalization for Long-Tail Object Detection.
Int. J. Comput. Vis., June, 2024
Analyzing Nobel Prize Literature with Large Language Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding.
CoRR, 2024
CycleHOI: Improving Human-Object Interaction Detection with Cycle Consistency of Detection and Generation.
CoRR, 2024
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis.
CoRR, 2024
A study on the dynamic characteristics of the stiffened coupled plate with the effect of the dynamic vibration absorbers.
Comput. Math. Appl., 2024
Drag-A-Video: Non-rigid Video Editing with Point-based Interaction.
CoRR, 2023
Deep Equilibrium Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
StageInteractor: Query-based Object Detector with Cross-stage Interaction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Structured Sparse R-CNN for Direct Scene Graph Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Target Adaptive Context Aggregation for Video Scene Graph Generation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021