Lu Sheng

Orcid: 0000-0002-8525-9163

According to our database, Lu Sheng authored at least 80 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

3D Reconstruction From a Single Sketch via View-Dependent Depth Sampling.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

WorldSimBench: Towards Video Generation Models as World Simulators.
CoRR, 2024

Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion.
CoRR, 2024

From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation.
CoRR, 2024

RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents.
CoRR, 2024

Assessment of Multimodal Large Language Models in Alignment with Human Values.
CoRR, 2024

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control.
CoRR, 2024

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities.
CoRR, 2024

Self-Supervised Monocular Depth Estimation in the Dark: Towards Data Distribution Compensation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Data-Free Generalized Zero-Shot Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Toward Explainable 3D Grounded Visual Question Answering: A New Benchmark and Strong Baseline.
IEEE Trans. Circuits Syst. Video Technol., June, 2023

Guest Editorial: Special Issue on Machine Learning and Signal Processing.
J. Signal Process. Syst., January, 2023

ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models.
CoRR, 2023

Octavius: Mitigating Task Interference in MLLMs via MoE.
CoRR, 2023

Stable Diffusion Reference Only: Image Prompt and Blueprint Jointly Guided Multi-Condition Diffusion Model for Secondary Painting.
CoRR, 2023

Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter.
CoRR, 2023

LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Distortion-aware Transformer in 360° Salient Object Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Siamese DETR.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
VPU: A Video-Based Point Cloud Upsampling Framework.
IEEE Trans. Image Process., 2022

Towards Explainable 3D Grounded Visual Question Answering: A New Benchmark and Strong Baseline.
CoRR, 2022

X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation.
CoRR, 2022

Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy.
CoRR, 2022

Improving RGB-D Point Cloud Registration by Learning Multi-scale Local Linear Transformation.
Proceedings of the Computer Vision - ECCV 2022, 2022

X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation.
Proceedings of the Computer Vision - ECCV 2022, 2022

SketchSampler: Sketch-Based 3D Reconstruction via View-Dependent Depth Sampling.
Proceedings of the Computer Vision - ECCV 2022, 2022

3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DanceFormer: Music Conditioned 3D Dance Generation with Parametric Motion Transformer.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Motion Compensated Virtual View Synthesis Using Novel Particle Cell.
IEEE Trans. Multim., 2021

PCG-TAL: Progressive Cross-Granularity Cooperation for Temporal Action Localization.
IEEE Trans. Image Process., 2021

Transformer3D-Det: Improving 3D Object Detection by Vote Refinement.
IEEE Trans. Circuits Syst. Video Technol., 2021

Sequential Point Cloud Upsampling by Exploiting Multi-Scale Temporal Dependency.
IEEE Trans. Circuits Syst. Video Technol., 2021

ForgeryNet - Face Forgery Analysis Challenge 2021: Methods and Results.
CoRR, 2021

DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer.
CoRR, 2021

IncreACO: Incrementally Learned Automatic Check-out with Photorealistic Exemplar Augmentation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

VoteHMR: Occlusion-Aware Voting Network for Robust 3D Human Mesh Recovery from Partial Point Clouds.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

StyleFormer: Real-time Arbitrary Style Transfer via Parametric Style Composition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Back-Tracing Representative Points for Voting-Based 3D Object Detection in Point Clouds.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
High-Quality Video Generation from Static Structural Annotations.
Int. J. Comput. Vis., 2020

PV-NAS: Practical Neural Architecture Search for Video Recognition.
CoRR, 2020

Adaptive Gradient Method with Resilience and Momentum.
CoRR, 2020

Unsupervised Domain Expansion from Multiple Sources.
CoRR, 2020

Thinking in Frequency: Face Forgery Detection by Mining Frequency-Aware Clues.
Proceedings of the Computer Vision - ECCV 2020, 2020

Powering One-Shot Topological NAS with Stabilized Share-Parameter Proxy.
Proceedings of the Computer Vision - ECCV 2020, 2020

Morphing and Sampling Network for Dense Point Cloud Completion.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Bags of tricks for learning depth and camera motion from monocular videos.
Virtual Real. Intell. Hardw., 2019

Cascaded regression using landmark displacement for 3D face reconstruction.
Pattern Recognit. Lett., 2019

Visibility Constrained Generative Model for Depth-Based 3D Facial Pose Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Unsupervised Bi-directional Flow-based Video Generation from one Snapshot.
CoRR, 2019

CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Improving Pedestrian Attribute Recognition With Weakly-Supervised Multi-Scale Attribute-Specific Localization.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Unsupervised Collaborative Learning of Keyframe Detection and Visual Odometry Towards Monocular Deep SLAM.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Context and Attribute Grounded Dense Captioning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Semantics Disentangling for Text-To-Image Generation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Video Generation From Single Semantic Label Map.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Spatio-Temporal Disocclusion Filling Using Novel Sprite Cells.
IEEE Trans. Multim., 2018

Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition.
Proceedings of the Computer Vision - ECCV 2018, 2018

Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Avatar-Net: Multi-Scale Zero-Shot Style Transfer by Feature Decoration.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Exploring Disentangled Feature Representation Beyond Face Identification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis.
Proceedings of the IEEE International Conference on Computer Vision, 2017

A Generative Model for Depth-Based Robust 3D Facial Pose Tracking.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Real-Time Head Pose Tracking with Online Face Template Reconstruction.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

2015
Online Temporally Consistent Indoor Depth Video Enhancement via Static Structure.
IEEE Trans. Image Process., 2015

A disocclusion filling method using multiple sprites with depth for virtual view synthesis.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

2014
Temporal depth video enhancement based on intrinsic static structure.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Screen-camera calibration using a thread.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Accelerating the Distribution Estimation for the Weighted Median/Mode Filters.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
A Head Pose Tracking System Using RGB-D Camera.
Proceedings of the Computer Vision Systems - 9th International Conference, 2013

Depth enhancement based on hybrid geometric hole filling strategy.
Proceedings of the IEEE International Conference on Image Processing, 2013

