Mingyu Ding

Orcid: 0000-0001-6556-8359

According to our database1, Mingyu Ding authored at least 85 papers between 2014 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
RoadBEV: Road Surface Reconstruction in Bird's Eye View.
IEEE Trans. Intell. Transp. Syst., November, 2024

DrPlanner: Diagnosis and Repair of Motion Planners for Automated Vehicles Using Large Language Models.
IEEE Robotics Autom. Lett., October, 2024

Context Autoencoder for Self-supervised Representation Learning.
Int. J. Comput. Vis., January, 2024

DCP: Learning Accelerator Dataflow for Neural Network via Propagation.
CoRR, 2024

Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-Making Framework.
CoRR, 2024

TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection.
CoRR, 2024

P2 Explore: Efficient Exploration in Unknown Clustered Environment with Floor Plan Prediction.
CoRR, 2024

Embodiment-Agnostic Action Planning via Object-Part Scene Flow.
CoRR, 2024

DSLO: Deep Sequence LiDAR Odometry Based on Inconsistent Spatio-temporal Propagation.
CoRR, 2024

Compositional Physical Reasoning of Objects and Events from Videos.
CoRR, 2024

WOMD-Reasoning: A Large-Scale Language Dataset for Interaction and Driving Intentions Reasoning.
CoRR, 2024

Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning.
CoRR, 2024

Q-SLAM: Quadric Representations for Monocular SLAM.
CoRR, 2024

DrPlanner: Diagnosis and Repair of Motion Planners Using Large Language Models.
CoRR, 2024

PhyGrasp: Generalizing Robotic Grasping with Physics-informed Large Multimodal Models.
CoRR, 2024

RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation.
CoRR, 2024

Depth-aware Volume Attention for Texture-less Stereo Matching.
CoRR, 2024

Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

VDT: General-purpose Video Diffusion Transformers via Mask Modeling.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Tree-Planner: Efficient Close-loop Task Planning with Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval.
Mach. Intell. Res., August, 2023

Understanding Self-Supervised Pretraining with Part-Aware Representation Learning.
Trans. Mach. Learn. Res., 2023

Quadric Representations for LiDAR Odometry, Mapping and Localization.
IEEE Robotics Autom. Lett., 2023

NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields.
IEEE Robotics Autom. Lett., 2023

A Survey of Reasoning with Foundation Models.
CoRR, 2023

Interfacing Foundation Models' Embeddings.
CoRR, 2023

EgoPlan-Bench: Benchmarking Egocentric Embodied Planning with Multimodal Large Language Models.
CoRR, 2023

LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving.
CoRR, 2023

Human-oriented Representation Learning for Robotic Manipulation.
CoRR, 2023

Generalizable Long-Horizon Manipulations with Large Language Models.
CoRR, 2023

RSRD: A Road Surface Reconstruction Dataset and Benchmark for Safe and Comfortable Autonomous Driving.
CoRR, 2023

Pre-training on Synthetic Driving Data for Trajectory Prediction.
CoRR, 2023

An Efficient General-Purpose Modular Vision Model via Multi-Task Heterogeneous Training.
CoRR, 2023

VDT: An Empirical Study on Video Diffusion with Transformers.
CoRR, 2023

EC^2: Emergent Communication for Embodied Control.
CoRR, 2023

Doubly-Robust Self-Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Towards Free Data Selection with General-Purpose Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners.
Proceedings of the International Conference on Machine Learning, 2023

Planning with Large Language Models for Code Generation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

TextPSG: Panoptic Scene Graph Generation from Textual Descriptions.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

EC<sup>2</sup>: Emergent Communication for Embodied Control.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Mod-Squad: Designing Mixture of Experts As Modular Multi-Task Learners.
CoRR, 2022

NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields.
CoRR, 2022

Multimodal foundation models are better simulators of the human brain.
CoRR, 2022

LGDN: Language-Guided Denoising Network for Video-Language Modeling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer.
Proceedings of the International Conference on Machine Learning, 2022

Learning Versatile Neural Architectures by Propagating Network Codes.
Proceedings of the Tenth International Conference on Learning Representations, 2022

ComPhy: Compositional Physical Reasoning of Objects and Events from Videos.
Proceedings of the Tenth International Conference on Learning Representations, 2022

DaViT: Dual Attention Vision Transformers.
Proceedings of the Computer Vision, 2022

Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following.
Proceedings of the Conference on Robot Learning, 2022

2021
Affimer-Based Europium Chelates Allow Sensitive Optical Biosensing in a Range of Human Disease Biomarkers.
Sensors, 2021

Domain-Adaptive Few-Shot Learning.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Compressed Video Contrastive Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Self-Supervised Video Representation Learning with Constrained Spatiotemporal Jigsaw.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

IEPT: Instance-Level and Episode-Level Pretext Tasks for Few-Shot Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

L2M-GAN: Learning To Manipulate Latent Space Semantics for Facial Attribute Editing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

HR-NAS: Searching Efficient High-Resolution Neural Architectures With Lightweight Transformers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

A Global Occlusion-Aware Approach to Self-Supervised Monocular Visual Odometry.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Domain-Adaptive Few-Shot Learning.
CoRR, 2020

SegVoxelNet: Exploring Semantic Context and Depth-aware Features for 3D Vehicle Detection from Point Cloud.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Dense Hybrid Recurrent Multi-view Stereo Net with Dynamic Consistency Checking.
Proceedings of the Computer Vision - ECCV 2020, 2020

Segmenting Transparent Objects in the Wild.
Proceedings of the Computer Vision - ECCV 2020, 2020

Lightweight Action Recognition in Compressed Videos.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Learning Depth-Guided Convolutions for Monocular 3D Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Cross-domain mapping learning for transductive zero-shot learning.
Comput. Vis. Image Underst., 2019

CamNet: Coarse-to-Fine Retrieval for Camera Re-Localization.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Face-Focused Cross-Stream Network for Deception Detection in Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Domain-Invariant Projection Learning for Zero-Shot Recognition.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

InsightGAN: Semi-Supervised Feature Learning with Generative Adversarial Network for Drug Abuse Detection.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

Zero-Shot Learning with Superclasses.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

DeepInsight: Multi-Task Multi-Scale Deep Learning for Mental Disorder Diagnosis.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
One-Step Facile Synthesis of Aptamer-Modified Graphene Oxide for Highly Specific Enrichment of Human A-Thrombin in Plasma.
Sensors, 2017

2015
Research on the Interaction of Hydrogen-Bond Acidic Polymer Sensitive Sensor Materials with Chemical Warfare Agents Simulants by Inverse Gas Chromatography.
Sensors, 2015

2014
Portable Solid Phase Micro-Extraction Coupled with Ion Mobility Spectrometry System for On-Site Analysis of Chemical Warfare Agents and Simulants in Water Samples.
Sensors, 2014


  Loading...