Hao Dong

ORCID: 0000-0002-7984-9909

Affiliations:

Peking University, PKU-Agibot Lab, Center on Frontiers of Computing Studies, School of Computer Science, Beijing, China
Imperial College London, Department of Computing, UK (former)

According to our database, Hao Dong authored at least 127 papers between 2016 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2024

LVDiffusor: Distilling Functional Rearrangement Priors From Large Models Into Diffusor.

IEEE Robotics Autom. Lett., October, 2024

RGBGrasp: Image-Based Object Grasping by Capturing Multiple Views During Robot Arm Movement With Neural Radiance Fields.

IEEE Robotics Autom. Lett., June, 2024

Bi-DexHands: Towards Human-Level Bimanual Dexterous Manipulation.

IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Regression-Guided Refocusing Learning With Feature Alignment for Remote Sensing Tiny Object Detection.

IEEE Trans. Geosci. Remote. Sens., 2024

Adaptive Dynamic Label Assignment for Tiny Object Detection in Aerial Images.

IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

GarmentLab: A Unified Simulation and Benchmark for Garment Manipulation.

CoRR, 2024

MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-object Demand-driven Navigation.

CoRR, 2024

EqvAfford: SE(3) Equivariance for Point-Level Affordance Learning.

CoRR, 2024

TARGO: Benchmarking Target-driven Object Grasping under Occlusions.

CoRR, 2024

Human-centered In-building Embodied Delivery Benchmark.

CoRR, 2024

SpatialBot: Precise Spatial Understanding with Vision Language Models.

CoRR, 2024

AIC MLLM: Autonomous Interactive Correction MLLM for Robust Robotic Manipulation.

CoRR, 2024

GFPack++: Improving 2D Irregular Packing by Learning Gradient Field with Attention.

CoRR, 2024

A3VLM: Actionable Articulation-Aware Vision Language Model.

CoRR, 2024

InstructNav: Zero-shot System for Generic Instruction Navigation in Unexplored Environment.

CoRR, 2024

Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking.

CoRR, 2024

Broadcasting Support Relations Recursively from Local Dynamics for Object Retrieval in Clutters.

CoRR, 2024

Learning Manipulation by Predicting Interaction.

CoRR, 2024

UniGarmentManip: A Unified Framework for Category-Level Garment Manipulation via Dense Visual Correspondence.

CoRR, 2024

PreAfford: Universal Affordance-Based Pre-Grasping for Diverse Objects and Environments.

CoRR, 2024

SCANet: Correcting LEGO Assembly Errors with Self-Correct Assembly Network.

CoRR, 2024

UniDexFPM: Universal Dexterous Functional Pre-grasp Manipulation Via Diffusion Policy.

CoRR, 2024

ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models.

CoRR, 2024

NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual Manipulation.

CoRR, 2024

UniDoorManip: Learning Universal Door Manipulation Policy Over Large-scale and Diverse Door Manipulation Environments.

CoRR, 2024

Articulated Object Manipulation with Coarse-to-fine Affordance for Mitigating the Effect of Point Cloud Noise.

CoRR, 2024

Pattern4Ego: Learning Egocentric Video Representation Using Cross-video Activity Patterns.

Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Discuss Before Moving: Visual Language Navigation via Multi-expert Discussions.

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Articulated Object Manipulation with Coarse-to-fine Affordance for Mitigating the Effect of Point Cloud Noise.

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill.

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

RGBManip: Monocular Image-based Robotic Manipulation through Active Object Pose Estimation.

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Personalize Segment Anything Model with One Shot.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

No Time to Train: Empowering Non-Parametric Networks for Few-Shot 3D Scene Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Learning Semantic-Agnostic and Spatial-Aware Representation for Generalizable Visual-Audio Navigation.

IEEE Robotics Autom. Lett., June, 2023

P2FEViT: Plug-and-Play CNN Feature Embedded Hybrid Vision Transformer for Remote Sensing Image Classification.

Remote. Sens., April, 2023

Posterior Instance Injection Detector for Arbitrary-Oriented Object Detection From Optical Remote-Sensing Imagery.

IEEE Trans. Geosci. Remote. Sens., 2023

Object-Centric Masked Image Modeling-Based Self-Supervised Pretraining for Remote Sensing Object Detection.

IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2023

MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library.

J. Mach. Learn. Res., 2023

A Survey of Reasoning with Foundation Models.

CoRR, 2023

Distilling Functional Rearrangement Priors from Large Models.

CoRR, 2023

ImageManip: Image-based Robotic Manipulation with Affordance-guided Next View Selection.

CoRR, 2023

Improving Compositional Text-to-image Generation with Large Vision-Language Models.

CoRR, 2023

Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of Protein Simulators.

CoRR, 2023

Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks.

CoRR, 2023

GenPose: Generative Category-level Object Pose Estimation via Diffusion Models.

Jiyao Zhang, Mingdong Wu, Hao Dong

CoRR, 2023

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model.

CoRR, 2023

Personalize Segment Anything Model with One Shot.

CoRR, 2023

Learning a Universal Human Prior for Dexterous Manipulation from Human Preference.

CoRR, 2023

Plan4MC: Skill Reinforcement Learning and Planning for Open-World Minecraft Tasks.

CoRR, 2023

Learning Gradient Fields for Scalable and Generalizable Irregular Packing.

Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

Generative Category-level Object Pose Estimation via Diffusion Models.

Jiyao Zhang, Mingdong Wu, Hao Dong

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Score-based Grasping Primitive for Human-assisting Dexterous Grasping.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Environment-Aware Affordance for 3D Articulated Object Manipulation under Occlusions.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

RLAfford: End-to-End Affordance Learning for Robotic Manipulation.

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

DualAfford: Learning Collaborative Visual Affordance for Dual-gripper Manipulation.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Leveraging SE(3) Equivariance for Learning 3D Geometric Shape Assembly.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Foresightful Dense Visual Affordance for Deformable Object Manipulation.

Ruihai Wu, Chuanruo Ning, Hao Dong

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation from Image Sequence.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PartManip: Learning Cross-Category Generalizable Part Manipulation Policy from Point Cloud Observations.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GFPose: Learning 3D Human Pose Prior with Gradient Fields.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Part Motion of Articulated Objects Using Spatially Continuous Neural Implicit Representations.

Proceedings of the 34th British Machine Vision Conference 2023, 2023

Score-PA: Score-based 3D Part Assembly.

Proceedings of the 34th British Machine Vision Conference 2023, 2023

Resilient Binary Neural Network.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Consecutive Pre-Training: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain.

Remote. Sens., 2022

Hierarchical Disentangling Network for Building Extraction from Very High Resolution Optical Remote Sensing Imagery.

Remote. Sens., 2022

MARLlib: Extending RLlib for Multi-agent Reinforcement Learning.

CoRR, 2022

End-to-End Affordance Learning for Robotic Manipulation.

CoRR, 2022

Collaboration of Pre-trained Models Makes Better Few-shot Learner.

CoRR, 2022

TarGF: Learning Target Gradient Field for Object Rearrangement.

CoRR, 2022

Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL.

CoRR, 2022

Fully Decentralized Model-based Policy Optimization for Networked Systems.

CoRR, 2022

Consecutive Pretraining: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain.

CoRR, 2022

DualAfford: Learning Collaborative Visual Affordance for Dual-gripper Object Manipulation.

CoRR, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.

Stephen Marcus McAleer, Hao Dong, Zongqing Lu, Song-Chun Zhu

CoRR, 2022

GraspARL: Dynamic Grasping via Adversarial Reinforcement Learning.

CoRR, 2022

Edge-enhanced dual discriminator generative adversarial network for fast MRI with parallel imaging using multi-view information.

Appl. Intell., 2022

TarGF: Learning Target Gradient Field to Rearrange Objects without Explicit Goal Specification.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Scalable Model-based Policy Optimization for Decentralized Networked Systems.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Adaptive Local Context Embedding for Small Vehicle Detection from Aerial Optical Remote Sensing Images.

Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2022

VAT-Mart: Learning Visual Action Trajectory Proposals for Manipulating 3D ARTiculated Objects.

Proceedings of the Tenth International Conference on Learning Representations, 2022

AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-Shot Interactions.

Proceedings of the Computer Vision - ECCV 2022, 2022

Domain Randomization-Enhanced Depth Simulation and Restoration for Perceiving and Grasping Specular and Transparent Objects.

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

Bilateral Asymmetry Guided Counterfactual Generating Network for Mammogram Classification.

IEEE Trans. Image Process., 2021

RoboAssembly: Learning Generalizable Furniture Assembly Policy in a Novel Multi-robot Contact-rich Simulation Environment.

CoRR, 2021

Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning.

CoRR, 2021

Product semantics translation from brain activity via adversarial learning.

CoRR, 2021

Robotic Visuomotor Control with Unsupervised Forward Model Learned from Videos.

CoRR, 2021

MyoChallenge 2022: Learning contact-rich manipulation using a musculoskeletal hand.

Proceedings of the NeurIPS 2022 Competition Track, 2021

Fast and Flexible Human Pose Estimation with HyperPose.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Efficient Reinforcement Learning Development with RLzoo.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

DMotion: Robotic Visuomotor Control with Unsupervised Forward Model Learned from Videos.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

TensorLayer 3.0: A Deep Learning Library Compatible With Multiple Backends.

Cheng Lai, Jiarong Han, Hao Dong

Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021

Contrastive Multimodal Fusion with TupleInfoNCE.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

End-to-End Object Detection with Adaptive Clustering Transformer.

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

P4Contrast: Contrastive Learning with Pairs of Point-Pixel Pairs for RGB-D Scene Understanding.

CoRR, 2020

End-to-End Object Detection with Adaptive Clustering Transformer.

CoRR, 2020

RLzoo: A Comprehensive and Adaptive Reinforcement Learning Library.

CoRR, 2020

Generative 3D Part Assembly via Dynamic Graph Learning.

CoRR, 2020

Role-Wise Data Augmentation for Knowledge Distillation.

CoRR, 2020

Generative 3D Part Assembly via Dynamic Graph Learning.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Unpaired Image-to-Image Translation Using Adversarial Consistency Loss.

Yihao Zhao, Ruihai Wu, Hao Dong

Proceedings of the Computer Vision - ECCV 2020, 2020

Lyapunov-Based Reinforcement Learning for Decentralized Multi-agent Control.

Qingrui Zhang, Hao Dong, Wei Pan

Proceedings of the Distributed Artificial Intelligence - Second International Conference, 2020

2019

Practical deep learning.

Hao Dong

PhD thesis, 2019

An artificial intelligence based data-driven approach for design ideation.

J. Vis. Commun. Image Represent., 2019

DLGAN: Disentangling Label-Specific Fine-Grained Features for Image Manipulation.

CoRR, 2019

SIMGAN: Photo-Realistic Semantic Image Manipulation Using Generative Adversarial Networks.

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Generative Creativity: Adversarial Learning for Bionic Design.

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Image Processing, 2019

2018

DAGAN: Deep De-Aliasing Generative Adversarial Networks for Fast Compressed Sensing MRI Reconstruction.

IEEE Trans. Medical Imaging, 2018

Dropping Activation Outputs With Localized First-Layer Deep Network for Enhancing User Privacy and Data Security.

IEEE Trans. Inf. Forensics Secur., 2018

Text-to-Image Synthesis via Visual-Memory Creative Adversarial Network.

Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

2017

Deep De-Aliasing for Fast Compressive Sensing MRI.

CoRR, 2017

Unsupervised Image-to-Image Translation with Generative Adversarial Networks.

CoRR, 2017

TensorLayer: A Versatile Library for Efficient Deep Learning Development.

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Automatic Brain Tumor Detection and Segmentation Using U-Net Based Fully Convolutional Networks.

Proceedings of the Medical Image Understanding and Analysis - 21st Annual Conference, 2017

I2T2I: Learning text to image synthesis with textual data augmentation.

Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Semantic Image Synthesis via Adversarial Learning.

Proceedings of the IEEE International Conference on Computer Vision, 2017

2016

Survey on Feature Extraction and Applications of Biosignals.

Proceedings of the Machine Learning for Health Informatics, 2016

DropNeuron: Simplifying the Structure of Deep Neural Networks.

Wei Pan, Hao Dong, Yike Guo

CoRR, 2016

Mixed Neural Network Approach for Temporal Sleep Stage Classification.

CoRR, 2016

A new soft material based in-the-ear EEG recording technique.

Hao Dong, Paul M. Matthews, Yike Guo

Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2016
