Hao Dong

Orcid: 0000-0002-7984-9909

Affiliations:
  • Peking University, PKU-Agibot Lab, Center on Frontiers of Computing Studies, School of Computer Science, Beijing, China
  • Imperial College London, Department of Computing, UK (former)


According to our database1, Hao Dong authored at least 127 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
LVDiffusor: Distilling Functional Rearrangement Priors From Large Models Into Diffusor.
IEEE Robotics Autom. Lett., October, 2024

RGBGrasp: Image-Based Object Grasping by Capturing Multiple Views During Robot arm Movement With Neural Radiance Fields.
IEEE Robotics Autom. Lett., June, 2024

Bi-DexHands: Towards Human-Level Bimanual Dexterous Manipulation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Regression-Guided Refocusing Learning With Feature Alignment for Remote Sensing Tiny Object Detection.
IEEE Trans. Geosci. Remote. Sens., 2024

Adaptive Dynamic Label Assignment for Tiny Object Detection in Aerial Images.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

GarmentLab: A Unified Simulation and Benchmark for Garment Manipulation.
CoRR, 2024

MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-object Demand-driven Navigation.
CoRR, 2024

EqvAfford: SE(3) Equivariance for Point-Level Affordance Learning.
CoRR, 2024

TARGO: Benchmarking Target-driven Object Grasping under Occlusions.
CoRR, 2024

Human-centered In-building Embodied Delivery Benchmark.
CoRR, 2024

SpatialBot: Precise Spatial Understanding with Vision Language Models.
CoRR, 2024

AIC MLLM: Autonomous Interactive Correction MLLM for Robust Robotic Manipulation.
CoRR, 2024

GFPack++: Improving 2D Irregular Packing by Learning Gradient Field with Attention.
CoRR, 2024

A3VLM: Actionable Articulation-Aware Vision Language Model.
CoRR, 2024

InstructNav: Zero-shot System for Generic Instruction Navigation in Unexplored Environment.
CoRR, 2024

Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking.
CoRR, 2024

Broadcasting Support Relations Recursively from Local Dynamics for Object Retrieval in Clutters.
CoRR, 2024

Learning Manipulation by Predicting Interaction.
CoRR, 2024

UniGarmentManip: A Unified Framework for Category-Level Garment Manipulation via Dense Visual Correspondence.
CoRR, 2024

PreAfford: Universal Affordance-Based Pre-Grasping for Diverse Objects and Environments.
CoRR, 2024

SCANet: Correcting LEGO Assembly Errors with Self-Correct Assembly Network.
CoRR, 2024

UniDexFPM: Universal Dexterous Functional Pre-grasp Manipulation Via Diffusion Policy.
CoRR, 2024

ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models.
CoRR, 2024

NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual Manipulation.
CoRR, 2024

UniDoorManip: Learning Universal Door Manipulation Policy Over Large-scale and Diverse Door Manipulation Environments.
CoRR, 2024

Articulated Object Manipulation with Coarse-to-fine Affordance for Mitigating the Effect of Point Cloud Noise.
CoRR, 2024

Pattern4Ego: Learning Egocentric Video Representation Using Cross-video Activity Patterns.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Discuss Before Moving: Visual Language Navigation via Multi-expert Discussions.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Articulated Object Manipulation with Coarse-to-fine Affordance for Mitigating the Effect of Point Cloud Noise.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

RGBManip: Monocular Image-based Robotic Manipulation through Active Object Pose Estimation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Personalize Segment Anything Model with One Shot.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

No Time to Train: Empowering Non-Parametric Networks for Few-Shot 3D Scene Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Learning Semantic-Agnostic and Spatial-Aware Representation for Generalizable Visual-Audio Navigation.
IEEE Robotics Autom. Lett., June, 2023

P2FEViT: Plug-and-Play CNN Feature Embedded Hybrid Vision Transformer for Remote Sensing Image Classification.
Remote. Sens., April, 2023

Posterior Instance Injection Detector for Arbitrary-Oriented Object Detection From Optical Remote-Sensing Imagery.
IEEE Trans. Geosci. Remote. Sens., 2023

Object-Centric Masked Image Modeling-Based Self-Supervised Pretraining for Remote Sensing Object Detection.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2023

MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library.
J. Mach. Learn. Res., 2023

A Survey of Reasoning with Foundation Models.
CoRR, 2023

Distilling Functional Rearrangement Priors from Large Models.
CoRR, 2023

ImageManip: Image-based Robotic Manipulation with Affordance-guided Next View Selection.
CoRR, 2023

Improving Compositional Text-to-image Generation with Large Vision-Language Models.
CoRR, 2023

Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of Protein Simulators.
CoRR, 2023

Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks.
CoRR, 2023

GenPose: Generative Category-level Object Pose Estimation via Diffusion Models.
CoRR, 2023

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model.
CoRR, 2023

Personalize Segment Anything Model with One Shot.
CoRR, 2023

Learning a Universal Human Prior for Dexterous Manipulation from Human Preference.
CoRR, 2023

Plan4MC: Skill Reinforcement Learning and Planning for Open-World Minecraft Tasks.
CoRR, 2023

Learning Gradient Fields for Scalable and Generalizable Irregular Packing.
Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

Generative Category-level Object Pose Estimation via Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Score-based Grasping Primitive for Human-assisting Dexterous Grasping.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Environment-Aware Affordance for 3D Articulated Object Manipulation under Occlusions.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

RLAfford: End-to-End Affordance Learning for Robotic Manipulation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

DualAfford: Learning Collaborative Visual Affordance for Dual-gripper Manipulation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Leveraging SE(3) Equivariance for Learning 3D Geometric Shape Assembly.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Foresightful Dense Visual Affordance for Deformable Object Manipulation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation from Image Sequence.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PartManip: Learning Cross-Category Generalizable Part Manipulation Policy from Point Cloud Observations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GFPose: Learning 3D Human Pose Prior with Gradient Fields.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Part Motion of Articulated Objects Using Spatially Continuous Neural Implicit Representations.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Score-PA: Score-based 3D Part Assembly.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Resilient Binary Neural Network.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Consecutive Pre-Training: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain.
Remote. Sens., 2022

Hierarchical Disentangling Network for Building Extraction from Very High Resolution Optical Remote Sensing Imagery.
Remote. Sens., 2022

MARLlib: Extending RLlib for Multi-agent Reinforcement Learning.
CoRR, 2022

End-to-End Affordance Learning for Robotic Manipulation.
CoRR, 2022

Collaboration of Pre-trained Models Makes Better Few-shot Learner.
CoRR, 2022

TarGF: Learning Target Gradient Field for Object Rearrangement.
CoRR, 2022

Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL.
CoRR, 2022

Fully Decentralized Model-based Policy Optimization for Networked Systems.
CoRR, 2022

Consecutive Pretraining: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain.
CoRR, 2022

DualAfford: Learning Collaborative Visual Affordance for Dual-gripper Object Manipulation.
CoRR, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.
CoRR, 2022

GraspARL: Dynamic Grasping via Adversarial Reinforcement Learning.
CoRR, 2022

Edge-enhanced dual discriminator generative adversarial network for fast MRI with parallel imaging using multi-view information.
Appl. Intell., 2022

TarGF: Learning Target Gradient Field to Rearrange Objects without Explicit Goal Specification.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Scalable Model-based Policy Optimization for Decentralized Networked Systems.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Adaptive Local Context Embedding for Small Vehicle Detection from Aerial Optical Remote Sensing Images.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2022

VAT-Mart: Learning Visual Action Trajectory Proposals for Manipulating 3D ARTiculated Objects.
Proceedings of the Tenth International Conference on Learning Representations, 2022

AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-Shot Interactions.
Proceedings of the Computer Vision - ECCV 2022, 2022

Domain Randomization-Enhanced Depth Simulation and Restoration for Perceiving and Grasping Specular and Transparent Objects.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Bilateral Asymmetry Guided Counterfactual Generating Network for Mammogram Classification.
IEEE Trans. Image Process., 2021

RoboAssembly: Learning Generalizable Furniture Assembly Policy in a Novel Multi-robot Contact-rich Simulation Environment.
CoRR, 2021

Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning.
CoRR, 2021

Product semantics translation from brain activity via adversarial learning.
CoRR, 2021

Robotic Visuomotor Control with Unsupervised Forward Model Learned from Videos.
CoRR, 2021


Fast and Flexible Human Pose Estimation with HyperPose.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Efficient Reinforcement Learning Development with RLzoo.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

DMotion: Robotic Visuomotor Control with Unsupervised Forward Model Learned from Videos.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Tensorlayer 3.0: A Deep Learning Library Compatible With Multiple Backends.
Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021

Contrastive Multimodal Fusion with TupleInfoNCE.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

End-to-End Object Detection with Adaptive Clustering Transformer.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
P4Contrast: Contrastive Learning with Pairs of Point-Pixel Pairs for RGB-D Scene Understanding.
CoRR, 2020

End-to-End Object Detection with Adaptive Clustering Transformer.
CoRR, 2020

RLzoo: A Comprehensive and Adaptive Reinforcement Learning Library.
CoRR, 2020

Generative 3D Part Assembly via Dynamic Graph Learning.
CoRR, 2020

Role-Wise Data Augmentation for Knowledge Distillation.
CoRR, 2020

Generative 3D Part Assembly via Dynamic Graph Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Unpaired Image-to-Image Translation Using Adversarial Consistency Loss.
Proceedings of the Computer Vision - ECCV 2020, 2020

Lyapunov-Based Reinforcement Learning for Decentralized Multi-agent Control.
Proceedings of the Distributed Artificial Intelligence - Second International Conference, 2020

2019
Practical deep learning.
PhD thesis, 2019

An artificial intelligence based data-driven approach for design ideation.
J. Vis. Commun. Image Represent., 2019

DLGAN: Disentangling Label-Specific Fine-Grained Features for Image Manipulation.
CoRR, 2019

SIMGAN: Photo-Realistic Semantic Image Manipulation Using Generative Adversarial Networks.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Generative Creativity: Adversarial Learning for Bionic Design.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Image Processing, 2019

2018
DAGAN: Deep De-Aliasing Generative Adversarial Networks for Fast Compressed Sensing MRI Reconstruction.
IEEE Trans. Medical Imaging, 2018

Dropping Activation Outputs With Localized First-Layer Deep Network for Enhancing User Privacy and Data Security.
IEEE Trans. Inf. Forensics Secur., 2018

Text-to-Image Synthesis via Visual-Memory Creative Adversarial Network.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

2017
Deep De-Aliasing for Fast Compressive Sensing MRI.
CoRR, 2017

Unsupervised Image-to-Image Translation with Generative Adversarial Networks.
CoRR, 2017

TensorLayer: A Versatile Library for Efficient Deep Learning Development.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Automatic Brain Tumor Detection and Segmentation Using U-Net Based Fully Convolutional Networks.
Proceedings of the Medical Image Understanding and Analysis - 21st Annual Conference, 2017

I2T2I: Learning text to image synthesis with textual data augmentation.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Semantic Image Synthesis via Adversarial Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Survey on Feature Extraction and Applications of Biosignals.
Proceedings of the Machine Learning for Health Informatics, 2016

DropNeuron: Simplifying the Structure of Deep Neural Networks.
CoRR, 2016

Mixed Neural Network Approach for Temporal Sleep Stage Classification.
CoRR, 2016

A new soft material based in-the-ear EEG recording technique.
Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2016


  Loading...