Hang Xu
Orcid: 0000-0003-3645-8972Affiliations:
- Huawei Noah's Ark Lab, Shanghai, China
- Hong Kong University (PhD 2018)
According to our database1,
Hang Xu
authored at least 152 papers
between 2018 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
2024
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024
Fine-Grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection.
IEEE Trans. Neural Networks Learn. Syst., November, 2024
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024
CoRR, 2024
EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation.
CoRR, 2024
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models.
CoRR, 2024
CoRR, 2024
Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection.
CoRR, 2024
CoRR, 2024
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning.
CoRR, 2024
From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs.
CoRR, 2024
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models.
CoRR, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Reason2Drive: Towards Interpretable and Chain-Based Reasoning for Autonomous Driving.
Proceedings of the Computer Vision - ECCV 2024, 2024
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation.
Proceedings of the Computer Vision - ECCV 2024, 2024
LayerDiff: Exploring Text-Guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-Fine Pose-Reversible Guidance.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Holistic Autonomous Driving Understanding by Bird'View Injected Multi-Modal Large Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
IEEE Trans. Cybern., 2023
CoRR, 2023
DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance.
CoRR, 2023
VideoAssembler: Identity-Consistent Video Generation with Reference Entities using Diffusion Model.
CoRR, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
HiLM-D: Towards High-Resolution Understanding in Multimodal Large Language Models for Autonomous Driving.
CoRR, 2023
MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation.
CoRR, 2023
CoRR, 2023
Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining.
CoRR, 2023
Road Genome: A Topology Reasoning Benchmark for Scene Understanding in Autonomous Driving.
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Translating Images to Road Network: A Non-Autoregressive Sequence-to-Sequence Approach.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
CLIP<sup>2</sup>: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
CoRR, 2022
CoRR, 2022
ZeroGen<sup>+</sup>: Self-Guided High-Quality Data Generation in Efficient Zero-Shot Learning.
CoRR, 2022
Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework.
CoRR, 2022
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Open-World Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding.
Proceedings of the Computer Vision - ECCV 2022, 2022
CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving.
Proceedings of the Computer Vision - ECCV 2022, 2022
MPPNet: Multi-frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022
Continual Object Detection via Prototypical Task Correlation Guided Gating Mechanism.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Arch-Graph: Acyclic Architecture Relation Predictor for Task-Transferable Neural Architecture Search.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Visual-Language Navigation Pretraining via Prompt-based Environmental Self-exploration.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
CoRR, 2021
CoRR, 2021
Learning Transferable Features for Point Cloud Detection via 3D Contrastive Co-training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset for Autonomous Driving.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search.
Proceedings of the 9th International Conference on Learning Representations, 2021
Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-Modal Pretraining.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature Imitation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Exploring Geometry-aware Contrast and Clustering Harmonization for Self-supervised 3D Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
MultiSiam: Self-supervised Multi-instance Siamese Representation Learning for Autonomous Driving.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
C<sup>3</sup>-SemiSeg: Contrastive Semi-supervised Segmentation via Cross-set Learning and Dynamic Class-balancing.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Towards Dynamic and Scalable Active Learning with Neural Architecture Adaption for Object Detection.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Bridging the Gap between Sample-based and One-shot Neural Architecture Search with BONAS.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point Blending.
Proceedings of the Computer Vision - ECCV 2020, 2020
AABO: Adaptive Anchor Box Optimization for Object Detection via Bayesian Sub-sampling.
Proceedings of the Computer Vision - ECCV 2020, 2020
CATCH: Context-Based Meta Reinforcement Learning for Transferrable Architecture Search.
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
ElixirNet: Relation-Aware Network Architecture Adaptation for Medical Lesion Detection.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
EHSOD: CAM-Guided End-to-End Hybrid-Supervised Object Detection with Cascade Refinement.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Multi-objective Neural Architecture Search via Predictive Network Performance Optimization.
CoRR, 2019
Auto-FPN: Automatic Network Architecture Adaptation for Object Detection Beyond Classification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
Reasoning-RCNN: Unifying Adaptive Global Reasoning Into Large-Scale Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018