Hongyang Li

ORCID: 0000-0001-9110-5534

Affiliations:
  • University of Hong Kong
  • Shanghai AI Laboratory, China
  • University of California at San Diego, USA


According to our database, Hongyang Li authored at least 66 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking.
Int. J. Comput. Vis., May, 2024

Delving Into the Devils of Bird's-Eye-View Perception: A Review, Evaluation and Recipe.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation.
CoRR, 2024

Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation.
CoRR, 2024

Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving.
CoRR, 2024

NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking.
CoRR, 2024

Learning Manipulation by Predicting Interaction.
CoRR, 2024

Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability.
CoRR, 2024

3D Data Augmentation for Driving Scenes on Camera.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

LaneSegNet: Map Learning with Lane Segment Perception for Autonomous Driving.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Embodied Understanding of Driving Scenarios.
Proceedings of the Computer Vision - ECCV 2024, 2024

Fully Sparse 3D Occupancy Prediction.
Proceedings of the Computer Vision - ECCV 2024, 2024

Generalized Predictive Model for Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
HDGT: Heterogeneous Driving Graph Transformer for Multi-Agent Trajectory Prediction via Scene Encoding.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

DriveLM: Driving with Graph Visual Question Answering.
CoRR, 2023

A Survey of Reasoning with Foundation Models.
CoRR, 2023

Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future.
CoRR, 2023

LLM4Drive: A Survey of Large Language Models for Autonomous Driving.
CoRR, 2023

End-to-end Autonomous Driving: Challenges and Frontiers.
CoRR, 2023

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation.
CoRR, 2023

Road Genome: A Topology Reasoning Benchmark for Scene Understanding in Autonomous Driving.
CoRR, 2023

Topology Reasoning for Driving Scenes.
CoRR, 2023

Sparse Dense Fusion for 3D Object Detection.
CoRR, 2023

Geometric-aware Pretraining for Vision-centric 3D Object Detection.
CoRR, 2023

3D Data Augmentation for Driving Scenes on Camera.
CoRR, 2023

Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking.
CoRR, 2023

Policy Pre-training for End-to-end Autonomous Driving via Self-supervised Geometric Modeling.
CoRR, 2023

OpenLane-V2: A Topology Reasoning Benchmark for Unified 3D HD Mapping.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Policy Pre-training for Autonomous Driving via Self-supervised Geometric Modeling.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Scene as Occupancy.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Density-invariant Features for Distant Point Cloud Registration.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DriveAdapter: Breaking the Coupling Barrier of Perception and Planning in End-to-End Autonomous Driving.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Distilling Focal Knowledge from Imperfect Expert for 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Stare at What You See: Masked Image Modeling without Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Planning-oriented Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Goal-oriented Autonomous Driving.
CoRR, 2022

Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe.
CoRR, 2022

Level 2 Autonomous Driving on a Single Device: Diving into the Devils of Openpilot.
CoRR, 2022

BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers.
CoRR, 2022

Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

BEVFormer: Learning Bird's-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

ST-P3: End-to-End Vision-Based Autonomous Driving via Spatial-Temporal Feature Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark.
Proceedings of the Computer Vision - ECCV 2022, 2022

Align Representations with Base: A New Approach to Self-Supervised Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Towards Capturing the Temporal Dynamics for Trajectory Prediction: a Coarse-to-Fine Approach.
Proceedings of the Conference on Robot Learning, 2022

2020
Point-Set Anchors for Object Detection, Instance Segmentation and Pose Estimation.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Zoom Out-and-In Network with Map Attention Decision for Region Proposal and Object Detection.
Int. J. Comput. Vis., 2019

Feature Intertwiner for Object Detection.
Proceedings of the 7th International Conference on Learning Representations, 2019

Finding Task-Relevant Features for Few-Shot Learning by Category Traversal.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Neural Network Encapsulation.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Dual Deep Network for Visual Tracking.
IEEE Trans. Image Process., 2017

DeepID-Net: Object Detection with Deformable Part Based Convolutional Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

CNN for saliency detection with low-level feature integration.
Neurocomputing, 2017

Rethinking Feature Discrimination and Polymerization for Large-scale Recognition.
CoRR, 2017

Learning Deep Features via Congenerous Cosine Loss for Person Recognition.
CoRR, 2017

Zoom Out-and-In Network with Recursive Training for Object Proposal.
CoRR, 2017

Do we really need more training data for object localization.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Recurrent Scale Approximation for Object Detection in CNN.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Multi-Bias Non-linear Activation in Deep Neural Networks.
Proceedings of the 33rd International Conference on Machine Learning, 2016

2015
Inner and Inter Label Propagation: Salient Object Detection in the Wild.
IEEE Trans. Image Process., 2015

LCNN: Low-level Feature Embedded CNN for Salient Object Detection.
CoRR, 2015

Learning Deep Representation with Large-Scale Attributes.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

