Hao Li

Orcid: 0000-0001-6197-0674

Affiliations:
  • Alibaba Group, Hangzhou, Zhejiang, China


According to our database1, Hao Li authored at least 123 papers between 2007 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Dynamic gradient reactivation for backward compatible person re-identification.
Pattern Recognit., February, 2024

Relationships between brain structure-function coupling in normal aging and cognition: A cross-ethnicity population-based study.
NeuroImage, 2024

FaceVid-1K: A Large-Scale High-Quality Multiracial Human Face Video Dataset.
CoRR, 2024

GRPose: Learning Graph Relations for Human Image Generation with Pose Priors.
CoRR, 2024

Cascaded Temporal Updating Network for Efficient Video Super-Resolution.
CoRR, 2024

EVALALIGN: Supervised Fine-Tuning Multimodal LLMs with Human-Aligned Data for Evaluating Text-to-Image Models.
CoRR, 2024

Predicting functional outcome in ischemic stroke patients using genetic, environmental, and clinical factors: a machine learning analysis of population-based prospective cohort study.
Briefings Bioinform., 2024

A clinically actionable and explainable real-time risk assessment framework for stroke-associated pneumonia.
Artif. Intell. Medicine, 2024

EGGen: Image Generation with Multi-entity Prior Learning through Entity Guidance.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
What Limits the Performance of Local Self-attention?
Int. J. Comput. Vis., October, 2023

A Deep Learning System to Predict Recurrence and Disability Outcomes in Patients with Transient Ischemic Attack or Ischemic Stroke.
Adv. Intell. Syst., April, 2023

InfMLLM: A Unified Framework for Visual-Language Tasks.
CoRR, 2023

OVO: Open-Vocabulary Occupancy.
CoRR, 2023

Point-Teaching: Weakly Semi-supervised Object Detection with Point Annotations.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Cross-Batch Hard Example Mining With Pseudo Large Batch for ID vs. Spot Face Recognition.
IEEE Trans. Image Process., 2022

Multi-View Evolutionary Training for Unsupervised Domain Adaptive Person Re-Identification.
IEEE Trans. Inf. Forensics Secur., 2022

Contrastive Haze-Aware Learning for Dynamic Remote Sensing Image Dehazing.
IEEE Trans. Geosci. Remote. Sens., 2022

Class-Aware Feature Aggregation Network for Video Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022

Beyond Classifiers: Remote Sensing Change Detection with Metric Learning.
Remote. Sens., 2022

Towards a More Realistic and Detailed Deep-Learning-Based Radar Echo Extrapolation Method.
Remote. Sens., 2022

Refining pseudo labels for unsupervised Domain Adaptive Re-Identification.
Knowl. Based Syst., 2022

Revisiting instance search: A new benchmark using cycle self-training.
Neurocomputing, 2022

Implicit Semantic Augmentation for Distance Metric Learning in Domain Generalization.
CoRR, 2022

Semi-supervised Deep Multi-view Stereo.
CoRR, 2022

DLME: Deep Local-flatness Manifold Embedding.
CoRR, 2022

Point RCNN: An Angle-Free Framework for Rotated Object Detection.
CoRR, 2022

Architecture-Agnostic Masked Image Modeling - From ViT back to CNN.
CoRR, 2022

SwinVRNN: A Data-Driven Ensemble Forecasting Model via Learned Distribution Perturbation.
CoRR, 2022

An Empirical Study on Distribution Shift Robustness From the Perspective of Pre-Training and Data Augmentation.
CoRR, 2022

Improved Knowledge Distillation via Full Kernel Matrix Transfer.
Proceedings of the 2022 SIAM International Conference on Data Mining, 2022

VTC-LFC: Vision Transformer Compression with Low-Frequency Components.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Entropy-Driven Mixed-Precision Quantization for Deep Network Design.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improved Fine-Tuning by Better Leveraging Pre-Training Data.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MimCo: Masked Image Modeling Pre-training with Contrastive Teacher.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Semantic Data Augmentation based Distance Metric Learning for Domain Generalization.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation.
Proceedings of the HCMA@MM 2022: Proceedings of the 3rd International Workshop on Human-Centric Multimedia Analysis, 2022

Dependency-Aware Traffic Management for Configuring On-demand in Service Meshes.
Proceedings of the 28th IEEE International Conference on Parallel and Distributed Systems, 2022

MAE-DET: Revisiting Maximum Entropy Principle in Zero-Shot NAS for Efficient Object Detection.
Proceedings of the International Conference on Machine Learning, 2022

CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

GiraffeDet: A Heavy-Neck Paradigm for Object Detection.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Graph Convolution for Re-Ranking in Person Re-Identification.
Proceedings of the IEEE International Conference on Acoustics, 2022

Image-to-Video Re-Identification via Mutual Discriminative Knowledge Transfer.
Proceedings of the IEEE International Conference on Acoustics, 2022

Adaptive Matching Strategy for Multi-Target Multi-Camera Tracking.
Proceedings of the IEEE International Conference on Acoustics, 2022

Jmpnet: Joint Motion Prediction for Learning-Based Video Compression.
Proceedings of the IEEE International Conference on Acoustics, 2022

DLME: Deep Local-Flatness Manifold Embedding.
Proceedings of the Computer Vision - ECCV 2022, 2022

TransFGU: A Top-Down Approach to Fine-Grained Unsupervised Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

KVT: k-NN Attention for Boosting Vision Transformers.
Proceedings of the Computer Vision, 2022

Unstructured Feature Decoupling for Vehicle Re-identification.
Proceedings of the Computer Vision - ECCV 2022, 2022

Decoupling and Recoupling Spatiotemporal Representation for RGB-D-based Motion Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

An Efficient Training Approach for Very Large Scale Face Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MogFace: Towards a Deeper Appreciation on Face Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unsupervised Visual Representation Learning by Online Constrained K-Means.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Scaled ReLU Matters for Training Vision Transformers.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

TransZero: Attribute-Guided Transformer for Zero-Shot Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Context and Structure Mining Network for Video Object Detection.
Int. J. Comput. Vis., 2021

ELSA: Enhanced Local Self-Attention for Vision Transformer.
CoRR, 2021

TransZero++: Cross Attribute-Guided Transformer for Zero-Shot Learning.
CoRR, 2021

Improved Fine-tuning by Leveraging Pre-training Data: Theory and Practice.
CoRR, 2021

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification.
CoRR, 2021

2nd Place Solution to Google Landmark Retrieval 2021.
CoRR, 2021

Fine-Grained AutoAugmentation for Multi-Label Classification.
CoRR, 2021

Align Yourself: Self-supervised Pre-training for Fine-grained Recognition via Saliency Alignment.
CoRR, 2021

KVT: k-NN Attention for Boosting Vision Transformers.
CoRR, 2021

An Efficient Training Approach for Very Large Scale Face Recognition.
CoRR, 2021

Importance Weighted Adversarial Discriminative Transfer for Anomaly Detection.
CoRR, 2021

Why Does Multi-Epoch Training Help?
CoRR, 2021

Learning to Cluster Faces via Transformer.
CoRR, 2021

A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation.
CoRR, 2021

Spatiotemporal Entropy Model is All You Need for Learned Video Compression.
CoRR, 2021

A Theoretical Analysis of Learning with Noisily Labeled Data.
CoRR, 2021

Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation.
CoRR, 2021

MogFace: Rethinking Scale Augmentation on the Face Detector.
CoRR, 2021

Zen-NAS: A Zero-Shot NAS for High-Performance Deep Image Recognition.
CoRR, 2021

Object Detection Made Simpler by Eliminating Heuristic NMS.
CoRR, 2021

1st Place Solution to ECCV-TAO-2020: Detect and Represent Any Object for Tracking.
CoRR, 2021

HSVA: Hierarchical Semantic-Visual Adaptation for Zero-Shot Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Interpolation Variable Rate Image Compression.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Exploring the Quality of GAN Generated Images for Person Re-Identification.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Dash: Semi-Supervised Learning with Dynamic Thresholding.
Proceedings of the 38th International Conference on Machine Learning, 2021

Learning Accurate Entropy Model with Global Reference for Image Compression.
Proceedings of the 9th International Conference on Learning Representations, 2021

Mask Aware Network for Masked Face Recognition in the Wild.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Get better 1 pixel PCK: ladder scales correspondence flow networks for remote sensing image matching in higher resolution.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation<sup>*</sup>.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Digging into Uncertainty in Self-supervised Multi-view Stereo.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Weakly Supervised Representation Learning with Coarse Labels.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Zen-NAS: A Zero-Shot NAS for High-Performance Image Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

TransReID: Transformer-based Object Re-Identification.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

An Empirical Study of Vehicle Re-Identification on the AI City Challenge.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

City-Scale Multi-Camera Vehicle Tracking Guided by Crossroad Zones.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Instant-Teaching: An End-to-End Semi-Supervised Object Detection Framework.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
1st Place Solution to VisDA-2020: Bias Elimination for Domain Adaptive Pedestrian Re-identification.
CoRR, 2020

AU-Guided Unsupervised Domain Adaptive Facial Expression Recognition.
CoRR, 2020

WeMix: How to Better Utilize Data Augmentation.
CoRR, 2020

Efficient Kernel Transfer in Knowledge Distillation.
CoRR, 2020

Semi-Anchored Detector for One-Stage Object Detection.
CoRR, 2020

Neural Architecture Design for GPU-Efficient Networks.
CoRR, 2020

Towards Understanding Label Smoothing.
CoRR, 2020

Representation Learning with Fine-grained Patterns.
CoRR, 2020

Exploiting Better Feature Aggregation for Video Object Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

7.2 A 12nm Programmable Convolution-Efficient Neural-Processing-Unit Chip Achieving 825TOPS.
Proceedings of the 2020 IEEE International Solid- State Circuits Conference, 2020

Unsupervised Style Transfer via Dualgan for Cross-Domain Aerial Image Classification.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2020

Hierarchically Robust Representation Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

DR Loss: Improving Object Detection by Distributional Ranking.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Multi-Domain Learning and Identity Mining for Vehicle Re-Identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
SoftTriple Loss: Deep Metric Learning Without Triplet Sampling.
CoRR, 2019

Robust Gaussian Process Regression for Real-Time High Precision GPS Signal Enhancement.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

MuffNet: Multi-Layer Feature Federation for Mobile Deep Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Learning to Rank Proposals for Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

SoftTriple Loss: Deep Metric Learning Without Triplet Sampling.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Robust Optimization over Multiple Domains.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Large-Scale Distance Metric Learning With Uncertainty.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Extremely Low Bit Neural Network: Squeeze the Last Bit Out With ADMM.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Extremely Low Bit Neural Network: Squeeze the Last Bit Out with ADMM.
CoRR, 2017

The Opensesame NIST 2016 Speaker Recognition Evaluation System.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016
Deep CTR Prediction in Display Advertising.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

2012
Multimodal Graph-Based Reranking for Web Image Search.
IEEE Trans. Image Process., 2012

2011
Optimizing multimodal reranking for web image search.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

2009
MSRA-MM 2.0: A Large-Scale Web Multimedia Dataset.
Proceedings of the ICDM Workshops 2009, 2009

2007
New timing and routability driven placement algorithms for FPGA synthesis.
Proceedings of the 17th ACM Great Lakes Symposium on VLSI 2007, 2007


  Loading...