Yang Liu

Orcid: 0000-0002-4259-3882

Affiliations:
  • Peking University, Wangxuan Institute of Computer Technology, Beijing, China
  • University of Oxford, UK (former)
  • University of Cambridge, Computer Laboratory, UK (former)


According to our database1, Yang Liu authored at least 61 papers between 2015 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
TeachText: CrossModal text-video retrieval through generalized distillation.
Artif. Intell., 2025

2024
Evidential Multi-Source-Free Unsupervised Domain Adaptation.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2024

IoT-V2E: An Uncertainty-Aware Cross-Modal Hashing Retrieval Between Infrared-Videos and EEGs for Automated Sleep State Analysis.
IEEE Internet Things J., February, 2024

MAAN: Memory-Augmented Auto-Regressive Network for Text-Driven 3D Indoor Scene Generation.
IEEE Trans. Multim., 2024

MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval.
CoRR, 2024

Diff-BGM: A Diffusion Model for Video Background Music Generation.
CoRR, 2024

Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection.
CoRR, 2024

Progressive trajectory matching for medical dataset distillation.
CoRR, 2024

Generative Video Diffusion for Unseen Cross-Domain Video Moment Retrieval.
CoRR, 2024

Zero-Shot Video Moment Retrieval from Frozen Vision-Language Models.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

RelScene: A Benchmark and baseline for Spatial Relations in text-driven 3D Scene Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

3D Vision and Language Pretraining with Large-Scale Synthetic Data.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Semantic-Aware Human Object Interaction Image Generation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Training-Free Video Temporal Grounding Using Large-Scale Pre-trained Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

WAS: Dataset and Methods for Artistic Text Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Exploring Conditional Multi-modal Prompts for Zero-Shot HOI Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

Active Object Detection with Knowledge Aggregation and Distillation from Large Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OED: Towards One-stage End-to-End Dynamic Scene Graph Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Novel Class Discovery in Chest X-rays via Paired Images and Text.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Semantic-Guided Novel Category Discovery.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Uncertainty-Induced Transferability Representation for Source-Free Unsupervised Domain Adaptation.
IEEE Trans. Image Process., 2023

Incorporating Pre-training Data Matters in Unsupervised Domain Adaptation.
CoRR, 2023

Recent Advances in Class-Incremental Learning.
Proceedings of the Image and Graphics - 12th International Conference, 2023

Masked Retraining Teacher-Student Framework for Domain Adaptive Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Confidence-aware Pseudo-label Learning for Weakly Supervised Visual Grounding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Moment Detection in Long Tutorial Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Phrase-Level Temporal Relationship Mining for Temporal Sentence Localization.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Seeing your sleep stage: cross-modal distillation from EEG to infrared video.
CoRR, 2022

Team PKU-WICT-MIPL PIC Makeup Temporal Video Grounding Challenge 2022 Technical Report.
CoRR, 2022

Delving into the Continuous Domain Adaptation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Phrase-level Prediction for Video Temporal Localization.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Video Activity Localisation with Uncertainties in Temporal Boundary.
Proceedings of the Computer Vision - ECCV 2022, 2022

Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Cross Modal Retrieval with Querybank Normalisation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Weakly Supervised Video Moment Localization with Contrastive Negative Sample Mining.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Cross-Sentence Temporal and Semantic Relations in Video Activity Localisation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

TeachText: CrossModal Generalized Distillation for Text-Video Retrieval.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

QUERYD: A Video Dataset with High-Quality Text and Audio Narrations.
Proceedings of the IEEE International Conference on Acoustics, 2021

Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Mind-the-Gap! Unsupervised Domain Adaptation for Text-Video Retrieval.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
QuerYD: A video dataset with high-quality textual and audio narrations.
CoRR, 2020

The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020).
CoRR, 2020

Respiration and Activity Detection Based on Passive Radio Sensing in Home Environments.
IEEE Access, 2020

Cross-Device Cross-Anatomy Adaptation Network for Ultrasound Video Analysis.
Proceedings of the Medical Ultrasound, and Preterm, Perinatal and Paediatric Image Analysis, 2020

Amplifying Key Cues for Human-Object-Interaction Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

Structure-Aware Feature Fusion for Unsupervised Domain Adaptation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Use What You Have: Video retrieval using representations from collaborative experts.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Application of prior information to discriminative feature learning.
PhD thesis, 2018

Synthetically Supervised Feature Learning for Scene Text Recognition.
Proceedings of the Computer Vision - ECCV 2018, 2018

Multi-Task Adversarial Network for Disentangled Feature Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Re-Weighted Adversarial Adaptation Network for Unsupervised Domain Adaptation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Dictionary Learning Inspired Deep Network for Scene Recognition.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Deep network for image super-resolution with a dictionary learning layer.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

2016
Simultaneous Bayesian Sparse Approximation With Structured Sparse Models.
IEEE Trans. Signal Process., 2016

Support Discrimination Dictionary Learning for Image Classification.
Proceedings of the Computer Vision - ECCV 2016, 2016

2015
A New Face Recognition Algorithm based on Dictionary Learning for a Single Training Sample per Person.
Proceedings of the British Machine Vision Conference 2015, 2015


  Loading...