Yazhou Yao

Orcid: 0000-0002-0337-9410

According to our database, Yazhou Yao authored at least 129 papers between 2016 and 2024.

Bibliography

2024
Robust-EQA: Robust Learning for Embodied Question Answering With Noisy Labels.
IEEE Trans. Neural Networks Learn. Syst., September, 2024

Holistic Prototype Attention Network for Few-Shot Video Object Segmentation.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Dual Dynamic Threshold Adjustment Strategy.
ACM Trans. Multim. Comput. Commun. Appl., July, 2024

Deep Metric Learning Based on Meta-Mining Strategy With Semiglobal Information.
IEEE Trans. Neural Networks Learn. Syst., April, 2024

Two-stage fine-grained image classification model based on multi-granularity feature fusion.
Pattern Recognit., February, 2024

Learning With Imbalanced Noisy Data by Preventing Bias in Sample Selection.
IEEE Trans. Multim., 2024

Anti-Collapse Loss for Deep Metric Learning.
IEEE Trans. Multim., 2024

Spatial Structure Constraints for Weakly Supervised Semantic Segmentation.
IEEE Trans. Image Process., 2024

LTFormer: A light-weight transformer-based self-supervised matching network for heterogeneous remote sensing images.
Inf. Fusion, 2024

Class Probability Space Regularization for semi-supervised semantic segmentation.
Comput. Vis. Image Underst., 2024

COMOGen: A Controllable Text-to-3D Multi-object Generation Framework.
CoRR, 2024

Relating CNN-Transformer Fusion Network for Change Detection.
CoRR, 2024

Anti-Collapse Loss for Deep Metric Learning Based on Coding Rate Metric.
CoRR, 2024

Universal Organizer of SAM for Unsupervised Semantic Segmentation.
CoRR, 2024

A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images.
CoRR, 2024

Dual Dynamic Threshold Adjustment Strategy for Deep Metric Learning.
CoRR, 2024

Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation.
CoRR, 2024

Group Benefits Instances Selection for Data Purification.
CoRR, 2024

Group benefits instance for data purification.
Comput. Electr. Eng., 2024

Delving Deeper Into Clean Samples for Combating Noisy Labels.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Enhancing Robustness in Learning with Noisy Labels: An Asymmetric Co-Training Approach.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

AdaFPP: Adapt-Focused Bi-Propagating Prototype Learning for Panoramic Activity Recognition.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Progressively Robust Loss for Deep Learning with Noisy Labels.
Proceedings of the International Joint Conference on Neural Networks, 2024

Universal Organizer of Segment Anything Model for Unsupervised Semantic Segmentation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Relating CNN-Transformer Fusion Network for Remote Sensing Change Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Foster Adaptivity and Balance in Learning with Noisy Labels.
Proceedings of the Computer Vision - ECCV 2024, 2024

Veil Privacy on Visual Data: Concealing Privacy for Humans, Unveiling for DNNs.
Proceedings of the Computer Vision - ECCV 2024, 2024

Knowledge Transfer with Simulated Inter-image Erasing for Weakly Supervised Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

SMP-Track: SAM in Multi-Pedestrian Tracking.
Proceedings of the 11th IEEE International Conference on Data Science and Advanced Analytics, 2024

VideoMAC: Video Masked Autoencoders Meet ConvNets.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Poly Kernel Inception Network for Remote Sensing Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Adaptive Integration of Partial Label Learning and Negative Learning for Enhanced Noisy Label Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Guest Editorial: Learning from limited annotations for computer vision tasks.
IET Comput. Vis., August, 2023

Information bottleneck and selective noise supervision for zero-shot learning.
Mach. Learn., July, 2023

Depth and Video Segmentation Based Visual Attention for Embodied Question Answering.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Motion Stimulation for Compositional Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., May, 2023

Guided by Meta-Set: A Data-Driven Method for Fine-Grained Visual Recognition.
IEEE Trans. Multim., 2023

Boosting Robust Learning Via Leveraging Reusable Samples in Noisy Web Data.
IEEE Trans. Multim., 2023

Attention Map Guided Transformer Pruning for Occluded Person Re-Identification on Edge Device.
IEEE Trans. Multim., 2023

FECANet: Boosting Few-Shot Semantic Segmentation With Feature-Enhanced Context-Aware Network.
IEEE Trans. Multim., 2023

Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic Segmentation.
IEEE Trans. Multim., 2023

Hierarchical Co-Attention Propagation Network for Zero-Shot Video Object Segmentation.
IEEE Trans. Image Process., 2023

Hierarchical Graph Pattern Understanding for Zero-Shot Video Object Segmentation.
IEEE Trans. Image Process., 2023

Multi-Granularity Denoising and Bidirectional Alignment for Weakly Supervised Semantic Segmentation.
IEEE Trans. Image Process., 2023

Co-mining: Mining informative samples with noisy labels.
Signal Process., 2023

Robust learning from noisy web data for fine-Grained recognition.
Pattern Recognit., 2023

Hierarchical Graph Pattern Understanding for Zero-Shot VOS.
CoRR, 2023

Holistic Prototype Attention Network for Few-Shot VOS.
CoRR, 2023

Co-attention Propagation Network for Zero-Shot Video Object Segmentation.
CoRR, 2023

Attention Map Guided Transformer Pruning for Edge Device.
CoRR, 2023

Semi-Supervised Semantic Segmentation With Region Relevance.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

2022
Multimodal Marketing Intent Analysis for Effective Targeted Advertising.
IEEE Trans. Multim., 2022

Guest Editorial: Learning From Noisy Multimedia Data.
IEEE Trans. Multim., 2022

Exploiting Web Images for Fine-Grained Visual Recognition by Eliminating Open-Set Noise and Utilizing Hard Examples.
IEEE Trans. Multim., 2022

Semantically Meaningful Class Prototype Learning for One-Shot Image Segmentation.
IEEE Trans. Multim., 2022

Self-Supervised Depth Completion From Direct Visual-LiDAR Odometry in Autonomous Driving.
IEEE Trans. Intell. Transp. Syst., 2022

Self-Supervised Multi-Modal Hybrid Fusion Network for Brain Tumor Segmentation.
IEEE J. Biomed. Health Informatics, 2022

Dense Semantics-Assisted Networks for Video Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2022

DBFC-Net: a uniform framework for fine-grained cross-media retrieval.
Multim. Syst., 2022

An Uncertainly Dynamic Loss Correction and Global Sample Selection Method for Webly Supervised Fine-Grained Visual Classification.
Circuits Syst. Signal Process., 2022

Few-Shot Object Detection via Understanding Convolution and Attention.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Unsupervised Pre-training for 3D Object Detection with Transformer.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Feature Difference Enhancement Fusion for Remote Sensing Image Change Detection.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Exploring Linear Feature Disentanglement for Neural Networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

PNP: Robust Learning from Noisy Labels by Probabilistic Noise Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Deep Unsupervised Self-Evolutionary Hashing for Image Retrieval.
IEEE Trans. Multim., 2021

VMAN: A Virtual Mainstay Alignment Network for Transductive Zero-Shot Learning.
IEEE Trans. Image Process., 2021

Exploiting textual queries for dynamically visual disambiguation.
Pattern Recognit., 2021

Knowledge memorization and generation for action recognition in still images.
Pattern Recognit., 2021

Semantically Meaningful Class Prototype Learning for One-Shot Image Semantic Segmentation.
CoRR, 2021

Exploiting Web Images for Fine-Grained Visual Recognition by Eliminating Noisy Samples and Utilizing Hard Ones.
CoRR, 2021

Local Self-Attention on Fine-grained Cross-media Retrieval.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Extracting Useful Knowledge from Noisy Web Images via Data Purification for Fine-Grained Recognition.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Curriculum-Based Meta-learning.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Video Representation Learning with Graph Contrastive Augmentation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

CAA: Candidate-Aware Aggregation for Temporal Action Detection.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Link Prediction with Multiple Structural Attentions in Multiplex Networks.
Proceedings of the International Joint Conference on Neural Networks, 2021

Few-Shot Semantic Segmentation with Cyclic Memory Network.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Webly Supervised Fine-Grained Recognition: Benchmark Datasets and An Approach.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Jo-SRC: A Contrastive Approach for Combating Noisy Labels.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Exploiting Web Images for Multi-Output Classification: From Category to Subcategories.
IEEE Trans. Neural Networks Learn. Syst., 2020

Approximate Kernel Selection via Matrix Approximation.
IEEE Trans. Neural Networks Learn. Syst., 2020

Towards Automatic Construction of Diverse, High-Quality Image Datasets.
IEEE Trans. Knowl. Data Eng., 2020

Pseudo distribution on unseen classes for generalized zero shot learning.
Pattern Recognit. Lett., 2020

CAN-GAN: Conditioned-attention normalized GAN for face age synthesis.
Pattern Recognit. Lett., 2020

Road segmentation with image-LiDAR data fusion in deep neural network.
Multim. Tools Appl., 2020

Tips and Tricks for Webly-Supervised Fine-Grained Recognition: Learning from the WebFG 2020 Challenge.
CoRR, 2020

Data-driven Meta-set Based Fine-Grained Visual Classification.
CoRR, 2020

Salvage Reusable Samples from Noisy Data for Robust Learning.
CoRR, 2020

Exploiting Category Similarity-Based Distributed Labeling for Fine-Grained Visual Classification.
IEEE Access, 2020

A Novel CNN Architecture for Real-Time Point Cloud Recognition in Road Environment.
Proceedings of the Pattern Recognition and Computer Vision - Third Chinese Conference, 2020

Multi-model Network for Fine-Grained Cross-Media Retrieval.
Proceedings of the Pattern Recognition and Computer Vision - Third Chinese Conference, 2020

Field-wise Learning for Multi-field Categorical Data.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Data-driven Meta-set Based Fine-Grained Visual Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Bridging the Web Data and Fine-Grained Visual Recognition via Alleviating Label Noise and Domain Mismatch.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

CRSSC: Salvage Reusable Samples from Noisy Data for Robust Learning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

PyRetri: A PyTorch-based Library for Unsupervised Image Retrieval by Deep Convolutional Neural Networks.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Set and Rebase: Determining the Semantic Graph Connectivity for Unsupervised Cross-Modal Hashing.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Generative Adversarial and Self-Attention Based Fine-Grained Cross-Media Retrieval.
Proceedings of the ICVISP 2020: 4th International Conference on Vision, 2020

Web-Supervised Network for Fine-Grained Visual Classification.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Hsi Road: A Hyper Spectral Image Dataset For Road Segmentation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Classification Constrained Discriminator For Domain Adaptive Semantic Segmentation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Region Graph Embedding Network for Zero-Shot Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

Motion-Attentive Transition for Zero-Shot Video Object Segmentation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Web-Supervised Network with Softly Update-Drop Training for Fine-Grained Visual Classification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Extracting Multiple Visual Senses for Web Learning.
IEEE Trans. Multim., 2019

Extracting Privileged Information for Enhancing Classifier Learning.
IEEE Trans. Image Process., 2019

Exploiting textual and visual features for image categorization.
Pattern Recognit. Lett., 2019

Deep representation learning for road detection using Siamese network.
Multim. Tools Appl., 2019

Clustering-driven unsupervised deep hashing for image retrieval.
Neurocomputing, 2019

Deep Representation Learning for Road Detection through Siamese Network.
CoRR, 2019

Road Segmentation with Image-LiDAR Data Fusion.
CoRR, 2019

Dynamically Visual Disambiguation of Keyword-based Image Search.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

SegEQA: Video Segmentation Based Visual Attention for Embodied Question Answering.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Attentive Region Embedding Network for Zero-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Towards automatic construction of diverse, high-quality image dataset.
PhD thesis, 2018

Collaborative representation based local discriminant projection for feature extraction.
Digit. Signal Process., 2018

Extracting Privileged Information from Untagged Corpora for Classifier Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Discovering and Distinguishing Multiple Visual Senses for Polysemous Words.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Exploiting Web Images for Dataset Construction: A Domain Robust Approach.
IEEE Trans. Multim., 2017

A new web-supervised method for image dataset constructions.
Neurocomputing, 2017

Towards Automatic Construction of Diverse, High-quality Image Dataset.
CoRR, 2017

Refining Image Categorization by Exploiting Web Images and General Corpus.
CoRR, 2017

Deep Learning for Person Reidentification Using Support Vector Machines.
Adv. Multim., 2017

2016
Extracting Visual Knowledge from the Internet: Making Sense of Image Data.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

A Domain Robust Approach For Image Dataset Construction.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Automatic image dataset construction with multiple textual metadata.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

