Yunchao Wei

IEEE Trans. Pattern Anal. Mach. Intell., May, 2025

Hierarchical Attention Fusion of Visual and Textual Representations for Cross-Domain Sequential Recommendation.

[DOI]

CoRR, April, 2025

FANeRV: Frequency Separation and Augmentation based Neural Representation for Video.

[DOI]

CoRR, April, 2025

Dynamic feature regularized loss for weakly supervised semantic segmentation.

[DOI]

Bingfeng Zhang

Ángel F. García-Fernández

Pattern Recognit., 2025

M-SEE: A multi-scale encoder enhancement framework for end-to-end Weakly Supervised Semantic Segmentation.

[DOI]

Pattern Recognit., 2025

Revisiting 3D point cloud analysis with Markov process.

[DOI]

Pattern Recognit., 2025

Generative Prompt Controlled Diffusion for weakly supervised semantic segmentation.

[DOI]

Neurocomputing, 2025

High-Frequency Enhanced Hybrid Neural Representation for video compression.

[DOI]

Expert Syst. Appl., 2025

Segmentation guided dual-branch classification for measuring fat infiltration in paraspinal muscles.

[DOI]

Expert Syst. Appl., 2025

Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation.

[DOI]

Eng. Appl. Artif. Intell., 2025

Image Fusion for Cross-Domain Sequential Recommendation.

[DOI]

Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025

CNC: Cross-modal Normality Constraint for Unsupervised Multi-class Anomaly Detection.

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Class Activation Map Calibration for Weakly Supervised Semantic Segmentation.

[DOI]

Jian Wang

Tianhong Dai

Xinqiao Zhao

Eng Gee Lim

Ángel F. García-Fernández

IEEE Trans. Circuits Syst. Video Technol., November, 2024

Unified Multi-Modality Video Object Segmentation Using Reinforcement Learning.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., August, 2024

Multi-Keys Attention Network for Image Captioning.

[DOI]

Cogn. Comput., May, 2024

Prototype Guided Pseudo Labeling and Perturbation-based Active Learning for domain adaptive semantic segmentation.

[DOI]

Pattern Recognit., April, 2024

Self-supervised learning for point cloud data: A survey.

[DOI]

Expert Syst. Appl., March, 2024

Trajectory Poisson Multi-Bernoulli Mixture Filter for Traffic Monitoring Using a Drone.

[DOI]

Ángel F. García-Fernández

IEEE Trans. Veh. Technol., January, 2024

Cross-frame feature-saliency mutual reinforcing for weakly supervised video salient object detection.

[DOI]

Eng Gee Lim

Pattern Recognit., 2024

Image Augmentation Agent for Weakly Supervised Semantic Segmentation.

[DOI]

CoRR, 2024

Prompt Categories Cluster for Weakly Supervised Semantic Segmentation.

[DOI]

CoRR, 2024

Event USKT : U-State Space Model in Knowledge Transfer for Event Cameras.

[DOI]

CoRR, 2024

APC: Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation.

[DOI]

CoRR, 2024

Top-K Pooling with Patch Contrastive Learning for Weakly-Supervised Semantic Segmentation.

[DOI]

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

Image Augmentation with Controlled Diffusion for Weakly-Supervised Semantic Segmentation.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Adversarial Erasing Transformer for Weakly Supervised Semantic Segmentation.

[DOI]

Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

PSDPM: Prototype-based Secondary Discriminative Pixels Mining for Weakly Supervised Semantic Segmentation.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Continual Segmentation with Disentangled Objectness Learning and Class Recognition.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SFC: Shared Feature Calibration in Weakly Supervised Semantic Segmentation.

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Robust generative adversarial network.

[DOI]

Mach. Learn., December, 2023

Fully and Weakly Supervised Referring Expression Segmentation With End-to-End Learning.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., October, 2023

Credible Dual-Expert Learning for Weakly Supervised Semantic Segmentation.

[DOI]

Int. J. Comput. Vis., August, 2023

Plausible Proxy Mining With Credibility for Unsupervised Person Re-Identification.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., July, 2023

Towards Simple and Accurate Human Pose Estimation With Stair Network.

[DOI]

IEEE Trans. Emerg. Top. Comput. Intell., June, 2023

Real-Time Prediction of Simulator Sickness in Virtual Reality Games.

[DOI]

IEEE Trans. Games, June, 2023

Weight-guided class complementing for long-tailed image recognition.

[DOI]

Pattern Recognit., June, 2023

Aggregated pyramid gating network for human pose estimation without pre-training.

[DOI]

Pattern Recognit., June, 2023

Starting Point Selection and Multiple-Standard Matching for Video Object Segmentation With Language Annotation.

[DOI]

IEEE Trans. Multim., 2023

Cycle-Free Weakly Referring Expression Grounding With Self-Paced Learning.

[DOI]

IEEE Trans. Multim., 2023

Weight-guided loss for long-tailed object detection and instance segmentation.

[DOI]

Signal Process. Image Commun., 2023

PointGS: Bridging and fusing geometric and semantic space for 3D point cloud analysis.

[DOI]

Inf. Fusion, 2023

Synchronize Feature Extracting and Matching: A Single Branch Framework for 3D Object Tracking.

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FastRecon: Few-shot Industrial Anomaly Detection via Fast Feature Reconstruction.

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Hunting Sparsity: Density-Guided Contrastive Learning for Semi-Supervised Semantic Segmentation.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

ToF and Stereo Data Fusion Using Dynamic Search Range Stereo Matching.

[DOI]

Yong Deng

Steven Zhiying Zhou

IEEE Trans. Multim., 2022

Transformer-Based Language-Person Search With Multiple Region Slicing.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

Neural texture transfer assisted video coding with adaptive up-sampling.

[DOI]

Signal Process. Image Commun., 2022

Unsupervised domain adaptation in homogeneous distance space for person re-identification.

[DOI]

Pattern Recognit., 2022

Soft pseudo-Label shrinkage for unsupervised domain adaptive person re-identification.

[DOI]

Pattern Recognit., 2022

End-to-end weakly supervised semantic segmentation with reliable region mining.

[DOI]

Pattern Recognit., 2022

Affinity Attention Graph Neural Network for Weakly Supervised Semantic Segmentation.

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

CARD: Semi-supervised Semantic Segmentation via Class-agnostic Relation based Denoising.

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Democracy Does Matter: Comprehensive Feature Mining for Co-Salient Object Detection.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Detail Preserving Coarse-to-Fine Matching for Stereo Matching and Optical Flow.

[DOI]

IEEE Trans. Image Process., 2021

Fast pixel-matching for video object segmentation.

[DOI]

Signal Process. Image Commun., 2021

Exploiting textual queries for dynamically visual disambiguation.

[DOI]

Pattern Recognit., 2021

Progressive sample mining and representation learning for one-shot person re-identification.

[DOI]

Pattern Recognit., 2021

Discriminative Triad Matching and Reconstruction for Weakly Referring Expression Grounding.

[DOI]

John Yannis Goulermas

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Occluded and tiny face detection network for dense crowd.

[DOI]

J. Inf. Hiding Multim. Signal Process., 2021

Fast Pixel-Matching for Video Object Segmentation.

[DOI]

CoRR, 2021

A Lightweight Real-time Stereo Depth Estimation Network with Dynamic Upsampling Modules.

[DOI]

Yong Deng

Steven Zhiying Zhou

Proceedings of the 16th International Joint Conference on Computer Vision, 2021

Self-Guided and Cross-Guided Learning for Few-Shot Segmentation.

[DOI]

Bingfeng Zhang

Terry Qin

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Iterative Shrinking for Referring Expression Grounding Using Deep Reinforcement Learning.

[DOI]

Mingjie Sun

Eng Gee Lim

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Structure-Consistent Weakly Supervised Salient Object Detection with Local Saliency Coherence.

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Correlation Filter Selection for Visual Tracking Using Reinforcement Learning.

[DOI]

Yanchun Xie

Kaizhu Huang

Jeyarajan Thiyagalingam

IEEE Trans. Circuits Syst. Video Technol., 2020

Segmentation mask guided end-to-end person search.

[DOI]

Signal Process. Image Commun., 2020

Single image-based head pose estimation with spherical parametrization and 3D morphing.

[DOI]

Pattern Recognit., 2020

Adaptive ROI generation for video object segmentation using reinforcement learning.

[DOI]

Pattern Recognit., 2020

Generative adversarial classifier for handwriting characters super-resolution.

[DOI]

Pattern Recognit., 2020

Robust Generative Adversarial Network.

[DOI]

CoRR, 2020

Pay Attention Selectively and Comprehensively: Pyramid Gating Network for Human Pose Estimation without Pre-training.

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Feature Representation Matters: End-to-End Learning for Reference-Based Image Super-Resolution.

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Fast Template Matching and Update for Video Object Tracking and Segmentation.

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Reliability Does Matter: An End-to-End Weakly Supervised Semantic Segmentation Approach.

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Video Streaming Adaptation Strategy for Multiview Navigation Over DASH.

[DOI]

IEEE Trans. Broadcast., 2019

Multiview video quality enhancement without depth information.

[DOI]

Samer Jammal

Signal Process. Image Commun., 2019

IAN: The Individual Aggregation Network for Person Search.

[DOI]

Pattern Recognit., 2019

Progressive Sample Mining and Representation Learning for One-Shot Person Re-identification with Adversarial Samples.

[DOI]

CoRR, 2019

A Single Image based Head Pose Estimation Method with Spherical Parameterization.

[DOI]

CoRR, 2019

Edge Orientation Driven Depth Super-Resolution for View Synthesis.

[DOI]

Proceedings of the Image and Graphics - 10th International Conference, 2019

2018

Cooperative Bargaining Game-Based Multiuser Bandwidth Allocation for Dynamic Adaptive Streaming Over HTTP.

[DOI]

IEEE Trans. Multim., 2018

Convolutional Neural Network for Intermediate View Enhancement in Multiview Streaming.

[DOI]

IEEE Trans. Multim., 2018

Region-Based Multiple Description Coding for Multiview Video Plus Depth Video.

[DOI]

IEEE Trans. Multim., 2018

Visual aesthetic understanding: Sample-specific aesthetic classification and deep activation map visualization.

[DOI]

Signal Process. Image Commun., 2018

Siamese network ensemble for visual tracking.

[DOI]

Neurocomputing, 2018

Image Ordinal Classification and Understanding: Grid Dropout with Masking Label.

[DOI]

Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

2017

Texture Plus Depth Video Coding Using Camera Global Motion Information.

[DOI]

IEEE Trans. Multim., 2017

End-to-End Distortion-Based Multiuser Bandwidth Allocation for Real-Time Video Transmission Over LTE Network.

[DOI]

IEEE Trans. Broadcast., 2017

QoE-Driven Dynamic Adaptive Video Streaming Strategy With Future Information.

[DOI]

Li Yu

IEEE Trans. Broadcast., 2017

An effective CU size decision method for quality scalability in SHVC.

[DOI]

Multim. Tools Appl., 2017

Disparity Estimation Using Convolutional Neural Networks with Multi-scale Correlation.

[DOI]

Samer Jammal

Proceedings of the Neural Information Processing - 24th International Conference, 2017

Multi-resolution for disparity estimation with convolutional neural networks.

[DOI]

Samer Jammal

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Depth Map Down-Sampling and Coding Based on Synthesized View Distortion.

[DOI]

IEEE Trans. Multim., 2016

Virtual-View-Assisted Video Super-Resolution and Enhancement.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2016

Multiview video plus depth transmission via virtual-view-assisted complementary down/upsampling.

[DOI]

EURASIP J. Image Video Process., 2016

Packetization strategies for MVD-based 3D video transmission.

[DOI]

Proceedings of the 2016 Visual Communications and Image Processing, 2016

3D video super-resolution using fully convolutional neural networks.

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

2015

Scalable Bit Allocation Between Texture and Depth Views for 3-D Video Streaming Over Heterogeneous Networks.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2015

Multiple Description Coding for Stereoscopic Videos With Stagger Frame Order.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2015

Depth Map Coding Using Histogram-Based Segmentation and Depth Range Updating.

[DOI]

KSII Trans. Internet Inf. Syst., 2015

Erratum to: One-class kernel subspace ensemble for medical image classification.

[DOI]

EURASIP J. Adv. Signal Process., 2015

A Paradigm for Dynamic Adaptive Streaming over HTTP for Multi-view Video.

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Depth-Based Stereoscopic Projection Approach for 3D Saliency Detection.

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

A Flexible Programmable Camera Control and Data Acquisition Hardware Platform.

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Global Motion Information Based Depth Map Sequence Coding.

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

3D video coding using motion information and depth map.

[DOI]

Fei Cheng

Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Statistical approach for motion estimation skipping (SAMEK).

[DOI]

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Depth filter design by jointly utilizing spatial-temporal depth and texture information.

[DOI]

Proceedings of the 2015 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2015

2014

Macroblock Level Bits Allocation for Depth Maps in 3-D Video Coding.

[DOI]

J. Signal Process. Syst., 2014

Depth Map Driven Hole Filling Algorithm Exploiting Temporal Correlation Information.

[DOI]

IEEE Trans. Broadcast., 2014

Optimizing the deadzone width to improve the polyphase-based multiple description coding.

[DOI]

Multim. Tools Appl., 2014

Correlation based universal image/video coding loss recovery.

[DOI]

J. Vis. Commun. Image Represent., 2014

One-class kernel subspace ensemble for medical image classification.

[DOI]

EURASIP J. Adv. Signal Process., 2014

Dynamic redundancy allocation for video streaming using Sub-GOP based FEC code.

[DOI]

Li Yu

Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

2013

Real-Time Video Streaming Using Randomized Expanding Reed-Solomon Code.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2013

A Real-Time Error Resilient Video Streaming Scheme Exploiting the Late- and Early-Arrival Packets.

[DOI]

IEEE Trans. Broadcast., 2013

Novel Wireless Capsule Endoscopy Diagnosis System with Adaptive Image Capturing Rate.

Proceedings of the VISAPP 2013, 2013

3-D video depth map quantization based on Lloyd's algorithm.

[DOI]

Proceedings of the 11th IVMSP Workshop: 3D Image/Video Technologies and Applications, 2013

Multiple description video coding based on forward error correction within expanding windows.

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2013

2012

Dynamic Sub-GOP Forward Error Correction Code for Real-Time Video Applications.

[DOI]

IEEE Trans. Multim., 2012

Video Error Concealment of P-frame Using Packets of the Following Frames.

[DOI]

Proceedings of the Eighth International Conference on Signal Image Technology and Internet Based Systems, 2012

Real-time video streaming exploiting the late-arrival packets.

[DOI]

Proceedings of the 2012 Picture Coding Symposium, 2012

Real-Time Macroblock Level Bits Allocation for Depth Maps in 3-D Video Coding.

[DOI]