2025
Frozen CLIP-DINO: A Strong Backbone for Weakly Supervised Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2025
Hierarchical Attention Fusion of Visual and Textual Representations for Cross-Domain Sequential Recommendation.
CoRR, April, 2025
FANeRV: Frequency Separation and Augmentation based Neural Representation for Video.
CoRR, April, 2025
Dynamic feature regularized loss for weakly supervised semantic segmentation.
Pattern Recognit., 2025
M-SEE: A multi-scale encoder enhancement framework for end-to-end Weakly Supervised Semantic Segmentation.
Pattern Recognit., 2025
Revisiting 3D point cloud analysis with Markov process.
Pattern Recognit., 2025
Generative Prompt Controlled Diffusion for weakly supervised semantic segmentation.
Neurocomputing, 2025
High-Frequency Enhanced Hybrid Neural Representation for video compression.
Expert Syst. Appl., 2025
Segmentation guided dual-branch classification for measuring fat infiltration in paraspinal muscles.
Expert Syst. Appl., 2025
Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation.
Eng. Appl. Artif. Intell., 2025
Image Fusion for Cross-Domain Sequential Recommendation.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025
CNC: Cross-modal Normality Constraint for Unsupervised Multi-class Anomaly Detection.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Class Activation Map Calibration for Weakly Supervised Semantic Segmentation.
IEEE Trans. Circuits Syst. Video Technol., November, 2024
Unified Multi-Modality Video Object Segmentation Using Reinforcement Learning.
IEEE Trans. Circuits Syst. Video Technol., August, 2024
Multi-Keys Attention Network for Image Captioning.
Cogn. Comput., May, 2024
Prototype Guided Pseudo Labeling and Perturbation-based Active Learning for domain adaptive semantic segmentation.
Pattern Recognit., April, 2024
Self-supervised learning for point cloud data: A survey.
Expert Syst. Appl., March, 2024
Trajectory Poisson Multi-Bernoulli Mixture Filter for Traffic Monitoring Using a Drone.
IEEE Trans. Veh. Technol., January, 2024
Cross-frame feature-saliency mutual reinforcing for weakly supervised video salient object detection.
Pattern Recognit., 2024
Image Augmentation Agent for Weakly Supervised Semantic Segmentation.
CoRR, 2024
Prompt Categories Cluster for Weakly Supervised Semantic Segmentation.
CoRR, 2024
Event USKT : U-State Space Model in Knowledge Transfer for Event Cameras.
CoRR, 2024
APC: Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation.
CoRR, 2024
Top-K Pooling with Patch Contrastive Learning for Weakly-Supervised Semantic Segmentation.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024
Image Augmentation with Controlled Diffusion for Weakly-Supervised Semantic Segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2024
Adversarial Erasing Transformer for Weakly Supervised Semantic Segmentation.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024
PSDPM: Prototype-based Secondary Discriminative Pixels Mining for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Continual Segmentation with Disentangled Objectness Learning and Class Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
SFC: Shared Feature Calibration in Weakly Supervised Semantic Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Robust generative adversarial network.
Mach. Learn., December, 2023
Fully and Weakly Supervised Referring Expression Segmentation With End-to-End Learning.
IEEE Trans. Circuits Syst. Video Technol., October, 2023
Credible Dual-Expert Learning for Weakly Supervised Semantic Segmentation.
Int. J. Comput. Vis., August, 2023
Plausible Proxy Mining With Credibility for Unsupervised Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., July, 2023
Towards Simple and Accurate Human Pose Estimation With Stair Network.
IEEE Trans. Emerg. Top. Comput. Intell., June, 2023
Real-Time Prediction of Simulator Sickness in Virtual Reality Games.
IEEE Trans. Games, June, 2023
Weight-guided class complementing for long-tailed image recognition.
Pattern Recognit., June, 2023
Aggregated pyramid gating network for human pose estimation without pre-training.
Pattern Recognit., June, 2023
Starting Point Selection and Multiple-Standard Matching for Video Object Segmentation With Language Annotation.
IEEE Trans. Multim., 2023
Cycle-Free Weakly Referring Expression Grounding With Self-Paced Learning.
IEEE Trans. Multim., 2023
Weight-guided loss for long-tailed object detection and instance segmentation.
Signal Process. Image Commun., 2023
PointGS: Bridging and fusing geometric and semantic space for 3D point cloud analysis.
Inf. Fusion, 2023
Synchronize Feature Extracting and Matching: A Single Branch Framework for 3D Object Tracking.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
FastRecon: Few-shot Industrial Anomaly Detection via Fast Feature Reconstruction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Hunting Sparsity: Density-Guided Contrastive Learning for Semi-Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
ToF and Stereo Data Fusion Using Dynamic Search Range Stereo Matching.
IEEE Trans. Multim., 2022
Transformer-Based Language-Person Search With Multiple Region Slicing.
IEEE Trans. Circuits Syst. Video Technol., 2022
Neural texture transfer assisted video coding with adaptive up-sampling.
Signal Process. Image Commun., 2022
Unsupervised domain adaptation in homogeneous distance space for person re-identification.
Pattern Recognit., 2022
Soft pseudo-Label shrinkage for unsupervised domain adaptive person re-identification.
Pattern Recognit., 2022
End-to-end weakly supervised semantic segmentation with reliable region mining.
Pattern Recognit., 2022
Affinity Attention Graph Neural Network for Weakly Supervised Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
CARD: Semi-supervised Semantic Segmentation via Class-agnostic Relation based Denoising.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Democracy Does Matter: Comprehensive Feature Mining for Co-Salient Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
Detail Preserving Coarse-to-Fine Matching for Stereo Matching and Optical Flow.
IEEE Trans. Image Process., 2021
Fast pixel-matching for video object segmentation.
Signal Process. Image Commun., 2021
Exploiting textual queries for dynamically visual disambiguation.
Pattern Recognit., 2021
Progressive sample mining and representation learning for one-shot person re-identification.
Pattern Recognit., 2021
Discriminative Triad Matching and Reconstruction for Weakly Referring Expression Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., 2021
Occluded and tiny face detection network for dense crowd.
J. Inf. Hiding Multim. Signal Process., 2021
Fast Pixel-Matching for Video Object Segmentation.
CoRR, 2021
A Lightweight Real-time Stereo Depth Estimation Network with Dynamic Upsampling Modules.
Proceedings of the 16th International Joint Conference on Computer Vision, 2021
Self-Guided and Cross-Guided Learning for Few-Shot Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Iterative Shrinking for Referring Expression Grounding Using Deep Reinforcement Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Structure-Consistent Weakly Supervised Salient Object Detection with Local Saliency Coherence.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Correlation Filter Selection for Visual Tracking Using Reinforcement Learning.
IEEE Trans. Circuits Syst. Video Technol., 2020
Segmentation mask guided end-to-end person search.
Signal Process. Image Commun., 2020
Single image-based head pose estimation with spherical parametrization and 3D morphing.
Pattern Recognit., 2020
Adaptive ROI generation for video object segmentation using reinforcement learning.
Pattern Recognit., 2020
Generative adversarial classifier for handwriting characters super-resolution.
Pattern Recognit., 2020
Robust Generative Adversarial Network.
CoRR, 2020
Pay Attention Selectively and Comprehensively: Pyramid Gating Network for Human Pose Estimation without Pre-training.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Feature Representation Matters: End-to-End Learning for Reference-Based Image Super-Resolution.
Proceedings of the Computer Vision - ECCV 2020, 2020
Fast Template Matching and Update for Video Object Tracking and Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Reliability Does Matter: An End-to-End Weakly Supervised Semantic Segmentation Approach.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Video Streaming Adaptation Strategy for Multiview Navigation Over DASH.
IEEE Trans. Broadcast., 2019
Multiview video quality enhancement without depth information.
Signal Process. Image Commun., 2019
IAN: The Individual Aggregation Network for Person Search.
Pattern Recognit., 2019
Progressive Sample Mining and Representation Learning for One-Shot Person Re-identification with Adversarial Samples.
CoRR, 2019
A Single Image based Head Pose Estimation Method with Spherical Parameterization.
CoRR, 2019
Edge Orientation Driven Depth Super-Resolution for View Synthesis.
Proceedings of the Image and Graphics - 10th International Conference, 2019
2018
Cooperative Bargaining Game-Based Multiuser Bandwidth Allocation for Dynamic Adaptive Streaming Over HTTP.
IEEE Trans. Multim., 2018
Convolutional Neural Network for Intermediate View Enhancement in Multiview Streaming.
IEEE Trans. Multim., 2018
Region-Based Multiple Description Coding for Multiview Video Plus Depth Video.
IEEE Trans. Multim., 2018
Visual aesthetic understanding: Sample-specific aesthetic classification and deep activation map visualization.
Signal Process. Image Commun., 2018
Siamese network ensemble for visual tracking.
Neurocomputing, 2018
Image Ordinal Classification and Understanding: Grid Dropout with Masking Label.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018
2017
Texture Plus Depth Video Coding Using Camera Global Motion Information.
IEEE Trans. Multim., 2017
End-to-End Distortion-Based Multiuser Bandwidth Allocation for Real-Time Video Transmission Over LTE Network.
IEEE Trans. Broadcast., 2017
QoE-Driven Dynamic Adaptive Video Streaming Strategy With Future Information.
IEEE Trans. Broadcast., 2017
An effective CU size decision method for quality scalability in SHVC.
Multim. Tools Appl., 2017
Disparity Estimation Using Convolutional Neural Networks with Multi-scale Correlation.
Proceedings of the Neural Information Processing - 24th International Conference, 2017
Multi-resolution for disparity estimation with convolutional neural networks.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
2016
Depth Map Down-Sampling and Coding Based on Synthesized View Distortion.
IEEE Trans. Multim., 2016
Virtual-View-Assisted Video Super-Resolution and Enhancement.
IEEE Trans. Circuits Syst. Video Technol., 2016
Multiview video plus depth transmission via virtual-view-assisted complementary down/upsampling.
EURASIP J. Image Video Process., 2016
Packetization strategies for MVD-based 3D video transmission.
Proceedings of the 2016 Visual Communications and Image Processing, 2016
3D video super-resolution using fully convolutional neural networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016
2015
Scalable Bit Allocation Between Texture and Depth Views for 3-D Video Streaming Over Heterogeneous Networks.
IEEE Trans. Circuits Syst. Video Technol., 2015
Multiple Description Coding for Stereoscopic Videos With Stagger Frame Order.
IEEE Trans. Circuits Syst. Video Technol., 2015
Depth Map Coding Using Histogram-Based Segmentation and Depth Range Updating.
KSII Trans. Internet Inf. Syst., 2015
Erratum to: One-class kernel subspace ensemble for medical image classification.
EURASIP J. Adv. Signal Process., 2015
A Paradigm for Dynamic Adaptive Streaming over HTTP for Multi-view Video.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015
Depth-Based Stereoscopic Projection Approach for 3D Saliency Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015
A Flexible Programmable Camera Control and Data Acquisition Hardware Platform.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015
Global Motion Information Based Depth Map Sequence Coding.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015
3D video coding using motion information and depth map.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015
Statistical approach for motion estimation skipping (SAMEK).
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
Depth filter design by jointly utilizing spatial-temporal depth and texture information.
Proceedings of the 2015 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2015
2014
Macroblock Level Bits Allocation for Depth Maps in 3-D Video Coding.
J. Signal Process. Syst., 2014
Depth Map Driven Hole Filling Algorithm Exploiting Temporal Correlation Information.
IEEE Trans. Broadcast., 2014
Optimizing the deadzone width to improve the polyphase-based multiple description coding.
Multim. Tools Appl., 2014
Correlation based universal image/video coding loss recovery.
J. Vis. Commun. Image Represent., 2014
One-class kernel subspace ensemble for medical image classification.
EURASIP J. Adv. Signal Process., 2014
Dynamic redundancy allocation for video streaming using Sub-GOP based FEC code.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014
2013
Real-Time Video Streaming Using Randomized Expanding Reed-Solomon Code.
IEEE Trans. Circuits Syst. Video Technol., 2013
A Real-Time Error Resilient Video Streaming Scheme Exploiting the Late- and Early-Arrival Packets.
IEEE Trans. Broadcast., 2013
Novel Wireless Capsule Endoscopy Diagnosis System with Adaptive Image Capturing Rate.
Proceedings of the VISAPP 2013, 2013
3-D video depth map quantization based on Lloyd's algorithm.
Proceedings of the 11th IVMSP Workshop: 3D Image/Video Technologies and Applications, 2013
Multiple description video coding based on forward error correction within expanding windows.
Proceedings of the IEEE International Conference on Image Processing, 2013
2012
Dynamic Sub-GOP Forward Error Correction Code for Real-Time Video Applications.
IEEE Trans. Multim., 2012
Video Error Concealment of P-frame Using Packets of the Following Frames.
Proceedings of the Eighth International Conference on Signal Image Technology and Internet Based Systems, 2012
Real-time video streaming exploiting the late-arrival packets.
Proceedings of the 2012 Picture Coding Symposium, 2012
Real-Time Macroblock Level Bits Allocation for Depth Maps in 3-D Video Coding.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012
2011
Joint redundant motion vector and intra macroblock refreshment for video transmission.
EURASIP J. Image Video Process., 2011
Error-resilient video coding with end-to-end rate-distortion optimized at macroblock level.
EURASIP J. Adv. Signal Process., 2011
Real-time forward error correction for video transmission.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011