Yao Zhao

Orcid: 0000-0002-8581-9554

Affiliations:
  • Beijing Jiaotong University, Institute of Information Science, China (PhD 1996)


According to our database, Yao Zhao authored at least 717 papers between 1996 and 2025.

Bibliography

2025
PSVMA+: Exploring Multi-Granularity Semantic-Visual Adaption for Generalized Zero-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2025

SiGNN: A spike-induced graph neural network for dynamic graph representation learning.
Pattern Recognit., 2025

2024
360 Layout Estimation via Orthogonal Planes Disentanglement and Multi-View Geometric Consistency Perception.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Semi-Supervised Coupled Thin-Plate Spline Model for Rotation Correction and Beyond.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Multi-teacher Universal Distillation Based on Information Hiding for Defense Against Facial Manipulation.
Int. J. Comput. Vis., November, 2024

Camera calibration for the surround-view system: a benchmark and dataset.
Vis. Comput., October, 2024

Diagnostic analytics for a GARCH model under skew-normal distributions.
Commun. Stat. Simul. Comput., October, 2024

HKA: A Hierarchical Knowledge Alignment Framework for Multimodal Knowledge Graph Completion.
ACM Trans. Multim. Comput. Commun. Appl., August, 2024

PVASS-MDD: Predictive Visual-Audio Alignment Self-Supervision for Multimodal Deepfake Detection.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Partially View-Aligned Representation Learning via Cross-View Graph Contrastive Network.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Unified Multi-Modality Video Object Segmentation Using Reinforcement Learning.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Enhanced Video Super-Resolution Network towards Compressed Data.
ACM Trans. Multim. Comput. Commun. Appl., July, 2024

Multi-Projection Fusion and Refinement Network for Salient Object Detection in 360° Omnidirectional Image.
IEEE Trans. Neural Networks Learn. Syst., July, 2024

A Survey on Reversible Data Hiding for Uncompressed Images.
ACM Comput. Surv., July, 2024

PADVG: A Simple Baseline of Active Protection for Audio-Driven Video Generation.
ACM Trans. Multim. Comput. Commun. Appl., June, 2024

Seeing All From a Few: Nodes Selection Using Graph Pooling for Graph Clustering.
IEEE Trans. Neural Networks Learn. Syst., May, 2024

Exploring Large-Scale Financial Knowledge Graph for SMEs Supply Chain Mining.
IEEE Trans. Knowl. Data Eng., May, 2024

Adversarial Graph Disentanglement With Component-Specific Aggregation.
IEEE Trans. Artif. Intell., May, 2024

SGT++: Improved Scene Graph-Guided Transformer for Surgical Report Generation.
IEEE Trans. Medical Imaging, April, 2024

Exploring Resolution Fields for Scalable Image Compression With Uncertainty Guidance.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

MCL: Multimodal Contrastive Learning for Deepfake Detection.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

Graph meets probabilistic generation model: A new perspective for graph disentanglement.
Pattern Recognit., April, 2024

Interactive guidance network for object detection based on radar-camera fusion.
Multim. Tools Appl., March, 2024

Improving neural ordinary differential equations via knowledge distillation.
IET Comput. Vis., March, 2024

HGV4Risk: Hierarchical Global View-guided Sequence Representation Learning for Risk Prediction.
ACM Trans. Knowl. Discov. Data, January, 2024

Node-Oriented Spectral Filtering for Graph Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

Enhance Composed Image Retrieval via Multi-Level Collaborative Localization and Semantic Activeness Perception.
IEEE Trans. Multim., 2024

ATZSL: Defensive Zero-Shot Recognition in the Presence of Adversaries.
IEEE Trans. Multim., 2024

Narrowing Domain Gaps With Bridging Samples for Generalized Face Forgery Detection.
IEEE Trans. Multim., 2024

Each Performs Its Functions: Task Decomposition and Feature Assignment for Audio-Visual Segmentation.
IEEE Trans. Multim., 2024

Temporal Action Proposal Generation With Action Frequency Adaptive Network.
IEEE Trans. Multim., 2024

From Observation to Concept: A Flexible Multi-View Paradigm for Medical Report Generation.
IEEE Trans. Multim., 2024

Implicit-Explicit Motion Learning for Video Camouflaged Object Detection.
IEEE Trans. Multim., 2024

Reflection Intensity Guided Single Image Reflection Removal and Transmission Recovery.
IEEE Trans. Multim., 2024

Exploring the Applicability of Spectral Recovery in Semantic Segmentation of RGB Images.
IEEE Trans. Multim., 2024

DropQueries: A Simple Way to Discover Comprehensive Segment Representations.
IEEE Trans. Multim., 2024

Query-Guided Prototype Evolution Network for Few-Shot Segmentation.
IEEE Trans. Multim., 2024

Multimodal Composition Example Mining for Composed Query Image Retrieval.
IEEE Trans. Image Process., 2024

Learnable Feature Augmentation Framework for Temporal Action Localization.
IEEE Trans. Image Process., 2024

Toward Accurate Human Parsing Through Edge Guided Diffusion.
IEEE Trans. Image Process., 2024

Part-Object Progressive Refinement Network for Zero-Shot Learning.
IEEE Trans. Image Process., 2024

Cylin-Painting: Seamless 360° Panoramic Image Outpainting and Beyond.
IEEE Trans. Image Process., 2024

Asymptotic Consistent Graph Structure Learning for Multivariate Time-Series Anomaly Detection.
IEEE Trans. Instrum. Meas., 2024

Learning Hierarchical Color Guidance for Depth Map Super-Resolution.
IEEE Trans. Instrum. Meas., 2024

Perception-Oriented UAV Image Dehazing Based on Super-Pixel Scene Prior.
IEEE Trans. Geosci. Remote. Sens., 2024

RedCDR: Dual Relation Distillation for Cancer Drug Response Prediction.
IEEE ACM Trans. Comput. Biol. Bioinform., 2024

Graph Representation Learning Based on Specific Subgraphs for Biomedical Interaction Prediction.
IEEE ACM Trans. Comput. Biol. Bioinform., 2024

Training Superpixel Network Only Once.
IEEE Signal Process. Lett., 2024

Matrix Embedding Based Multiple Histograms Modification for Efficient Reversible Data Hiding.
IEEE Signal Process. Lett., 2024

Fast dominant feature selection with compensation for efficient image steganalysis.
Signal Process., 2024

IBVC: Interpolation-driven B-frame video compression.
Pattern Recognit., 2024

Semi-supervised cross-modal hashing with joint hyperboloid mapping.
Knowl. Based Syst., 2024

A keypoints-motion-based landmark transfer method for face reenactment.
J. Vis. Commun. Image Represent., 2024

Multimodal spatiotemporal aggregation for point cloud accumulation.
J. Vis. Commun. Image Represent., 2024

Str-L Pose: integrating point and structured line for relative pose estimation in dual graph.
J. Electronic Imaging, 2024

Joint Defocus Deblurring and Superresolution Learning Network for Autonomous Driving.
IEEE Intell. Transp. Syst. Mag., 2024

Unsupervised node representation learning of pure graph via symmetric cumulative sampling strategy.
Eng. Appl. Artif. Intell., 2024

Histogram shifting based reversible data hiding with multiple expansion bin pairs.
Displays, 2024

Digging into depth-adaptive structure for guided depth super-resolution.
Displays, 2024

ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks.
CoRR, 2024

SeeClear: Semantic Distillation Enhances Pixel Condensation for Video Super-Resolution.
CoRR, 2024

Collapsed Language Models Promote Fairness.
CoRR, 2024

C2P-CLIP: Injecting Category Common Prompt in CLIP to Enhance Generalization in Deepfake Detection.
CoRR, 2024

Instructing Prompt-to-Prompt Generation for Zero-Shot Learning.
CoRR, 2024

Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption.
CoRR, 2024

ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance.
CoRR, 2024

Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models.
CoRR, 2024

SGFormer: Spherical Geometry Transformer for 360 Depth Estimation.
CoRR, 2024

Learning Trimaps via Clicks for Image Matting.
CoRR, 2024

BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution.
CoRR, 2024

Frequency-Aware Deepfake Detection: Improving Generalizability through Frequency Space Learning.
CoRR, 2024

Data-Independent Operator: A Training-Free Artifact Representation Extractor for Generalizable Deepfake Detection.
CoRR, 2024

Eliminating Warping Shakes for Unsupervised Online Video Stitching.
CoRR, 2024

One for all: A novel Dual-space Co-training baseline for Large-scale Multi-View Clustering.
CoRR, 2024

DreamLCM: Towards High Quality Text-to-3D Generation via Latent Consistency Model.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Digging into Contrastive Learning for Robust Depth Estimation with Diffusion Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Segment Anything with Precise Interaction.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

FlexCare: Leveraging Cross-Task Synergy for Flexible Multimodal Healthcare Prediction.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

WeatherDepth: Curriculum Contrastive Learning for Self-Supervised Depth Estimation under Adverse Weather Conditions.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

TLVC: Temporal Bit-rate Allocation for Learned Video Compression.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Noisy-Residual Continuous Diffusion Models for Real Image Denoising.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Multi-granular Semantic Mining for Composed Image Retrieval.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Region-Native Visual Tokenization.
Proceedings of the Computer Vision - ECCV 2024, 2024

Eliminating Warping Shakes for Unsupervised Online Video Stitching.
Proceedings of the Computer Vision - ECCV 2024, 2024

Region-Adaptive Transform with Segmentation Prior for Image Compression.
Proceedings of the Computer Vision - ECCV 2024, 2024

Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Diffusion for Natural Image Matting.
Proceedings of the Computer Vision - ECCV 2024, 2024

Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Transferable and Principled Efficiency for Open-Vocabulary Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Rethinking the Up-Sampling Operations in CNN-Based Generative Network for Generalizable Deepfake Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PixelLM: Pixel Reasoning with Large Multimodal Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ChatVTG: Video Temporal Grounding via Chat with Video Dialogue Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Endow SAM with Keen Eyes: Temporal-Spatial Prompt Learning for Video Camouflaged Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Semantic Lens: Instance-Centric Semantic Alignment for Video Super-resolution.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Frequency-Aware Deepfake Detection: Improving Generalizability through Frequency Space Domain Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Lyapunov-Stable Deep Equilibrium Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

On the Unstable Convergence Regime of Gradient Descent.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
TCSD: Triple Complementary Streams Detector for Comprehensive Deepfake Detection.
ACM Trans. Multim. Comput. Commun. Appl., November, 2023

Latent Low-Rank Representation With Weighted Distance Penalty for Clustering.
IEEE Trans. Cybern., November, 2023

ℒ𝒪²net: Global-Local Semantics Coupled Network for scene-specific video foreground extraction with less supervision.
Pattern Anal. Appl., November, 2023

ESA: External Space Attention Aggregation for Image-Text Retrieval.
IEEE Trans. Circuits Syst. Video Technol., October, 2023

Fully and Weakly Supervised Referring Expression Segmentation With End-to-End Learning.
IEEE Trans. Circuits Syst. Video Technol., October, 2023

Magnifying multimodal forgery clues for Deepfake detection.
Signal Process. Image Commun., October, 2023

Camouflaged object detection based on context-aware and boundary refinement.
Appl. Intell., October, 2023

Sylvester Equation Induced Collaborative Representation Learning for Recommendation.
IEEE Trans. Knowl. Data Eng., September, 2023

MSVT: Multiple Spatiotemporal Views Transformer for DeepFake Video Detection.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Cooperative dual medical ontology representation learning for clinical assisted decision-making.
Comput. Biol. Medicine, September, 2023

As-Deformable-As-Possible Single-Image-Based View Synthesis Without Depth Prior.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Credible Dual-Expert Learning for Weakly Supervised Semantic Segmentation.
Int. J. Comput. Vis., August, 2023

Interactive Object Segmentation With Inside-Outside Guidance.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Learning a bi-directional discriminative representation for deep clustering.
Pattern Recognit., May, 2023

PSNet: Parallel Symmetric Network for Video Salient Object Detection.
IEEE Trans. Emerg. Top. Comput. Intell., April, 2023

Temporal Consistency Learning of Inter-Frames for Video Super-Resolution.
IEEE Trans. Circuits Syst. Video Technol., April, 2023

Artifacts-Disentangled Adversarial Learning for Deepfake Detection.
IEEE Trans. Circuits Syst. Video Technol., April, 2023

Monocular Pseudo-LiDAR Point Cloud Extrapolation Based on Iterative Hybrid Rendering.
IEEE Trans. Intell. Transp. Syst., March, 2023

Global-and-Local Collaborative Learning for Co-Salient Object Detection.
IEEE Trans. Cybern., March, 2023

Projection-preserving block-diagonal low-rank representation for subspace clustering.
Neurocomputing, March, 2023

A Novel Reversible Data Hiding Scheme Based on Pixel-Residual Histogram.
ACM Trans. Multim. Comput. Commun. Appl., February, 2023

A Weakly Supervised Learning Framework for Salient Object Detection via Hybrid Labels.
IEEE Trans. Circuits Syst. Video Technol., February, 2023

CED: A case-level explainable paramedical diagnosis via AdaGBDT.
Comput. Biol. Medicine, February, 2023

Fine-Grained Image Classification by Class and Image-Specific Decomposition With Multiple Views.
IEEE Trans. Multim., 2023

Augmented Multi-Scale Spatiotemporal Inconsistency Magnifier for Generalized DeepFake Detection.
IEEE Trans. Multim., 2023

Detection and Localization of Video Transcoding From AVC to HEVC Based on Deep Representations of Decoded Frames and PU Maps.
IEEE Trans. Multim., 2023

Reversible Data Hiding for JPEG Images With Adaptive Multiple Two-Dimensional Histogram and Mapping Generation.
IEEE Trans. Multim., 2023

General Framework to Reversible Data Hiding for JPEG Images With Multiple Two-Dimensional Histograms.
IEEE Trans. Multim., 2023

Consistent Multiple Graph Embedding for Multi-View Clustering.
IEEE Trans. Multim., 2023

Graph Contrastive Partial Multi-View Clustering.
IEEE Trans. Multim., 2023

Dual-Gradients Localization Framework With Skip-Layer Connections for Weakly Supervised Object Localization.
IEEE Trans. Multim., 2023

Starting Point Selection and Multiple-Standard Matching for Video Object Segmentation With Language Annotation.
IEEE Trans. Multim., 2023

Cycle-Free Weakly Referring Expression Grounding With Self-Paced Learning.
IEEE Trans. Multim., 2023

Semi-Supervised Knowledge Distillation for Cross-Modal Hashing.
IEEE Trans. Multim., 2023

Heterogeneous Feature Alignment and Fusion in Cross-Modal Augmented Space for Composed Image Retrieval.
IEEE Trans. Multim., 2023

Learning Detail-Structure Alternative Optimization for Blind Super-Resolution.
IEEE Trans. Multim., 2023

Auto-Weighted Layer Representation Based View Synthesis Distortion Estimation for 3-D Video Coding.
IEEE Trans. Multim., 2023

Does Thermal Really Always Matter for RGB-T Salient Object Detection?
IEEE Trans. Multim., 2023

Deep Rotation Correction Without Angle Prior.
IEEE Trans. Image Process., 2023

JNMR: Joint Non-Linear Motion Regression for Video Frame Interpolation.
IEEE Trans. Image Process., 2023

Collaborative Content-Dependent Modeling: A Return to the Roots of Salient Object Detection.
IEEE Trans. Image Process., 2023

Self-Supervised Surface Defect Localization via Joint De-Anomaly Reconstruction and Saliency-Guided Segmentation.
IEEE Trans. Instrum. Meas., 2023

CoI²A: Collaborative Inter-domain and Intra-domain Alignments for Multisource Domain Adaptation.
IEEE Trans. Geosci. Remote. Sens., 2023

Incomplete Multiview Clustering via Cross-View Relation Transfer.
IEEE Trans. Circuits Syst. Video Technol., 2023

Defending Fake via Warning: Universal Proactive Defense Against Face Manipulation.
IEEE Signal Process. Lett., 2023

MSPNet: Multi-stage progressive network for image denoising.
Neurocomputing, 2023

4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency.
CoRR, 2023

Camera calibration for the surround-view system: a benchmark and dataset.
CoRR, 2023

Unleashing the potential of GNNs via Bi-directional Knowledge Transfer.
CoRR, 2023

WeatherDepth: Curriculum Contrastive Learning for Self-Supervised Depth Estimation under Adverse Weather Conditions.
CoRR, 2023

Unified Frequency-Assisted Transformer Framework for Detecting and Grounding Multi-Modal Manipulation.
CoRR, 2023

You Can Mask More For Extremely Low-Bitrate Image Compression.
CoRR, 2023

NPVForensics: Jointing Non-critical Phonemes and Visemes for Deepfake Detection.
CoRR, 2023

Learning Robust Deep Equilibrium Models.
CoRR, 2023

Deep Learning for Camera Calibration and Beyond: A Survey.
CoRR, 2023

Learning Thin-Plate Spline Motion and Seamless Composition for Parallax-Tolerant Unsupervised Deep Image Stitching.
CoRR, 2023

Dual Diffusion Architecture for Fisheye Image Rectification: Synthetic-to-Real Generalization.
CoRR, 2023

Complementary Bi-directional Feature Compression for Indoor 360° Semantic Segmentation with Self-distillation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Mask-aware CLIP Representations for Zero-Shot Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Rethinking Parking Slot Detection with Rotated Bounding Box.
Proceedings of the ACM Multimedia Asia 2023, 2023

CLE Diffusion: Controllable Light Enhancement Diffusion Model.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Kernel Dimension Matters: To Activate Available Kernels for Real-time Video Super-Resolution.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Frequency Perception Network for Camouflaged Object Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

SDDNet: Style-guided Dual-layer Disentanglement Network for Shadow Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

S-OmniMVS: Incorporating Sphere Geometry into Omnidirectional Stereo Matching.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Learning Joint Relational Co-evolution in Spatial-Temporal Knowledge Graph for SMEs Supply Chain Prediction.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Unsupervised OmniMVS: Efficient Omnidirectional Depth Inference via Establishing Pseudo-Stereo Supervision.
IROS, 2023

CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Innovating Real Fisheye Image Correction with Dual Diffusion Architecture.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Parallax-Tolerant Unsupervised Deep Image Stitching.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

RecRecNet: Rectangling Rectified Wide-Angle Images by Thin-Plate Spline Model and DoF-based Curriculum Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Global Knowledge Calibration for Fast Open-Vocabulary Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Locating Noise is Halfway Denoising for Semi-Supervised Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Fourier-Based Instance Selective Whitening for Domain Generalized Lane Detection.
Proceedings of the Computer Science and Education. Computer Science and Technology, 2023

Towards Reliable Image Outpainting: Learning Structure-Aware Multimodal Fusion with Depth Guidance.
Proceedings of the IEEE International Conference on Acoustics, 2023

SIGVIC: Spatial Importance Guided Variable-Rate Image Compression.
Proceedings of the IEEE International Conference on Acoustics, 2023

Learning on Gradients: Generalized Artifacts Representation for GAN-Generated Images Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Disentangling Orthogonal Planes for Indoor Panoramic Room Layout Estimation with Cross-Scale Distortion Awareness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning to Segment Every Referring Object Point by Point.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Spatiotemporal Deformation Perception for Fisheye Video Rectification.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Detection of AI-Manipulated Fake Faces via Mining Generalized Features.
ACM Trans. Multim. Comput. Commun. Appl., 2022

SRDRL: A Blind Super-Resolution Framework With Degradation Reconstruction Loss.
IEEE Trans. Multim., 2022

Multi-Modal Graph Learning for Disease Prediction.
IEEE Trans. Medical Imaging, 2022

Pseudo-LiDAR Point Cloud Interpolation Based on 3D Motion Representation and Spatial Supervision.
IEEE Trans. Intell. Transp. Syst., 2022

Margin Preserving Self-Paced Contrastive Learning Towards Domain Adaptation for Medical Image Segmentation.
IEEE J. Biomed. Health Informatics, 2022

Composed Image Retrieval via Explicit Erasure and Replenishment With Semantic Alignment.
IEEE Trans. Image Process., 2022

Exemplar-Based, Semantic Guided Zero-Shot Visual Recognition.
IEEE Trans. Image Process., 2022

Cross-Part Learning for Fine-Grained Image Classification.
IEEE Trans. Image Process., 2022

CIR-Net: Cross-Modality Interaction and Refinement for RGB-D Salient Object Detection.
IEEE Trans. Image Process., 2022

BCS-Net: Boundary, Context, and Semantic for Automatic COVID-19 Lung Infection Segmentation From CT Images.
IEEE Trans. Instrum. Meas., 2022

RRNet: Relational Reasoning Network With Parallel Multiscale Attention for Salient Object Detection in Optical Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2022

Revisiting Radial Distortion Rectification in Polar-Coordinates: A New and Efficient Learning Perspective.
IEEE Trans. Circuits Syst. Video Technol., 2022

General Expansion-Shifting Model for Reversible Data Hiding: Theoretical Investigation and Practical Algorithm Design.
IEEE Trans. Circuits Syst. Video Technol., 2022

Multi-Source Aggregation Transformer for Concealed Object Detection in Millimeter-Wave Images.
IEEE Trans. Circuits Syst. Video Technol., 2022

Neural Contourlet Network for Monocular 360° Depth Estimation.
IEEE Trans. Circuits Syst. Video Technol., 2022

Detection of Double JPEG Compression With the Same Quantization Matrix via Convergence Analysis.
IEEE Trans. Circuits Syst. Video Technol., 2022

Depth-Aware Multi-Grid Deep Homography Estimation With Contextual Correlation.
IEEE Trans. Circuits Syst. Video Technol., 2022

MFHI: Taking Modality-Free Human Identification as Zero-Shot Learning.
IEEE Trans. Circuits Syst. Video Technol., 2022

Transformer-Based Language-Person Search With Multiple Region Slicing.
IEEE Trans. Circuits Syst. Video Technol., 2022

Quantization Step Estimation for JPEG Image Forensics.
IEEE Trans. Circuits Syst. Video Technol., 2022

Just Noticeable Difference for Deep Machine Vision.
IEEE Trans. Circuits Syst. Video Technol., 2022

Reversible Data Hiding for Color Images Based on Adaptive Three-Dimensional Histogram Modification.
IEEE Trans. Circuits Syst. Video Technol., 2022

SRInpaintor: When Super-Resolution Meets Transformer for Image Inpainting.
IEEE Trans. Computational Imaging, 2022

Boundary Guided Semantic Learning for Real-Time COVID-19 Lung Infection Segmentation System.
IEEE Trans. Consumer Electron., 2022

General Distortion Based Reversible Data Hiding for Binary Covers.
IEEE Signal Process. Lett., 2022

ET: Edge-Enhanced Transformer for Image Splicing Detection.
IEEE Signal Process. Lett., 2022

Fast Expansion-Bins-Determination for Multiple Histograms Modification Based Reversible Data Hiding.
IEEE Signal Process. Lett., 2022

Unsupervised domain adaptation in homogeneous distance space for person re-identification.
Pattern Recognit., 2022

Soft pseudo-Label shrinkage for unsupervised domain adaptive person re-identification.
Pattern Recognit., 2022

End-to-end weakly supervised semantic segmentation with reliable region mining.
Pattern Recognit., 2022

Multi-level augmented inpainting network using spatial similarity.
Pattern Recognit., 2022

Gradient-based refined class activation map for weakly supervised object localization.
Pattern Recognit., 2022

Affinity Attention Graph Neural Network for Weakly Supervised Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

LMDC: Learning a multiple description codec for deep learning-based image compression.
Multim. Tools Appl., 2022

Adaptive feature fusion network based on boosted attention mechanism for single image dehazing.
Multim. Tools Appl., 2022

Future pseudo-LiDAR frame prediction for autonomous driving.
Multim. Syst., 2022

Auto-weighted low-rank representation for clustering.
Knowl. Based Syst., 2022

Bi-projection for 360° image object detection bridged by RoI Searcher.
J. Vis. Commun. Image Represent., 2022

Contour enhanced image super-resolution.
J. Vis. Commun. Image Represent., 2022

Learning edge-preserved image stitching from multi-scale deep homography.
Neurocomputing, 2022

MFAN: Multi-Level Features Attention Network for Fake Certificate Image Detection.
Entropy, 2022

Bridging Component Learning with Degradation Modelling for Blind Image Super-Resolution.
CoRR, 2022

Cross-view Graph Contrastive Representation Learning on Partially Aligned Multi-view Data.
CoRR, 2022

HVS-Inspired Signal Degradation Network for Just Noticeable Difference Estimation.
CoRR, 2022

FishFormer: Annulus Slicing-based Transformer for Fisheye Rectification with Efficacy Domain Exploration.
CoRR, 2022

FisheyeEX: Polar Outpainting for Extending the FoV of Fisheye Lens.
CoRR, 2022

Cylin-Painting: Seamless 360° Panoramic Image Outpainting and Beyond with Cylinder-Style Convolutions.
CoRR, 2022

PanoFormer: Panorama Transformer for Indoor 360° Depth Estimation.
CoRR, 2022

Improving Neural ODEs via Knowledge Distillation.
CoRR, 2022

ACTIVE: Augmentation-Free Graph Contrastive Learning for Partial Multi-View Clustering.
CoRR, 2022

SivsFormer: Parallax-Aware Transformers for Single-image-based View Synthesis.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2022

Weakly Supervised Object Localization with Noisy-Label Learning.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Information Adversarial Disentanglement for Face Swapping.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Mask Matching Transformer for Few-Shot Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A new robust watermarking algorithm based on intra-frame difference.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

One-step Low-Rank Representation for Clustering.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Attention-Enhanced Disentangled Representation Learning for Unsupervised Domain Adaptation in Cardiac Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

SGT: Scene Graph-Guided Transformer for Surgical Report Generation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Exploring Complementarity of Global and Local Spatiotemporal Information for Fake Face Video Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

PanoFormer: Panorama Transformer for Indoor 360° Depth Estimation.
Proceedings of the Computer Vision - ECCV 2022, 2022

SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding.
Proceedings of the Computer Vision - ECCV 2022, 2022

Slim Scissors: Segmenting Thin Object from Synthetic Background.
Proceedings of the Computer Vision - ECCV 2022, 2022

Deep Rectangling for Image Stitching: A Learning Baseline.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Blind Image Clustering for Camera Source Identification via Row-Sparsity Optimization.
IEEE Trans. Multim., 2021

Graph Learning Based Head Movement Prediction for Interactive 360 Video Streaming.
IEEE Trans. Image Process., 2021

Dense Attention Fluid Network for Salient Object Detection in Optical Remote Sensing Images.
IEEE Trans. Image Process., 2021

Unsupervised Deep Image Stitching: Reconstructing Stitched Features to Images.
IEEE Trans. Image Process., 2021

Spatial-Aware Texture Transformer for High-Fidelity Garment Transfer.
IEEE Trans. Image Process., 2021

A Deep Ordinal Distortion Estimation Approach for Distortion Rectification.
IEEE Trans. Image Process., 2021

Image Splicing Detection, Localization and Attribution via JPEG Primary Quantization Matrix Estimation and Clustering.
IEEE Trans. Inf. Forensics Secur., 2021

To See in the Dark: N2DGAN for Background Modeling in Nighttime Scene.
IEEE Trans. Circuits Syst. Video Technol., 2021

Efficient Reversible Data Hiding for JPEG Images With Multiple Histograms Modification.
IEEE Trans. Circuits Syst. Video Technol., 2021

Adaptive Pairwise Prediction-Error Expansion and Multiple Histograms Modification for Reversible Data Hiding.
IEEE Trans. Circuits Syst. Video Technol., 2021

Reversible Data Hiding for JPEG Images Based on Multiple Two-Dimensional Histograms.
IEEE Signal Process. Lett., 2021

Fast pixel-matching for video object segmentation.
Signal Process. Image Commun., 2021

A reference-free underwater image quality assessment metric in frequency domain.
Signal Process. Image Commun., 2021

PVO-based reversible data hiding using adaptive multiple histogram generation and modification.
Signal Process. Image Commun., 2021

Multi-semantic region weighting and multi-scale flatness weighting based image retrieval.
Soft Comput., 2021

Anti-Forensics of Image Contrast Enhancement Based on Generative Adversarial Network.
Secur. Commun. Networks, 2021

DQN-based gradual fisheye image rectification.
Pattern Recognit. Lett., 2021

Progressive sample mining and representation learning for one-shot person re-identification.
Pattern Recognit., 2021

A hierarchical weighted low-rank representation for image clustering and classification.
Pattern Recognit., 2021

Occluded and tiny face detection network for dense crowd.
J. Inf. Hiding Multim. Signal Process., 2021

Dual Attention Based Image Pyramid Network for Object Detection.
KSII Trans. Internet Inf. Syst., 2021

Joint distortion rectification and super-resolution for self-driving scene perception.
Neurocomputing, 2021

Pseudo-LiDAR point cloud magnification.
Neurocomputing, 2021

A new blind image denoising method based on asymmetric generative adversarial network.
IET Image Process., 2021

Multi-scale attention network for image inpainting.
Comput. Vis. Image Underst., 2021

FPPN: Future Pseudo-LiDAR Frame Prediction for Autonomous Driving.
CoRR, 2021

Incomplete Multi-view Clustering via Cross-view Relation Transfer.
CoRR, 2021

RRNet: Relational Reasoning Network with Parallel Multi-scale Attention for Salient Object Detection in Optical Remote Sensing Images.
CoRR, 2021

Dynamic Feature Regularized Loss for Weakly Supervised Semantic Segmentation.
CoRR, 2021

Multi-modal Graph Learning for Disease Prediction.
CoRR, 2021

Adversarial Graph Disentanglement.
CoRR, 2021

Does deep machine vision have just noticeable difference (JND)?
CoRR, 2021

Deep learning based torsional nystagmus detection for dizziness and vertigo diagnosis.
Biomed. Signal Process. Control., 2021

Image Outpainting with Depth Assistance.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

Attention Guided Spatio-Temporal Artifacts Extraction for Deepfake Detection.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

Slow Video Detection Based on Spatial-Temporal Feature Representation.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

GGRNet: Global Graph Reasoning Network for Salient Object Detection in Optical Remote Sensing Images.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

A Novel Method of Cropped Images Forensics in Social Networks.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

An End-to-End Mutual Enhancement Network Toward Image Compression and Semantic Segmentation.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

ODE-Inspired Image Denoiser: An End-to-End Dynamical Denoising Network.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

Towards Transferable 3D Adversarial Attack.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Heterogeneous Feature Fusion and Cross-modal Alignment for Composed Image Retrieval.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Cross-modality Discrepant Interaction Network for RGB-D Salient Object Detection.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Enhancing Adversarial Examples Transferability via Ensemble Feature Manifolds.
Proceedings of the ADVM '21: Proceedings of the 1st International Workshop on Adversarial Learning for Multimedia, 2021

BridgeNet: A Joint Learning Network of Depth Map Super-Resolution and Monocular Depth Estimation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Ddan: A Deep Dual Attention Network For Video Super-Resolution.
Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021

Distortion-Tolerant Monocular Depth Estimation on Omnidirectional Images Using Dual-Cubemap.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Adversarial Attack on Fake-Faces Detectors Under White and Black Box Scenarios.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Towards Complete Scene and Regular Shape for Distortion Rectification by Curve-Aware Extrapolation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multi-Level Curriculum for Training A Distortion-Aware Barrel Distortion Rectification Model.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Color-indoor: Incorporating Depth into Room Decoration Visualization.
Proceedings of the ICAIIS 2021: 2021 2nd International Conference on Artificial Intelligence and Information Systems, Chongqing, China, May 28, 2021

Progressively Complementary Network for Fisheye Image Rectification Using Appearance Flow.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Towards Fast and Accurate Real-World Depth Super-Resolution: Benchmark Dataset and Baseline.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Double Low-Rank Representation With Projection Distance Penalty for Clustering.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Coarse-to-Fine Learning Framework for Semi-supervised Multimodal MRI Synthesis.
Proceedings of the Pattern Recognition - 6th Asian Conference, 2021

GradingNet: Towards Providing Reliable Supervisions for Weakly Supervised Object Detection by Grading the Box Candidates.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Parameter Distribution Balanced CNNs.
IEEE Trans. Neural Networks Learn. Syst., 2020

Hierarchical Prototype Learning for Zero-Shot Recognition.
IEEE Trans. Multim., 2020

Referring Image Segmentation by Generative Adversarial Learning.
IEEE Trans. Multim., 2020

Rich Features Embedding for Cross-Modal Retrieval: A Simple Baseline.
IEEE Trans. Multim., 2020

Advancing Image Understanding in Poor Visibility Environments: A Collective Benchmark Study.
IEEE Trans. Image Process., 2020

Model-Free Distortion Rectification Framework Bridged by Distortion Distribution Map.
IEEE Trans. Image Process., 2020

Learning a Deep Dual Attention Network for Video Super-Resolution.
IEEE Trans. Image Process., 2020

Canonical Correlation Analysis With L2,1-Norm for Multiview Data Representation.
IEEE Trans. Cybern., 2020

Correlation Filter Selection for Visual Tracking Using Reinforcement Learning.
IEEE Trans. Circuits Syst. Video Technol., 2020

High Capacity Reversible Data Hiding Based on Multiple Histograms Modification.
IEEE Trans. Circuits Syst. Video Technol., 2020

Distortion Rectification From Static to Dynamic: A Distortion Sequence Construction Perspective.
IEEE Trans. Circuits Syst. Video Technol., 2020

DR-GAN: Automatic Radial Distortion Rectification Using Conditional GAN in Real-Time.
IEEE Trans. Circuits Syst. Video Technol., 2020

FilterNet: Adaptive Information Filtering Network for Accurate and Fast Image Super-Resolution.
IEEE Trans. Circuits Syst. Video Technol., 2020

Pixel-Level View Synthesis Distortion Estimation for 3D Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2020

Primary Quantization Matrix Estimation of Double Compressed JPEG Images via CNN.
IEEE Signal Process. Lett., 2020

Segmentation mask guided end-to-end person search.
Signal Process. Image Commun., 2020

Detection of fake high definition for HEVC videos based on prediction mode feature.
Signal Process., 2020

Improved PPVO-based high-fidelity reversible data hiding.
Signal Process., 2020

PLIN: A Network for Pseudo-LiDAR Point Cloud Interpolation.
Sensors, 2020

Preface.
Pattern Recognit. Lett., 2020

Double compression detection for H.264 videos with adaptive GOP structure.
Multim. Tools Appl., 2020

Effective attention-based network for syndrome differentiation of AIDS.
BMC Medical Informatics Decis. Mak., 2020

Unsupervised fisheye image correction through bidirectional loss with geometric prior.
J. Vis. Commun. Image Represent., 2020

A view-free image stitching network based on global homography.
J. Vis. Commun. Image Represent., 2020

Efficient Video Integrity Analysis Through Container Characterization.
IEEE J. Sel. Top. Signal Process., 2020

OIDC-Net: Omnidirectional Image Distortion Correction via Coarse-to-Fine Region Attention.
IEEE J. Sel. Top. Signal Process., 2020

A Survey of Deep Learning-Based Source Image Forensics.
J. Imaging, 2020

Convolutional prototype learning for zero-shot recognition.
Image Vis. Comput., 2020

Adaptive Importance Channel Selection for Perceptual Image Compression.
KSII Trans. Internet Inf. Syst., 2020

A new Ensemble Clustering Algorithm using a Reconstructed Mapping Coefficient.
KSII Trans. Internet Inf. Syst., 2020

ProLFA: Representative prototype selection for local feature aggregation.
Neurocomputing, 2020

Face inpainting network for large missing regions based on weighted facial similarity.
Neurocomputing, 2020

A parallel down-up fusion network for salient object detection in optical remote sensing images.
Neurocomputing, 2020

Joint learning based deep supervised hashing for large-scale image retrieval.
Neurocomputing, 2020

IET Image Processing.
IET Image Process., 2020

Learning Edge-Preserved Image Stitching from Large-Baseline Deep Homography.
CoRR, 2020

Towards Natural Robustness Against Adversarial Examples.
CoRR, 2020

Learning Deep Interleaved Networks with Asymmetric Co-Attention for Image Restoration.
CoRR, 2020

Mining Generalized Features for Detecting AI-Manipulated Fake Faces.
CoRR, 2020

LID 2020: The Learning from Imperfect Data Challenge Results.
CoRR, 2020

Taking Modality-free Human Identification as Zero-shot Learning.
CoRR, 2020

From Anchor Generation to Distribution Alignment: Learning a Discriminative Embedding Space for Zero-Shot Recognition.
CoRR, 2020

Deep Optimized Multiple Description Image Coding via Scalar Quantization Learning.
CoRR, 2020

Concurrently Extrapolating and Interpolating Networks for Continuous Model Generation.
CoRR, 2020

CCAE: Cross-field categorical attributes embedding for cancer clinical endpoint prediction.
Artif. Intell. Medicine, 2020

Panoramic Image Quality-Enhancement by Fusing Neural Textures of the Adaptive Initial Viewport.
Proceedings of the 2020 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2020

CoADNet: Collaborative Aggregation-and-Distribution Networks for Co-Salient Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Dual-Gradients Localization Framework for Weakly Supervised Object Localization.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Defocused Image Splicing Localization by Distinguishing Multiple Cues between Raw Naturally Blur and Artificial Blur.
Proceedings of the Digital Forensics and Watermarking - 19th International Workshop, 2020

GDS: Global Description Guided Down-Sampling for 3D Point Cloud Classification.
Proceedings of the ICVISP 2020: 4th International Conference on Vision, 2020

Primary Quality Factor Estimation Of Resized Double Compressed JPEG Images.
Proceedings of the IEEE International Conference on Image Processing, 2020

Distribution-Induced Bidirectional Generative Adversarial Network for Graph Representation Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Interactive Object Segmentation With Inside-Outside Guidance.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Fast Template Matching and Update for Video Object Tracking and Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Modality-Invariant Image-Text Embedding for Image-Sentence Matching.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Seeing All From a Few: ℓ1-Norm-Induced Discriminative Prototype Selection.
IEEE Trans. Neural Networks Learn. Syst., 2019

Saliency Inside: Learning Attentive CNNs for Content-Based Image Retrieval.
IEEE Trans. Image Process., 2019

Rearranging Online Tubes for Streaming Video Synopsis: A Dynamic Graph Coloring Approach.
IEEE Trans. Image Process., 2019

Magic-Wall: Visualizing Room Decoration by Enhanced Wall Segmentation.
IEEE Trans. Image Process., 2019

Secure Detection of Image Manipulation by Means of Random Feature Selection.
IEEE Trans. Inf. Forensics Secur., 2019

Concealed Object Detection for Activate Millimeter Wave Image.
IEEE Trans. Ind. Electron., 2019

Multiple Description Convolutional Neural Networks for Image Compression.
IEEE Trans. Circuits Syst. Video Technol., 2019

Adaptive Streaming in Interactive Multiview Video Systems.
IEEE Trans. Circuits Syst. Video Technol., 2019

Improving Pairwise PEE via Hybrid-Dimensional Histogram Generation and Adaptive Mapping Selection.
IEEE Trans. Circuits Syst. Video Technol., 2019

Robust Plane Detection Using Depth Information From a Consumer Depth Camera.
IEEE Trans. Circuits Syst. Video Technol., 2019

A Depth-Bin-Based Graphical Model for Fast View Synthesis Distortion Estimation.
IEEE Trans. Circuits Syst. Video Technol., 2019

Video Streaming Adaptation Strategy for Multiview Navigation Over DASH.
IEEE Trans. Broadcast., 2019

Edge Heuristic GAN for Non-Uniform Blind Deblurring.
IEEE Signal Process. Lett., 2019

An enhanced approach for detecting double JPEG compression with the same quantization matrix.
Signal Process. Image Commun., 2019

Local activity-driven structural-preserving filtering for noise removal and image smoothing.
Signal Process., 2019

Reversible data hiding based on pairwise embedding and optimal expansion path.
Signal Process., 2019

Source camera identification based on content-adaptive fusion residual networks.
Pattern Recognit. Lett., 2019

Simultaneous color-depth super-resolution with conditional generative adversarial networks.
Pattern Recognit., 2019

From Night to Day: GANs Based Low Quality Image Enhancement.
Neural Process. Lett., 2019

Iterative range-domain weighted filter for structural preserving image smoothing and de-noising.
Multim. Tools Appl., 2019

Towards learning a semantic-consistent subspace for cross-modal retrieval.
Multim. Tools Appl., 2019

Robust median filtering detection based on the difference of frequency residuals.
Multim. Tools Appl., 2019

EA-LSTM: Evolutionary attention-based LSTM for time series prediction.
Knowl. Based Syst., 2019

No-reference quality assessment for contrast-distorted images based on multifaceted statistical representation of structure.
J. Vis. Commun. Image Represent., 2019

Learning a virtual codec based on deep convolutional neural network to compress image.
J. Vis. Commun. Image Represent., 2019

An approach to detect video frame deletion under anti-forensics.
J. Real Time Image Process., 2019

Improved reversible data hiding based on PVO and adaptive pairwise embedding.
J. Real Time Image Process., 2019

Stage-GAN with Semantic Maps for Large-scale Image Super-resolution.
KSII Trans. Internet Inf. Syst., 2019

Improving image similarity estimation via global distance distribution information.
Neurocomputing, 2019

Diffusion induced graph representation learning.
Neurocomputing, 2019

Detail-preserving image super-resolution via recursively dilated residual network.
Neurocomputing, 2019

Adversarial task-specific learning.
Neurocomputing, 2019

A Blind Print-Recapture Robust Watermark Scheme by Calculating Self-Convolution.
Int. J. Digit. Crime Forensics, 2019

Progressive Sample Mining and Representation Learning for One-Shot Person Re-identification with Adversarial Samples.
CoRR, 2019

ProLFA: Representative Prototype Selection for Local Feature Aggregation.
CoRR, 2019

ATZSL: Defensive Zero-Shot Recognition in the Presence of Adversaries.
CoRR, 2019

Dual-Domain Fusion Convolutional Neural Network for Contrast Enhancement Forensics.
CoRR, 2019

PLIN: A Network for Pseudo-LiDAR Point Cloud Interpolation.
CoRR, 2019

A New JPEG Image Watermarking Method Exploiting Spatial JND Model.
Proceedings of the Digital Forensics and Watermarking - 18th International Workshop, 2019

An Attention Bi-box Regression Network for Traffic Light Detection.
Proceedings of the Intelligence Science and Big Data Engineering. Visual Data Engineering, 2019

Face Verification Between ID Document Photos and Partial Occluded Spot Photos.
Proceedings of the Image and Graphics - 10th International Conference, 2019

Block Partitioning Decision Based on Content Complexity for Future Video Coding.
Proceedings of the Image and Graphics - 10th International Conference, 2019

Semantic Map Based Image Compression via Conditional Generative Adversarial Network.
Proceedings of the Image and Graphics - 10th International Conference, 2019

Detection and Localization of Video Object Removal by Spatio-Temporal LBP Coherence Analysis.
Proceedings of the Image and Graphics - 10th International Conference, 2019

A Visual Perspective for User Identification Based on Camera Fingerprint.
Proceedings of the Image and Graphics - 10th International Conference, 2019

Deep Multiple Description Coding by Learning Scalar Quantization.
Proceedings of the Data Compression Conference, 2019

Rate Control Algorithm in HEVC Based on Scene-Change Detection.
Proceedings of the Data Compression Conference, 2019

Improving Cube-to-ERP Conversion Performance with Geometry Features of 360 Video Structure.
Proceedings of the Data Compression Conference, 2019

A Lightweight and Robust Face Recognition Network on Noisy Condition.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Devil in the Details: Towards Accurate Single and Multiple Human Parsing.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Learning Heterogeneous Spatial-Temporal Representation for Bike-Sharing Demand Prediction.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Region-Based Multiple Description Coding for Multiview Video Plus Depth Video.
IEEE Trans. Multim., 2018

Multi-View Missing Data Completion.
IEEE Trans. Knowl. Data Eng., 2018

3-D Surround View for Advanced Driver Assistance Systems.
IEEE Trans. Intell. Transp. Syst., 2018

Learning a General Assignment Model for Video Analytics.
IEEE Trans. Circuits Syst. Video Technol., 2018

A packetization strategy for interactive multiview video streaming over lossy networks.
Signal Process., 2018

A New Dataset for Source Identification of High Dynamic Range Images.
Sensors, 2018

Geometric Features-Based Parking Slot Detection.
Sensors, 2018

Nonoverlapping Blocks Based Copy-Move Forgery Detection.
Secur. Commun. Networks, 2018

Enhancing heterogeneous similarity estimation via neighborhood reversibility.
Multim. Tools Appl., 2018

Indexing of the CNN features for the large scale image search.
Multim. Tools Appl., 2018

Sparsity induced prototype learning via ℓp,1-norm grouping.
J. Vis. Commun. Image Represent., 2018

Median filtering detection of small-size image based on CNN.
J. Vis. Commun. Image Represent., 2018

Enhance Neighbor Reversibility in Subspace Learning for Image Retrieval.
IEEE J. Sel. Top. Signal Process., 2018

A new clustering algorithm based on the connected region generation.
KSII Trans. Internet Inf. Syst., 2018

Subspace learning by kernel dependence maximization for cross-modal retrieval.
Neurocomputing, 2018

Copy-Move Forgery Localization Using Convolutional Neural Networks and CFA Features.
Int. J. Digit. Crime Forensics, 2018

Security Consideration for Deep Learning-Based Image Forensics.
IEICE Trans. Inf. Syst., 2018

Standard-Compliant Multiple Description Image Coding Based on Convolutional Neural Networks.
IEICE Trans. Inf. Syst., 2018

Multiview Cross-Media Hashing with Semantic Consistency.
IEEE Multim., 2018

Cascaded reconstruction network for compressive image sensing.
EURASIP J. Image Video Process., 2018

EA-LSTM: Evolutionary Attention-based LSTM for Time Series Prediction.
CoRR, 2018

Devil in the Details: Towards Accurate Single and Multiple Human Parsing.
CoRR, 2018

Virtual Codec Supervised Re-Sampling Network for Image Compression.
CoRR, 2018

Robust Contrast Enhancement Forensics Using Convolutional Neural Networks.
CoRR, 2018

Mixed-Resolution Image Representation and Compression with Convolutional Neural Networks.
CoRR, 2018

Dual-Streams Edge Driven Encoder-Decoder Network for Image Super-Resolution.
IEEE Access, 2018

An Automatic Clustering Algorithm Based on Region Segmentation.
IEEE Access, 2018

Convolutional Neural Network Based Inter-Frame Enhancement for 360-Degree Video Streaming.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Cycle GAN-Based Attack on Recaptured Images to Fool both Human and Machine.
Proceedings of the Digital Forensics and Watermarking - 17th International Workshop, 2018

Self-Supervised Deep Low-Rank Assignment Model for Prototype Selection.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Non-Local Graph-Based Prediction for Reversible Data Hiding in Images.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Image Reconstruction from Patch Compressive Sensing Measurements.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Optical Flow-Guided Multi-Scale Dense Network for Frame Interpolation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Double JPEG Compression Detection by Exploring the Correlations in DCT Domain.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Cross-Modal Retrieval With CNN Visual Features: A New Baseline.
IEEE Trans. Cybern., 2017

Two-stage filtering of compressed depth images with Markov Random Field.
Signal Process. Image Commun., 2017

Robust median filtering detection based on local difference descriptor.
Signal Process. Image Commun., 2017

Detection of operation chain: JPEG-Resampling-JPEG.
Signal Process. Image Commun., 2017

STC: A Simple to Complex Framework for Weakly-Supervised Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Cross-media analysis and reasoning: advances and directions.
Frontiers Inf. Technol. Electron. Eng., 2017

3D saliency detection based on background detection.
J. Vis. Commun. Image Represent., 2017

A 3D Mesh Watermarking Based on Improved Vertex Grouping and Piecewise Mapping Function.
J. Inf. Hiding Multim. Signal Process., 2017

Quality Enhancement of Compressed Video via CNNs.
J. Inf. Hiding Multim. Signal Process., 2017

Depth map upsampling using joint edge-guided convolutional neural network for virtual view synthesizing.
J. Electronic Imaging, 2017

3D View Synthesis with Feature-Based Warping.
KSII Trans. Internet Inf. Syst., 2017

Depth map up-sampling with fractal dimension and texture-depth boundary consistencies.
Neurocomputing, 2017

Fast Intra Coding Algorithm for HEVC Based on Decision Tree.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017

Simultaneously Color-Depth Super-Resolution with Conditional Generative Adversarial Network.
CoRR, 2017

Local Activity-tuned Image Filtering for Noise Removal and Image Smoothing.
CoRR, 2017

Source Camera Identification Based On Content-Adaptive Fusion Network.
CoRR, 2017

A New Evaluation Protocol and Benchmarking Results for Extendable Cross-media Retrieval.
CoRR, 2017

A gradient-based pixel-domain attack against SVM detection of global image manipulations.
Proceedings of the 2017 IEEE Workshop on Information Forensics and Security, 2017

Two-stream Attentive CNNs for Image Retrieval.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Magic-wall: Visualizing Room Decoration.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Finding the Secret of CNN Parameter Layout under Strict Size Constraint.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Recaptured Image Forensics Based on Quality Aware and Histogram Feature.
Proceedings of the Digital Forensics and Watermarking - 16th International Workshop, 2017

Robust Zero Watermarking for 3D Triangular Mesh Models Based on Spherical Integral Invariants.
Proceedings of the Digital Forensics and Watermarking - 16th International Workshop, 2017

Adaptive Multiple Description Depth Image Coding Based on Wavelet Sub-band Coefficients.
Proceedings of the Advances in Intelligent Information Hiding and Multimedia Signal Processing, 2017

Single depth image super-resolution with multiple residual dictionary learning and refinement.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Enhanced isomorphic semantic representation for cross-media retrieval.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

De-biased dart ensemble model for personalized recommendation.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Convolutional neural network-based depth image artifact removal.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

A Vehicle-Mounted Multi-camera 3D Panoramic Imaging Algorithm Based on Ship-Shaped Model.
Proceedings of the Image and Graphics - 9th International Conference, 2017

JPEG Photo Privacy-Preserving Algorithm Based on Sparse Representation and Data Hiding.
Proceedings of the Image and Graphics - 9th International Conference, 2017

Warping and Blending Enhancement for 3D View Synthesis Based on Grid Deformation.
Proceedings of the Image and Graphics - 9th International Conference, 2017

Optimized receiver control in interactive multiview video streaming systems.
Proceedings of the IEEE International Conference on Communications, 2017

Object Region Mining with Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Uncovering the Effect of Visual Saliency on Image Retrieval.
Proceedings of the Computer Vision - Second CCF Chinese Conference, 2017

Unsupervised Multi-view Subspace Learning via Maximizing Dependence.
Proceedings of the Computer Vision - Second CCF Chinese Conference, 2017

Stabilizing Object Tracking Trajectory on the Basis of Kalman Filter.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Block-Level Entropy-Based Adaptive Sampling Framework for Depth Map.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

A new detector for JPEG decompressed bitmap identification.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Detection of various image operations based on CNN.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Visual attention guided eye movements for 360 degree images.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Depth Map Down-Sampling and Coding Based on Synthesized View Distortion.
IEEE Trans. Multim., 2016

Region-Aware 3-D Warping for DIBR.
IEEE Trans. Multim., 2016

Modality-Dependent Cross-Media Retrieval.
ACM Trans. Intell. Syst. Technol., 2016

Virtual-View-Assisted Video Super-Resolution and Enhancement.
IEEE Trans. Circuits Syst. Video Technol., 2016

Learning to segment with image-level annotations.
Pattern Recognit., 2016

A dynamic niching clustering algorithm based on individual-connectedness and its application to color image segmentation.
Pattern Recognit., 2016

HCP: A Flexible CNN Framework for Multi-Label Image Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Text Detection Based On Discriminative Dictionary Learning.
J. Inf. Hiding Multim. Signal Process., 2016

Fast Algorithm for Intra Prediction of HEVC Using Adaptive Decision Trees.
KSII Trans. Internet Inf. Syst., 2016

Resolution-independent Up-sampling for Depth Map Using Fractal Transforms.
KSII Trans. Internet Inf. Syst., 2016

LSSLP - Local structure sensitive label propagation.
Inf. Sci., 2016

Light-weight binary code embedding of local feature distribution in image search.
Neurocomputing, 2016

Edge-Based Adaptive Sampling for Image Block Compressive Sensing.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2016

Multiview video plus depth transmission via virtual-view-assisted complementary down/upsampling.
EURASIP J. Image Video Process., 2016

Camera Fingerprint: A New Perspective for Identifying User's Identity.
CoRR, 2016

A dimensionality reduction method based on structured sparse representation for face recognition.
Artif. Intell. Rev., 2016

Joint iterative guidance filtering for compressed depth images.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Packetization strategies for MVD-based 3D video transmission.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Feature-based depth refinement for view synthesis.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Depth map up-sampling with texture edge feature via sparse representation.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Recapture Image Forensics Based on Laplacian Convolutional Neural Networks.
Proceedings of the Digital Forensics and Watermarking - 15th International Workshop, 2016

Reversible 3D Image Data Hiding with Quality Enhancement.
Proceedings of the Digital Forensics and Watermarking - 15th International Workshop, 2016

3D video super-resolution using fully convolutional neural networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

A comparative evaluation: Different methods for simplifying the deep compositional features.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Improving the similarity estimation via score distribution.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Double JPEG detection using high order statistic features.
Proceedings of the 2016 IEEE International Conference on Digital Signal Processing, 2016

Just Noticeable Difference Based Fast Coding Unit Partition in 3D-HEVC Intra Coding.
Proceedings of the 2016 Data Compression Conference, 2016

Saliency detection using secondary quantization in DCT domain.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Kernel Reconstruction ICA for Sparse Representation.
IEEE Trans. Neural Networks Learn. Syst., 2015

Sparsity Learning Formulations for Mining Time-Varying Data.
IEEE Trans. Knowl. Data Eng., 2015

Scalable Bit Allocation Between Texture and Depth Views for 3-D Video Streaming Over Heterogeneous Networks.
IEEE Trans. Circuits Syst. Video Technol., 2015

Multiple Description Coding for Stereoscopic Videos With Stagger Frame Order.
IEEE Trans. Circuits Syst. Video Technol., 2015

Efficient color image reversible data hiding based on channel-dependent payload partition and adaptive embedding.
Signal Process., 2015

Accumulated reconstruction error vector (AREV): a semantic representation for cross-media retrieval.
Multim. Tools Appl., 2015

Redundancy filtering and fusion verification for video copy detection.
Multim. Syst., 2015

Image Set Compression Based on Undirected Weighted Graph.
J. Inf. Hiding Multim. Signal Process., 2015

Recaptured Images Forensics Based On Color Moments and DCT Coefficients Features.
J. Inf. Hiding Multim. Signal Process., 2015

Visual Saliency Detection Based Object Recognition.
J. Inf. Hiding Multim. Signal Process., 2015

Optimized Multiple Description Lattice Vector Quantization Coding for 3D Depth Image.
KSII Trans. Internet Inf. Syst., 2015

Depth Map Coding Using Histogram-Based Segmentation and Depth Range Updating.
KSII Trans. Internet Inf. Syst., 2015

Detection for Operation Chain: Histogram Equalization and Dither-like Operation.
KSII Trans. Internet Inf. Syst., 2015

STC: A Simple to Complex Framework for Weakly-supervised Semantic Segmentation.
CoRR, 2015

Indexing of CNN Features for Large Scale Image Search.
CoRR, 2015

Depth-Based Stereoscopic Projection Approach for 3D Saliency Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

An Accurate and Efficient Nonlinear Depth Quantization Scheme.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Orthogonal and Smooth Subspace Based on Sparse Coding for Image Classification.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

A Flexible Programmable Camera Control and Data Acquisition Hardware Platform.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Global Motion Information Based Depth Map Sequence Coding.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

On the Security of Image Manipulation Forensics.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Quantized dictionary for sparse representation.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

A fast region-level 3D-warping method for depth-image-based rendering.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

An Effective Detection Method Based on Physical Traits of Recaptured Images on LCD Screens.
Proceedings of the Digital-Forensics and Watermarking - 14th International Workshop, 2015

Detection of Seam Carving and Contrast Enhancement Operation Chain.
Proceedings of the 2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2015

Cross-media hashing with Centroid Approaching.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Photo linker: a system for finding your old photos based on fragmentary memories.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Heterogeneous data alignment for cross-media computing.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Texture Characteristics Based Fast Coding Unit Partition in HEVC Intra Coding.
Proceedings of the 2015 Data Compression Conference, 2015

Intra-/inter-View Correlation Based Multiple Description Coding for Multiview Transmission.
Proceedings of the 2015 Data Compression Conference, 2015

Depth map coding based on adaptive block compressive sensing.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Light-Weight Spatial Distribution Embedding of Adjacent Features for Image Search.
Proceedings of the Computer Vision - CCF Chinese Conference, 2015

2014
Macroblock Level Bits Allocation for Depth Maps in 3-D Video Coding.
J. Signal Process. Syst., 2014

Extracting shared subspace incrementally for multi-label image classification.
Vis. Comput., 2014

Mining Semantically Consistent Patterns for Cross-View Data.
IEEE Trans. Knowl. Data Eng., 2014

Salient Region Detection by Fusing Bottom-Up and Top-Down Features Extracted From a Single Image.
IEEE Trans. Image Process., 2014

Multiple Description Coding With Randomly and Uniformly Offset Quantizers.
IEEE Trans. Image Process., 2014

Contrast Enhancement-Based Forensics in Digital Images.
IEEE Trans. Inf. Forensics Secur., 2014

Topographic NMF for Data Representation.
IEEE Trans. Cybern., 2014

Multiple Description Video Coding Based on Human Visual System Characteristics.
IEEE Trans. Circuits Syst. Video Technol., 2014

Depth Map Driven Hole Filling Algorithm Exploiting Temporal Correlation Information.
IEEE Trans. Broadcast., 2014

Reversible data hiding using invariant pixel-value-ordering and prediction-error expansion.
Signal Process. Image Commun., 2014

Optimizing the deadzone width to improve the polyphase-based multiple description coding.
Multim. Tools Appl., 2014

Shared Subspace Learning for Latent Representation of Multi-View Data.
J. Inf. Hiding Multim. Signal Process., 2014

Forensic detection of noise addition in digital images.
J. Electronic Imaging, 2014

Ensemble dictionary learning for saliency detection.
Image Vis. Comput., 2014

Auto-Covariance Analysis for Depth Map Coding.
KSII Trans. Internet Inf. Syst., 2014

Fast Intraframe Coding for High Efficiency Video Coding.
KSII Trans. Internet Inf. Syst., 2014

Just Noticeable Difference Based Fast Coding Unit Partition in HEVC Intra Coding.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2014

Multiple Description Video Coding Using Inter- and Intra-Description Correlation at Macro Block Level.
IEICE Trans. Inf. Syst., 2014

Depth Image Coding Using Entropy-Based Adaptive Measurement Allocation.
Entropy, 2014

On YASS's Non-monotonic Security Performance.
Circuits Syst. Signal Process., 2014

CNN: Single-label to Multi-label.
CoRR, 2014

A channel selection rule for YASS.
Sci. China Inf. Sci., 2014

Attacking contrast enhancement forensics in digital images.
Sci. China Inf. Sci., 2014

Multiple description video coding using correlation optimized temporal sampling.
Sci. China Inf. Sci., 2014

Local stereo matching algorithm using rotation-skeleton-based region.
Proceedings of the IEEE 16th International Workshop on Multimedia Signal Processing, 2014

Superpixel-Based Watermarking Scheme for Image Authentication and Recovery.
Proceedings of the Digital-Forensics and Watermarking - 13th International Workshop, 2014

Improved SIFT-Based Copy-Move Detection Using BFSN Clustering and CFA Features.
Proceedings of the 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2014

Learning a mid-level feature space for cross-media regularization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Warping-driven mode selection for depth error concealment.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

Two-Stage Multiview Image Compression Using Interview SIFT Matching.
Proceedings of the Data Compression Conference, 2014

Adaptive reversible watermarking using trimmed prediction and pixel-selection-based sorting.
Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014

Forensics of image blurring and sharpening history based on NSCT domain.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

A reversible data hiding based on adaptive prediction technique and histogram shifting.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Pairwise Prediction-Error Expansion for Efficient Reversible Data Hiding.
IEEE Trans. Image Process., 2013

Joint Optimization Toward Effective and Efficient Image Search.
IEEE Trans. Cybern., 2013

LDFT-Based Watermarking Resilient to Local Desynchronization Attacks.
IEEE Trans. Cybern., 2013

Real-Time Video Streaming Using Randomized Expanding Reed-Solomon Code.
IEEE Trans. Circuits Syst. Video Technol., 2013

Generalized Gradient Vector Flow for Snakes: New Observations, Analysis, and Improvement.
IEEE Trans. Circuits Syst. Video Technol., 2013

Control-Point Representation and Differential Coding Affine-Motion Compensation.
IEEE Trans. Circuits Syst. Video Technol., 2013

A Real-Time Error Resilient Video Streaming Scheme Exploiting the Late- and Early-Arrival Packets.
IEEE Trans. Broadcast., 2013

Reversible data hiding based on PDE predictor.
J. Syst. Softw., 2013

Class-Driven Non-Negative Matrix Factorization for Image Representation.
J. Comput. Sci. Technol., 2013

Robust Multi-Bit Watermarking for Free-View Television Using Light Field Rendering.
IEICE Trans. Inf. Syst., 2013

Kernel Reconstruction ICA for Sparse Representation.
CoRR, 2013

Fast bottom-up pruning for HEVC intraframe coding.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

High Capacity Reversible Watermarking for Images Based on Classified Neural Network.
Proceedings of the Image Analysis, 18th Scandinavian Conference, 2013

Reversible Image Watermarking Based on Neural Network and Parity Property.
Proceedings of the Multimedia and Ubiquitous Engineering, 2013

Depth down/up-sampling using hybrid correlation for depth coding.
Proceedings of the 11th IVMSP Workshop: 3D Image/Video Technologies and Applications, 2013

Multiple description coding with randomly offset quantizers.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Directional block compressed sensing for image coding.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Recaptured Image Detection Based on Texture Features.
Proceedings of the Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2013

Orthogonal graph-regularized matrix factorization and its application for recommendation.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Neighborhood reversibility verifying for image search.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Multiple PiPs detection in unbounded video stream.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Multiple description video coding based on forward error correction within expanding windows.
Proceedings of the IEEE International Conference on Image Processing, 2013

Frame filtering and path verification for improving video copy detection.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

TV Commercial Categorization Based on Sparse Visual Bag of Words.
Proceedings of the Seventh International Conference on Image and Graphics, 2013

M-channel multiple description coding based on uniformly offset quantizers with optimal deadzone.
Proceedings of the IEEE International Conference on Acoustics, 2013

Forensics of blurred images based on no-reference image quality assessment.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

2012
Dynamic Sub-GOP Forward Error Correction Code for Real-Time Video Applications.
IEEE Trans. Multim., 2012

Reversible watermarking using optional prediction error histogram modification.
Neurocomputing, 2012

A Study on Embedding Efficiency of Matrix Encoding.
Int. J. Digit. Crime Forensics, 2012

A genetic clustering algorithm using a message-based similarity measure.
Expert Syst. Appl., 2012

RST transforms resistant image watermarking based on centroid and sector-shaped partition.
Sci. China Inf. Sci., 2012

Affine SKIP and DIRECT modes for efficient video coding.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Real-time video streaming exploiting the late-arrival packets.
Proceedings of the 2012 Picture Coding Symposium, 2012

View Synthesis Based on Background Update with Gaussian Mixture Model.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

An Interactive Semi-supervised Approach for Automatic Image Annotation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Discriminative ICA model with reconstruction constraint for image classification.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Knowledge Transferring for Image Classification.
Proceedings of the Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2012

Distortion estimation for compressed video transmission over mobile networks.
Proceedings of the 18th IEEE International Conference on Networks, 2012

Multiple description coding for scalable video coding with redundant slice.
Proceedings of the 18th IEEE International Conference on Networks, 2012

Stereo video coding using distributed compressive sensing with joint dictionary.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Correlation preserved dictionary learning for sparse representation.
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Multiple Description Video Coding Using Macro Block Level Correlation of Inter-/Intra-Descriptions.
Proceedings of the 2012 Data Compression Conference, Snowbird, UT, USA, April 10-12, 2012

Incremental Shared Subspace Learning for Multi-label Classification.
Proceedings of the Computational Visual Media - First International Conference, 2012

Graph Regularized ICA for Over-Complete Feature Learning.
Proceedings of the Computational Visual Media - First International Conference, 2012

2011
Exploiting Visual-Audio-Textual Characteristics for Automatic TV Commercial Block Detection and Segmentation.
IEEE Trans. Multim., 2011

Frame Fusion for Video Copy Detection.
IEEE Trans. Circuits Syst. Video Technol., 2011

Multiple Description Coding for H.264/AVC With Redundancy Allocation at Macro Block Level.
IEEE Trans. Circuits Syst. Video Technol., 2011

Unsharp Masking Sharpening Detection via Overshoot Artifacts Analysis.
IEEE Signal Process. Lett., 2011

Compatible Stereo Video Coding with Adaptive Prediction Structure.
IEICE Trans. Inf. Syst., 2011

Integrated image representation based natural scene classification.
Expert Syst. Appl., 2011

Joint redundant motion vector and intra macroblock refreshment for video transmission.
EURASIP J. Image Video Process., 2011

Error-resilient video coding with end-to-end rate-distortion optimized at macroblock level.
EURASIP J. Adv. Signal Process., 2011

Two-description distributed video coding for robust transmission.
EURASIP J. Adv. Signal Process., 2011

Real-time forward error correction for video transmission.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

Spread Spectrum-Based Multi-bit Watermarking for Free-View Video.
Proceedings of the Digital Forensics and Watermarking - 10th International Workshop, 2011

Copy detection towards semantic mining for video retrieval.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Motion compensated prediction using partial mesh generation.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Pruned multi-level successive elimination algorithm for TV commercial recognition.
Proceedings of the ICIMCS 2011, 2011

A robust dynamic niching genetic clustering approach for image segmentation.
Proceedings of the 13th Annual Genetic and Evolutionary Computation Conference, 2011

A Dynamic Niching Quantum Genetic Algorithm for Automatic Evolution of Clusters.
Proceedings of the Computer Analysis of Images and Patterns, 2011

Distributed Multiple Description Coding - Principles, Algorithms and Systems.
Springer, ISBN: 978-1-447-12247-0, 2011

2010
A Cooperative Learning Scheme for Interactive Video Search.
J. Signal Process. Syst., 2010

Multimodal Fusion for Video Search Reranking.
IEEE Trans. Knowl. Data Eng., 2010

Edge-based Blur Metric for Tamper Detection.
J. Inf. Hiding Multim. Signal Process., 2010

Commercial Shot Classification Based on Multiple Features Combination.
IEICE Trans. Inf. Syst., 2010

Improved Adaptive LSB Steganography Based on Chaos and Genetic Algorithm.
EURASIP J. Adv. Signal Process., 2010

Standard-Compliant Multiple Description Video Coding over Packet Loss Network.
EURASIP J. Adv. Signal Process., 2010

Commercial Recognition in TV Streams Using Coarse-to-Fine Matching Strategy.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Anti-forensics of contrast enhancement in digital images.
Proceedings of the Multimedia and Security Workshop, 2010

Reversible Watermarking Using Prediction Error Histogram and Blocking.
Proceedings of the Digital Watermarking - 9th International Workshop, 2010

Kernel Canonical Correlation with Similarity Refinement for Automatic Image Tagging.
Proceedings of the Sixth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2010), 2010

Spread Spectrum-Based Image Watermarking Resistant to Rotation and Scaling Using Radon Transform.
Proceedings of the Sixth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2010), 2010

A High Payload Histogram-Based Reversible Watermarking Using Linear Prediction.
Proceedings of the Sixth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2010), 2010

Multiple Description Wavelet Based Image Coding with Classification.
Proceedings of the Sixth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2010), 2010

A high-performance YASS-like scheme using randomized big-blocks.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Multi-modal characteristics analysis and fusion for TV commercial detection.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Forensic detection of median filtering in digital images.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Forensic estimation of gamma correction in digital images.
Proceedings of the International Conference on Image Processing, 2010

Geometrically Invariant Image Watermarking Using Scale-Invariant Feature Transform and K-Means Clustering.
Proceedings of the Computational Collective Intelligence. Technologies and Applications, 2010

GOP-Flexible Distributed Multiview Video Coding with Adaptive Side Information.
Proceedings of the Computational Collective Intelligence. Technologies and Applications, 2010

2009
PM1 steganography in JPEG images using genetic algorithm.
Soft Comput., 2009

Two-Stage Multiple Description Image Coding Using TCQ.
Int. J. Wavelets Multiresolution Inf. Process., 2009

TSVM-HMM: Transductive SVM based hidden Markov model for automatic image annotation.
Expert Syst. Appl., 2009

Lossless data hiding based on prediction-error adjustment.
Sci. China Ser. F Inf. Sci., 2009

Robust multiple description distributed video coding using optimized zero-padding.
Sci. China Ser. F Inf. Sci., 2009

An Integrative Codebook for Natural Scene Categorization.
Proceedings of the Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2009), 2009

Detection of image sharpening based on histogram aberration and ringing artifacts.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A Two-Description Distributed Video Coding.
Proceedings of the Fifth International Conference on Information Assurance and Security, 2009

Distributed Video Coding Based on Multiple Description.
Proceedings of the Fifth International Conference on Information Assurance and Security, 2009

2008
Reversible Watermarking Based on Invariability and Adjustment on Pixel Pairs.
IEEE Signal Process. Lett., 2008

Two-Stage Diversity-Based Multiple Description Image Coding.
IEEE Signal Process. Lett., 2008

Co-training for search-based automatic image annotation.
J. Digit. Inf. Manag., 2008

MR-MIL: Manifold Ranking Based Multiple-Instance Learning for Automatic Image Annotation.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2008

An efficient hybrid distributed video coding.
IEICE Electron. Express, 2008

Ontology-Based Inter-concept Relation Fusion for Concept Detection.
Proceedings of the Advances in Multimedia Information Processing, 2008

A High Capacity Steganographic Algorithm in Color Images.
Proceedings of the Digital Watermarking, 7th International Workshop, 2008

A Novel Image Annotation Scheme Based on Neural Network.
Proceedings of the Eighth International Conference on Intelligent Systems Design and Applications, 2008

Reversible Watermarking Based on the Invariant Sum Value.
Proceedings of the Eighth International Conference on Intelligent Systems Design and Applications, 2008

Efficient Scalable Distributed Video Coding Based on Residual SW-SPIHT.
Proceedings of the Eighth International Conference on Intelligent Systems Design and Applications, 2008

A Unified System for Web Personal Image Retrieval.
Proceedings of the 4th International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2008), 2008

Multiple Description Image Coding Based on DSC and Pixel Interleaving.
Proceedings of the 4th International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2008), 2008

MRS-MIL: Minimum reference set based multiple instance learning for automatic image annotation.
Proceedings of the International Conference on Image Processing, 2008

Reversible watermarking based on PMO of triplets.
Proceedings of the International Conference on Image Processing, 2008

Strategy of combining random subspace and diversified active learning in CBIR.
Proceedings of the International Conference on Image Processing, 2008

Image authentication based on chaotic system with feedback and palm characteristics.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Priority Encoding Transmission Based Multiple Description Video Coding over Packet Loss Network.
Proceedings of the 2008 Data Compression Conference (DCC 2008), 2008

2007
Reversible Watermarking Techniques.
Proceedings of the Intelligent Multimedia Data Hiding: New Directions, 2007

Optimized Multiple Description Lattice Vector Quantization for Wavelet Image Coding.
IEEE Trans. Circuits Syst. Video Technol., 2007

Scale multiplication in odd Gabor transform domain for edge detection.
J. Vis. Commun. Image Represent., 2007

Lossless Data Hiding Based on Companding Technique and Difference Expansion of Triplets.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2007

BJTU TRECVID 2007 Video Search.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Three-Channel Multiple Description Image Coding Based on Special Lattice Vector Quantization.
Proceedings of the 3rd International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007), 2007

Robust Commercial Detection System.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

A Novel High-Capacity Reversible Watermarking Scheme.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Multiple Description Video Coding using Adaptive Temporal Sub-Sampling.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

A Novel Reversible Watermarking Based on an Integer Transform.
Proceedings of the International Conference on Image Processing, 2007

Sequential Architecture for Efficient Car Detection.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
Index assignment for MDVQ over memoryless binary symmetric channel with packet erasure.
IEICE Electron. Express, 2006

Geometrically robust video watermarking based on wavelet transform.
Sci. China Ser. F Inf. Sci., 2006

BJTU TRECVID 2006 Video Retrieval System.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Relevance Feedback based on Query Refining and Feature Database Updating in CBIR System.
Proceedings of the IASTED International Conference on Signal Processing, 2006

A Comprehensive Analysis for Relevance Feedback in CBIR System.
Proceedings of the IASTED International Conference on Signal Processing, 2006

LVQ Based Distributed Video Coding with LDPC in Pixel Domain.
Proceedings of the PRICAI 2006: Trends in Artificial Intelligence, 2006

Multiple Description Image Coding Using Shifted Lattice Vector Quantization.
Proceedings of the Second International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2006), 2006

Wavelet-Domain Distributed Video Coding with Motion-Compensated Refinement.
Proceedings of the International Conference on Image Processing, 2006

Multiple Description Shifted Lattice Vector Quantization for Progressive Wavelet Image Coding.
Proceedings of the International Conference on Image Processing, 2006

Multi-Scale Analysis of Odd Gabor Transform for Edge Detection.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006

Seeking User's Query Concept Dynamically Based on Region in Relevance Feedback.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006

An Anamnestic Semantic Tree-Based Relevance Feedback Method in CBIR System.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006

Improved Side-Information in Distributed Video Coding.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006

Performance Comparisons of Different Channel Codes in Distributed Video Coding.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006

Efficient Wavelet Zero-Tree Video Coding Based on Wyner-Ziv Coding and Lattice Vector Quantization.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006

Image Watermarking Robust to Print and Generation Copy.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006

Multiple Description Video Coding Based on Lattice Vector Quantization.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006

2005
Improved Quantization Watermarking with an Adaptive Quantization Step Size and HVS.
Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2005

Reversible Watermarking Based on Improved Patchwork Algorithm and Symmetric Modulo Operation.
Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2005

A Reversible Watermark Scheme Combined with Hash Function and Lossless Compression.
Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2005

Print and Generation Copy Image Watermarking Based on Spread Spectrum Technique.
Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2005

Optimized multiple description image coding using lattice vector quantization.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

2001
MSSBM and Its Application to Nature Image Coding.
Proceedings of the Data Compression Conference, 2001

1998
A new affine transformation: its theory and application to image coding.
IEEE Trans. Circuits Syst. Video Technol., 1998

1996
A hybrid image compression scheme combining block-based fractal coding and DCT.
Signal Process. Image Commun., 1996

