Wenmin Wang

Pattern Recognit. Lett., 2024

Multimodal parallel attention network for medical image segmentation.

[BibT_eX]

[DOI]

Image Vis. Comput., 2024

Ultrahigh-definition video quality assessment: A new dataset and benchmark.

[BibT_eX]

[DOI]

Neurocomputing, 2024

SgLFT: Semantic-guided Late Fusion Transformer for video corpus moment retrieval.

[BibT_eX]

[DOI]

Neurocomputing, 2024

Improving generative adversarial network inversion via fine-tuning GAN encoders.

[BibT_eX]

[DOI]

Roberto Bugiolacchi

Appl. Soft Comput., 2024

Convolution Self-Guided Transformer for Diagnosis and Recognition of Crop Disease in Different Environments.

[BibT_eX]

[DOI]

IEEE Access, 2024

Local Information Guided Global Integration for Infrared Small Target Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Span Confusion is All You Need for Chinese Spelling Correction.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

2023

Bounding convolutional network for refining object locations.

[BibT_eX]

[DOI]

Neural Comput. Appl., September, 2023

SaGCN: Semantic-Aware Graph Calibration Network for Temporal Sentence Grounding.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., June, 2023

Maximizing mutual information inside intra- and inter-modality for audio-visual event retrieval.

[BibT_eX]

[DOI]

Ruochen Li

Nannan Li

Int. J. Multim. Inf. Retr., June, 2023

Shadow Removal of Text Document Images Using Background Estimation and Adaptive Text Enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

SwapInpaint: Identity-Specific Face Inpainting With Identity Swapping.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

EVtracker: An Event-Driven Spatiotemporal Method for Dynamic Object Tracking.

[BibT_eX]

[DOI]

Sensors, 2022

Fast transformation of discriminators into encoders using pre-trained GANs.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2022

Affective word embedding in affective explanation generation for fine art paintings.

[BibT_eX]

[DOI]

Jianhao Yan

Pattern Recognit. Lett., 2022

Fast 2-step regularization on style optimization for real face morphing.

[BibT_eX]

[DOI]

Neural Networks, 2022

ANGraph: attribute-interactive neighborhood-aggregative graph representation learning.

[BibT_eX]

[DOI]

Neural Comput. Appl., 2022

2021

Adaptable GAN Encoders for Image Reconstruction via Multi-type Latent Vectors with Two-scale Attentions.

[BibT_eX]

[DOI]

CoRR, 2021

2020

Uni-and-Bi-Directional Video Prediction via Learning Object-Centric Transformation.

[BibT_eX]

[DOI]

Xiongtao Chen

IEEE Trans. Multim., 2020

Fast and Accurate Action Detection in Videos With Motion-Centric Attention Model.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2020

Self-Supervised Animation Synthesis Through Adversarial Training.

[BibT_eX]

[DOI]

Jianhao Yan

IEEE Access, 2020

Text-to-Image Generation via Semi-Supervised Training.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

A Dense-Gated U-Net for Brain Lesion Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Low Resolution Facial Manipulation Detection.

[BibT_eX]

[DOI]

Xiao Han

Zhongyi Ji

Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Generating Future Frames with Mask-Guided Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Exploring Entity-Level Spatial Relationships for Image-Text Matching.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Context Augmentation Aggregation Network for Nuclei Segmentation.

[BibT_eX]

[DOI]

Ruizhe Geng

Proceedings of the CSAI 2020: 2020 4th International Conference on Computer Science and Artificial Intelligence, 2020

2019

Predicting Diverse Future Frames With Local Transformation-Guided Masking.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2019

Brain-Inspired Inference on Missing Video Sequence.

[BibT_eX]

[DOI]

Weimian Li

Baoyang Chen

CoRR, 2019

ParNet: Position-aware Aggregated Relation Network for Image-Text matching.

[BibT_eX]

[DOI]

CoRR, 2019

Learning DALTS for cross-modal retrieval.

[BibT_eX]

[DOI]

Zheng Yu

CAAI Trans. Intell. Technol., 2019

Adaptively Aligned Image Captioning via Adaptive Attention Time.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Video Prediction with Temporal-Spatial Attention Mechanism and Deep Perceptual Similarity Branch.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Attention on Attention for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Multi-step Self-attention Network for Cross-modal Retrieval Based on a Limited Text Space.

[BibT_eX]

[DOI]

Zheng Yu

Ge Li

Proceedings of the IEEE International Conference on Acoustics, 2019

Image Captioning with Two Cascaded Agents.

[BibT_eX]

[DOI]

Lun Huang

Gang Wang

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Multiscale Deep Alternative Neural Network for Large-Scale Video Classification.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2018

Second- and High-Order Graph Matching for Correspondence Problems.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2018

MPEG Internet Video Coding Standard and Its Performance Evaluation.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2018

Local patch encoding-based method for single image super-resolution.

[BibT_eX]

[DOI]

Inf. Sci., 2018

Beyond Knowledge Distillation: Collaborative Learning for Bidirectional Model Assistance.

[BibT_eX]

[DOI]

IEEE Access, 2018

Dual Subspaces with Adversarial Learning for Cross-Modal Retrieval.

[BibT_eX]

[DOI]

Yaxian Xia

Liang Han

Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Adaptive Hierarchical Motion-Focused Model for Video Prediction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Image Captioning with Scene-graph Based Semantic Concepts.

[BibT_eX]

[DOI]

Lizhao Gao

Bo Wang

Proceedings of the 10th International Conference on Machine Learning and Computing, 2018

A Motion Aided Merge Mode For Hevc.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Accelerating Image-Domain-Warping Virtual View Synthesis on GPGPU.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2017

Color Image-Guided Boundary-Inconsistent Region Refinement for Stereo Matching.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2017

Iterative projection reconstruction for fast and efficient image upsampling.

[BibT_eX]

[DOI]

Neurocomputing, 2017

Local Patch Classification Based Framework for Single Image Super-Resolution.

[BibT_eX]

[DOI]

CoRR, 2017

Long-Term Video Interpolation with Bidirectional Predictive Network.

[BibT_eX]

[DOI]

CoRR, 2017

Video Imagination from a Single Image with Transformation Generation.

[BibT_eX]

[DOI]

CoRR, 2017

Deep discriminative network with inception module for person re-identification.

[BibT_eX]

[DOI]

Yihao Zhang

Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Adaptive difference modelling for background subtraction.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Learning multi-view embedding in joint space for bidirectional image-text retrieval.

[BibT_eX]

[DOI]

Lu Ran

Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Mask-streaming CNN for pedestrian detection.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Long-term video interpolation with bidirectional predictive network.

[BibT_eX]

[DOI]

Xiongtao Chen

Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Aligned Local Descriptors and Hierarchical Global Features for Person Re-Identification.

[BibT_eX]

[DOI]

Yihao Zhang

Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

SPOS: Deblur Image by Using Sparsity Prior and Outlier Suppression.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Unsupervised Concept Learning in Text Subspace for Cross-Media Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Collaborative Networks for Person Verification.

[BibT_eX]

[DOI]

Yihao Zhang

Proceedings of the First International Workshop on Multimedia Verification, MuVer@MM 2017, 2017

Cross-media Retrieval by Learning Rich Semantic Embeddings of Multimedia.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Learning Object-Centric Transformation for Video Prediction.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Video Imagination from a Single Image with Transformation Generation.

[BibT_eX]

[DOI]

Baoyang Chen

Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Better deep visual attention with reinforcement learning in action recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2017

A joint model for action localization and classification in untrimmed video with visual attention.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

An Innovative Salient Object Detection Using Center-Dark Channel Prior.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

A New Low-Light Image Enhancement Algorithm Using Camera Response Model.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Cross-modality matching based on Fisher Vector with neural word embeddings and deep image features.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A Multilayer Backpropagation Saliency Detection Algorithm Based on Depth Mining.

[BibT_eX]

[DOI]

Proceedings of the Computer Analysis of Images and Patterns, 2017

Learning a Limited Text Space for Cross-Media Retrieval.

[BibT_eX]

[DOI]

Zheng Yu

Mengdi Fan

Proceedings of the Computer Analysis of Images and Patterns, 2017

A New Image Contrast Enhancement Algorithm Using Exposure Fusion Framework.

[BibT_eX]

[DOI]

Proceedings of the Computer Analysis of Images and Patterns, 2017

Progressive Probabilistic Graph Matching with Local Consistency Regularization.

[BibT_eX]

[DOI]

Min Tang

Proceedings of the Computer Analysis of Images and Patterns, 2017

A Violence Detection Approach Based on Spatio-temporal Hypergraph Transition.

[BibT_eX]

[DOI]

Proceedings of the Computer Analysis of Images and Patterns, 2017

Attention-Based Two-Phase Model for Video Action Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Analysis of Images and Patterns, 2017

Salient Object Detection with Complex Scene Based on Cognitive Neuroscience.

[BibT_eX]

[DOI]

Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Collaborative Deep Networks for Pedestrian Detection.

[BibT_eX]

[DOI]

Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Beyond Monte Carlo Tree Search: Playing Go with Deep Alternative Neural Network and Long-Term Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

CSPS: An Adaptive Pooling Method for Image Classification.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2016

Multilevel Modified Finite Radon Transform Network for Image Upsampling.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2016

Spatially variant defocus blur map estimation and deblurring from a single image.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2016

Local Quantization Code histogram for texture classification.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Frame interpolation with pixel-level motion vector field and mesh based hole filling.

[BibT_eX]

[DOI]

CAAI Trans. Intell. Technol., 2016

Manifold alignment using discrete surface Ricci flow.

[BibT_eX]

[DOI]

Zhongxin Liu

Qun Jin

CAAI Trans. Intell. Technol., 2016

An advanced local offset matching strategy for object proposal matching.

[BibT_eX]

[DOI]

Proceedings of the 2016 Visual Communications and Image Processing, 2016

A new video denoising method using texture metric and adaptive structure variance.

[BibT_eX]

[DOI]

Proceedings of the 2016 Visual Communications and Image Processing, 2016

A simple but efficient way to combine VLAD with locality-constrained linear coding.

[BibT_eX]

[DOI]

Proceedings of the 2016 Visual Communications and Image Processing, 2016

Better region proposals for pedestrian detection with R-CNN.

[BibT_eX]

[DOI]

Peilei Dong

Proceedings of the 2016 Visual Communications and Image Processing, 2016

An effective post quantization rate estimation for HEVC intra encoder.

[BibT_eX]

[DOI]

Proceedings of the 2016 Visual Communications and Image Processing, 2016

Deep Alternative Neural Network: Exploring Contexts as Early as Possible for Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

A Novel Shadow-Free Feature Extractor for Real-Time Road Detection.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Regional Subspace Projection Coding for Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

An MCMC-based prior sub-hypergraph matching in presence of outliers.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

An Empirical Study of Deformable Part Model with fast feature pyramid.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

A K-Nearest-Neighbor-Pooling method for graph matching.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

Coupled feature mapping and correlation mining for cross-media retrieval.

[BibT_eX]

[DOI]

Mengdi Fan

Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

Tube ConvNets: Better exploiting motion for action recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

An Object-Aware Anomaly Detection and Localization in Surveillance Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

A Fast and Lossless IDCT Design for AVS2 Codec.

[BibT_eX]

[DOI]

Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

2015

High Resolution Local Structure-Constrained Image Upsampling.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2015

Dynamic macroblock wavefront parallelism for parallel video coding.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2015

Bioinspired Mechanisms in Wireless Ad Hoc and Sensor Networks.

[BibT_eX]

[DOI]

Anand Paul

Daniel Bo-Wei Chen

J. Sensors, 2015

Video pre-processing with JND-based Gaussian filtering of superpixels.

[BibT_eX]

[DOI]

Proceedings of the Visual Information Processing and Communication VI, 2015

Weighted transformable spatial pyramid and scalable query for object retrieval.

[BibT_eX]

[DOI]

Zi'ou Zheng

Proceedings of the 2015 Visual Communications and Image Processing, 2015

Improving VLAD with regional PCA whitening.

[BibT_eX]

[DOI]

Proceedings of the 2015 Visual Communications and Image Processing, 2015

Clustering Sentences with Density Peaks for Multi-document Summarization.

[BibT_eX]

[DOI]

Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Context-adaptive fast motion estimation of HEVC.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Fast intra mode decision algorithm based on refinement in HEVC.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Signature of unique angles Histograms for 3D data description.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

An improved averaging combination method for image and object recognition.

[BibT_eX]

[DOI]

Yingli Wei

Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

Learning class-specific pooling shapes for image classification.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Improved cluster center adaption for image classification.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Accelerating CDVS extraction on mobile platform.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Image deblurring using robust sparsity priors.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

A compact shot representation for video semantic indexing.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Image classification using RBM to encode local descriptors with group sparse learning.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

A low-light image enhancement method for both denoising and contrast enlarging.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

A novel integer-pixel motion estimation algorithm based on quadratic prediction.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Learning discriminative visual dictionary for natural scene categorization.

[BibT_eX]

[DOI]

Ying Huang

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Stereo matching with space-constrained cost aggregation and segmentation-based disparity refinement.

[BibT_eX]

[DOI]

Proceedings of the Three-Dimensional Image Processing, 2015

2014

Local Stereo Matching with Improved Matching Cost and Disparity Refinement.

[BibT_eX]

[DOI]

IEEE Multim., 2014

Incremental Multi-manifold Out-of-Sample Data Prediction.

[BibT_eX]

[DOI]

Zhongxin Liu

Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), Warsaw, Poland, August 11-14, 2014, 2014

An all-zero blocks early detection method for high-efficiency video coding.

[BibT_eX]

[DOI]

Proceedings of the Visual Information Processing and Communication V, 2014

Low-cost multi-hypothesis motion compensation for video coding.

[BibT_eX]

[DOI]

Proceedings of the Visual Information Processing and Communication V, 2014

A new frame interpolation method with pixel-level motion vector field.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

An approach to support stereoscopic 3D web.

[BibT_eX]

[DOI]

Proceedings of the Symposium on Applied Computing, 2014

Cost-volume filtering-based stereo matching with improved matching cost and secondary refinement.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

HEVC decoder acceleration on multi-core X86 platform.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

A rendering approach for stereoscopic web pages.

[BibT_eX]

[DOI]

Proceedings of the Stereoscopic Displays and Applications XXV, 2014

The design and implementation of stereoscopic 3D scalable vector graphics based on WebKit.

[BibT_eX]

[DOI]

Zhongxin Liu

Proceedings of the Stereoscopic Displays and Applications XXV, 2014

The rendering context for stereoscopic 3D web.

[BibT_eX]

[DOI]

Qinshui Chen

Proceedings of the Stereoscopic Displays and Applications XXV, 2014

Fast motion estimation methods for HEVC.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2014

2013

Dynamic MB-level Scheduling for parallel video coding.

[BibT_eX]

[DOI]

Proceedings of the 30th Picture Coding Symposium, 2013

Acceleration of HEVC transform and inverse transform on ARM NEON platform.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Intelligent Signal Processing and Communication Systems, 2013

Adaptive motion estimation order for frame rate up-conversion.

[BibT_eX]

[DOI]

Chengzhou Tang

Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

High definition IEEE AVS decoder on ARM NEON platform.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2013

A hybrid pixel-block based view synthesis for multiviewpoint 3D video.

[BibT_eX]

[DOI]

Proceedings of the 3DTV-Conference 2013: The True Vision, 2013

2012

Robust hand tracking with refined CAMShift based on combination of Depth and image features.

[BibT_eX]

[DOI]

Wenhuan Cui