Wenmin Wang

Orcid: 0000-0003-2664-4413

Affiliations:
  • Macau University of Science and Technology, School of Computer Science and Engineering, Macau
  • Peking University, National Engineering Laboratory for Video Technology, Shenzhen, China (former)
  • South China University of Technology, School of Software Engineering, Guangzhou, China (former)
  • Harbin Institute of Electrical Technology, China (former)
  • Harbin Institute of Technology, China (PhD 1989)


According to our database1, Wenmin Wang authored at least 144 papers between 2012 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
SwapInpaint2: Towards high structural consistency in identity-guided inpainting via background-preserving GAN inversion.
Pattern Recognit., 2025

DS&STM-Net: A novel hybrid network of feature mutual fusion for medical image segmentation.
Biomed. Signal Process. Control., 2025

2024
E-detector: Asynchronous Spatio-temporal for Event-based Object Detection in Intelligent Transportation System.
ACM Trans. Multim. Comput. Commun. Appl., February, 2024

Recovery-Based Occluded Face Recognition by Identity-Guided Inpainting.
Sensors, January, 2024

TA2V: Text-Audio Guided Video Generation.
IEEE Trans. Multim., 2024

Cross-Modality Knowledge Calibration Network for Video Corpus Moment Retrieval.
IEEE Trans. Multim., 2024

Enhanced blind face inpainting via structured mask prediction.
Pattern Recognit. Lett., 2024

Multimodal parallel attention network for medical image segmentation.
Image Vis. Comput., 2024

Ultrahigh-definition video quality assessment: A new dataset and benchmark.
Neurocomputing, 2024

SgLFT: Semantic-guided Late Fusion Transformer for video corpus moment retrieval.
Neurocomputing, 2024

Improving generative adversarial network inversion via fine-tuning GAN encoders.
Appl. Soft Comput., 2024

Convolution Self-Guided Transformer for Diagnosis and Recognition of Crop Disease in Different Environments.
IEEE Access, 2024

Local Information Guided Global Integration for Infrared Small Target Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024

Span Confusion is All You Need for Chinese Spelling Correction.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

2023
Bounding convolutional network for refining object locations.
Neural Comput. Appl., September, 2023

SaGCN: Semantic-Aware Graph Calibration Network for Temporal Sentence Grounding.
IEEE Trans. Circuits Syst. Video Technol., June, 2023

Maximizing mutual information inside intra- and inter-modality for audio-visual event retrieval.
Int. J. Multim. Inf. Retr., June, 2023

Shadow Removal of Text Document Images Using Background Estimation and Adaptive Text Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
SwapInpaint: Identity-Specific Face Inpainting With Identity Swapping.
IEEE Trans. Circuits Syst. Video Technol., 2022

EVtracker: An Event-Driven Spatiotemporal Method for Dynamic Object Tracking.
Sensors, 2022

Fast transformation of discriminators into encoders using pre-trained GANs.
Pattern Recognit. Lett., 2022

Affective word embedding in affective explanation generation for fine art paintings.
Pattern Recognit. Lett., 2022

Fast 2-step regularization on style optimization for real face morphing.
Neural Networks, 2022

ANGraph: attribute-interactive neighborhood-aggregative graph representation learning.
Neural Comput. Appl., 2022

2021
Adaptable GAN Encoders for Image Reconstruction via Multi-type Latent Vectors with Two-scale Attentions.
CoRR, 2021

2020
Uni-and-Bi-Directional Video Prediction via Learning Object-Centric Transformation.
IEEE Trans. Multim., 2020

Fast and Accurate Action Detection in Videos With Motion-Centric Attention Model.
IEEE Trans. Circuits Syst. Video Technol., 2020

Self-Supervised Animation Synthesis Through Adversarial Training.
IEEE Access, 2020

Text-to-Image Generation via Semi-Supervised Training.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

A Dense-Gated U-Net for Brain Lesion Segmentation.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Low Resolution Facial Manipulation Detection.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Generating Future Frames with Mask-Guided Prediction.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Exploring Entity-Level Spatial Relationships for Image-Text Matching.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Context Augmentation Aggregation Network for Nuclei Segmentation.
Proceedings of the CSAI 2020: 2020 4th International Conference on Computer Science and Artificial Intelligence, 2020

2019
Predicting Diverse Future Frames With Local Transformation-Guided Masking.
IEEE Trans. Circuits Syst. Video Technol., 2019

Brain-Inspired Inference on Missing Video Sequence.
CoRR, 2019

ParNet: Position-aware Aggregated Relation Network for Image-Text matching.
CoRR, 2019

Learning DALTS for cross-modal retrieval.
CAAI Trans. Intell. Technol., 2019

Adaptively Aligned Image Captioning via Adaptive Attention Time.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Video Prediction with Temporal-Spatial Attention Mechanism and Deep Perceptual Similarity Branch.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Attention on Attention for Image Captioning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Multi-step Self-attention Network for Cross-modal Retrieval Based on a Limited Text Space.
Proceedings of the IEEE International Conference on Acoustics, 2019

Image Captioning with Two Cascaded Agents.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Multiscale Deep Alternative Neural Network for Large-Scale Video Classification.
IEEE Trans. Multim., 2018

Second- and High-Order Graph Matching for Correspondence Problems.
IEEE Trans. Circuits Syst. Video Technol., 2018

MPEG Internet Video Coding Standard and Its Performance Evaluation.
IEEE Trans. Circuits Syst. Video Technol., 2018

Local patch encoding-based method for single image super-resolution.
Inf. Sci., 2018

Beyond Knowledge Distillation: Collaborative Learning for Bidirectional Model Assistance.
IEEE Access, 2018

Dual Subspaces with Adversarial Learning for Cross-Modal Retrieval.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Adaptive Hierarchical Motion-Focused Model for Video Prediction.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Image Captioning with Scene-graph Based Semantic Concepts.
Proceedings of the 10th International Conference on Machine Learning and Computing, 2018

A Motion Aided Merge Mode For Hevc.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Accelerating Image-Domain-Warping Virtual View Synthesis on GPGPU.
IEEE Trans. Multim., 2017

Color Image-Guided Boundary-Inconsistent Region Refinement for Stereo Matching.
IEEE Trans. Circuits Syst. Video Technol., 2017

Iterative projection reconstruction for fast and efficient image upsampling.
Neurocomputing, 2017

Local Patch Classification Based Framework for Single Image Super-Resolution.
CoRR, 2017

Long-Term Video Interpolation with Bidirectional Predictive Network.
CoRR, 2017

Video Imagination from a Single Image with Transformation Generation.
CoRR, 2017

Deep discriminative network with inception module for person re-identification.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Adaptive difference modelling for background subtraction.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Learning multi-view embedding in joint space for bidirectional image-text retrieval.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Mask-streaming CNN for pedestrian detection.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Long-term video interpolation with bidirectional predictive network.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Aligned Local Descriptors and Hierarchical Global Features for Person Re-Identification.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

SPOS: Deblur Image by Using Sparsity Prior and Outlier Suppression.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Unsupervised Concept Learning in Text Subspace for Cross-Media Retrieval.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Collaborative Networks for Person Verification.
Proceedings of the First International Workshop on Multimedia Verification, MuVer@MM 2017, 2017

Cross-media Retrieval by Learning Rich Semantic Embeddings of Multimedia.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Learning Object-Centric Transformation for Video Prediction.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Video Imagination from a Single Image with Transformation Generation.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Better deep visual attention with reinforcement learning in action recognition.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2017

A joint model for action localization and classification in untrimmed video with visual attention.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

An Innovative Salient Object Detection Using Center-Dark Channel Prior.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

A New Low-Light Image Enhancement Algorithm Using Camera Response Model.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Cross-modality matching based on Fisher Vector with neural word embeddings and deep image features.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A Multilayer Backpropagation Saliency Detection Algorithm Based on Depth Mining.
Proceedings of the Computer Analysis of Images and Patterns, 2017

Learning a Limited Text Space for Cross-Media Retrieval.
Proceedings of the Computer Analysis of Images and Patterns, 2017

A New Image Contrast Enhancement Algorithm Using Exposure Fusion Framework.
Proceedings of the Computer Analysis of Images and Patterns, 2017

Progressive Probabilistic Graph Matching with Local Consistency Regularization.
Proceedings of the Computer Analysis of Images and Patterns, 2017

A Violence Detection Approach Based on Spatio-temporal Hypergraph Transition.
Proceedings of the Computer Analysis of Images and Patterns, 2017

Attention-Based Two-Phase Model for Video Action Detection.
Proceedings of the Computer Analysis of Images and Patterns, 2017

Salient Object Detection with Complex Scene Based on Cognitive Neuroscience.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Collaborative Deep Networks for Pedestrian Detection.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Beyond Monte Carlo Tree Search: Playing Go with Deep Alternative Neural Network and Long-Term Evaluation.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
CSPS: An Adaptive Pooling Method for Image Classification.
IEEE Trans. Multim., 2016

Multilevel Modified Finite Radon Transform Network for Image Upsampling.
IEEE Trans. Circuits Syst. Video Technol., 2016

Spatially variant defocus blur map estimation and deblurring from a single image.
J. Vis. Commun. Image Represent., 2016

Local Quantization Code histogram for texture classification.
Neurocomputing, 2016

Frame interpolation with pixel-level motion vector field and mesh based hole filling.
CAAI Trans. Intell. Technol., 2016

Manifold alignment using discrete surface Ricci flow.
CAAI Trans. Intell. Technol., 2016

An advanced local offset matching strategy for object proposal matching.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

A new video denoising method using texture metric and adaptive structure variance.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

A simple but efficient way to combine VLAD with locality-constrained linear coding.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Better region proposals for pedestrian detection with R-CNN.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

An effective post quantization rate estimation for HEVC intra encoder.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Deep Alternative Neural Network: Exploring Contexts as Early as Possible for Action Recognition.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

A Novel Shadow-Free Feature Extractor for Real-Time Road Detection.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Regional Subspace Projection Coding for Image Retrieval.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

An MCMC-based prior sub-hypergraph matching in presence of outliers.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

An Empirical Study of Deformable Part Model with fast feature pyramid.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

A K-Nearest-Neighbor-Pooling method for graph matching.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

Coupled feature mapping and correlation mining for cross-media retrieval.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

Tube ConvNets: Better exploiting motion for action recognition.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

An Object-Aware Anomaly Detection and Localization in Surveillance Videos.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

A Fast and Lossless IDCT Design for AVS2 Codec.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

2015
High Resolution Local Structure-Constrained Image Upsampling.
IEEE Trans. Image Process., 2015

Dynamic macroblock wavefront parallelism for parallel video coding.
J. Vis. Commun. Image Represent., 2015

Bioinspired Mechanisms in Wireless Ad Hoc and Sensor Networks.
J. Sensors, 2015

Video pre-processing with JND-based Gaussian filtering of superpixels.
Proceedings of the Visual Information Processing and Communication VI, 2015

Weighted transformable spatial pyramid and scalable query for object retrieval.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Improving VLAD with regional PCA whitening.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Clustering Sentences with Density Peaks for Multi-document Summarization.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Context-adaptive fast motion estimation of HEVC.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Fast intra mode decision algorithm based on refinement in HEVC.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Signature of unique angles Histograms for 3D data description.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

An improved averaging combination method for image and object recognition.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

Learning class-specific pooling shapes for image classification.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Improved cluster center adaption for image classification.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Accelerating CDVS extraction on mobile platform.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Image deblurring using robust sparsity priors.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

A compact shot representation for video semantic indexing.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Image classification using RBM to encode local descriptors with group sparse learning.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

A low-light image enhancement method for both denoising and contrast enlarging.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

A novel integer-pixel motion estimation algorithm based on quadratic prediction.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Learning discriminative visual dictionary for natural scene categorization.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Stereo matching with space-constrained cost aggregation and segmentation-based disparity refinement.
Proceedings of the Three-Dimensional Image Processing, 2015

2014
Local Stereo Matching with Improved Matching Cost and Disparity Refinement.
IEEE Multim., 2014

Incremental Multi-manifold Out-of-Sample Data Prediction.
Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), Warsaw, Poland, August 11-14, 2014, 2014

An all-zero blocks early detection method for high-efficiency video coding.
Proceedings of the Visual Information Processing and Communication V, 2014

Low-cost multi-hypothesis motion compensation for video coding.
Proceedings of the Visual Information Processing and Communication V, 2014

A new frame interpolation method with pixel-level motion vector field.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

An approach to support stereoscopic 3D web.
Proceedings of the Symposium on Applied Computing, 2014

Cost-volume filtering-based stereo matching with improved matching cost and secondary refinement.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

HEVC decoder acceleration on multi-core X86 platform.
Proceedings of the IEEE International Conference on Acoustics, 2014

A rendering approach for stereoscopic web pages.
Proceedings of the Stereoscopic Displays and Applications XXV, 2014

The design and implementation of stereoscopic 3D scalable vector graphics based on WebKit.
Proceedings of the Stereoscopic Displays and Applications XXV, 2014

The rendering context for stereoscopic 3D web.
Proceedings of the Stereoscopic Displays and Applications XXV, 2014

Fast motion estimation methods for HEVC.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2014

2013
Dynamic MB-level Scheduling for parallel video coding.
Proceedings of the 30th Picture Coding Symposium, 2013

Acceleration of HEVC transform and inverse transform on ARM NEON platform.
Proceedings of the International Symposium on Intelligent Signal Processing and Communication Systems, 2013

Adaptive motion estimation order for frame rate up-conversion.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

High definition IEEE AVS decoder on ARM NEON platform.
Proceedings of the IEEE International Conference on Image Processing, 2013

A hybrid pixel-block based view synthesis for multiviewpoint 3D video.
Proceedings of the 3DTV-Conference 2013: The True Vision, 2013

2012
Robust hand tracking with refined CAMShift based on combination of Depth and image features.
Proceedings of the 2012 IEEE International Conference on Robotics and Biomimetics, 2012


  Loading...