Weining Wang

Orcid: 0000-0001-7299-6431

Affiliations:
  • Institute of Automation, Chinese Academy of Sciences, Beijing, China


According to our database1, Weining Wang authored at least 32 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Learning Disentangled Representation for One-Shot Progressive Face Swapping.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Open-Set Single-Domain Generalization for Robust Face Anti-Spoofing.
Int. J. Comput. Vis., November, 2024

Reparameterizing and dynamically quantizing image features for image generation.
Pattern Recognit., February, 2024

Temporal Action Proposal Generation With Action Frequency Adaptive Network.
IEEE Trans. Multim., 2024

Sounding Video Generator: A Unified Framework for Text-Guided Sounding Video Generation.
IEEE Trans. Multim., 2024

Learnable Feature Augmentation Framework for Temporal Action Localization.
IEEE Trans. Image Process., 2024

COMUNI: Decomposing Common and Unique Video Signals for Diffusion-based Video Generation.
CoRR, 2024

MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

2023
Semantic-based conditional generative adversarial hashing with pairwise labels.
Pattern Recognit., July, 2023

CASIA-E: A Large Comprehensive Dataset for Gait Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Anchor-free temporal action localization via Progressive Boundary-aware Boosting.
Inf. Process. Manag., 2023

VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset.
CoRR, 2023

GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ED-T2V: An Efficient Training Framework for Diffusion-based Text-to-Video Generation.
Proceedings of the International Joint Conference on Neural Networks, 2023

WL-MSR: Watch and Listen for Multimodal Subtitle Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

MOSO: Decomposing MOtion, Scene and Object for Video Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Semi-Supervised Temporal Action Proposal Generation via Exploiting 2-D Proposal Map.
IEEE Trans. Multim., 2022

An Efficient Sampling-Based Attention Network for Semantic Segmentation.
IEEE Trans. Image Process., 2022

Super-resolution semantic segmentation with relation calibrating network.
Pattern Recognit., 2022

Learning Disentangled Representation for One-shot Progressive Face Swapping.
CoRR, 2022

2021
Exploiting Spatial-Temporal Semantic Consistency for Video Scene Parsing.
CoRR, 2021

OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation.
CoRR, 2021

Multi-caption Text-to-Face Synthesis: Dataset and Algorithm.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Temporal Memory Attention for Video Semantic Segmentation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Keypoint Context Aggregation for Human Pose Estimation.
Proceedings of the Image and Graphics - 11th International Conference, 2021

HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Face Sketch Synthesis via Semantic-Driven Generative Adversarial Network.
Proceedings of the International IEEE Joint Conference on Biometrics, 2021

CAS-AIR-3D Face: A Low-Quality, Multi-Modal and Multi-Pose 3D Face Database.
Proceedings of the International IEEE Joint Conference on Biometrics, 2021

2020
Long video question answering: A Matching-guided Attention Model.
Pattern Recognit., 2020

Robust Object Tracking via Information Theoretic Measures.
Int. J. Autom. Comput., 2020

AutoCaption: Image Captioning with Neural Architecture Search.
CoRR, 2020

2019
Language-Driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019


  Loading...