Wenguan Wang

Orcid: 0000-0002-0802-9567

Affiliations:
  • Zhejiang University, China


According to our database1, Wenguan Wang authored at least 165 papers between 2014 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Prototype-Based Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

Scalable Video Object Segmentation With Identification Mechanism.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

Guest Editorial Introduction to the Special Issue on Label-Efficient Learning on Video Data.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Cross-Image Pixel Contrasting for Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2024

Learning to Follow and Generate Instructions for Language-Capable Navigation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Semantic Hierarchy-Aware Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

Looking Beyond Single Images for Weakly Supervised Semantic Segmentation Learning.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Human-Object Interaction Detection Collaborated with Large Relation-driven Diffusion Models.
CoRR, 2024

Scene Graph Generation with Role-Playing Large Language Models.
CoRR, 2024

Vision-Language Navigation with Energy-Based Policy.
CoRR, 2024

Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation.
CoRR, 2024

Image Segmentation in Foundation Model Era: A Survey.
CoRR, 2024

Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation.
CoRR, 2024

Visual Knowledge in the Big Model Era: Retrospect and Prospect.
CoRR, 2024

Retrosynthesis prediction enhanced by in-silico reaction data augmentation.
CoRR, 2024

DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models.
CoRR, 2024

A Survey on 3D Gaussian Splatting.
CoRR, 2024

DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent).
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Facing the Elephant in the Room: Visual Prompt Tuning or Full finetuning?
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Nonverbal Interaction Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-Driven Diffusion.
Proceedings of the Computer Vision - ECCV 2024, 2024

Controllable Navigation Instruction Generation with Chain of Thought Prompting.
Proceedings of the Computer Vision - ECCV 2024, 2024

Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data.
Proceedings of the Computer Vision - ECCV 2024, 2024

Navigation Instruction Generation with BEV Perception and Large Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

General and Task-Oriented Video Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Psychometry: An Omnifit Model for Image Reconstruction from Human Brain Activity.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Clustering for Protein Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Volumetric Environment Representation for Vision-Language Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Clustering Propagation for Universal Medical Image Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Neural Clustering Based Visual Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Poly Kernel Inception Network for Remote Sensing Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Production.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Interpretable3D: An Ad-Hoc Interpretable Classifier for 3D Point Clouds.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Local-Global Context Aware Transformer for Language-Guided Video Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Guest Editorial: Learning from limited annotations for computer vision tasks.
IET Comput. Vis., August, 2023

Differentiable Multi-Granularity Human Parsing.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Language-Aware Spatial-Temporal Collaboration for Referring Video Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

A Survey on Deep Learning Technique for Video Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Coarse-to-Fine Video Instance Segmentation With Factorized Conditional Appearance Flows.
IEEE CAA J. Autom. Sinica, May, 2023

Active Perception for Visual-Language Navigation.
Int. J. Comput. Vis., March, 2023

LOGICSEG: Parsing Visual Semantics with Neural Logic Learning and Reasoning.
CoRR, 2023

ClusterFormer: Clustering As A Universal Visual Learner.
CoRR, 2023

E^2VPT: An Effective and Efficient Approach for Visual Prompt Tuning.
CoRR, 2023

Kefa: A Knowledge Enhanced and Fine-grained Aligned Speaker for Navigation Instruction Generation.
CoRR, 2023

Segment and Track Anything.
CoRR, 2023

ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments.
CoRR, 2023

ClusterFomer: Clustering As A Universal Visual Learner.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Neural-Logic Human-Object Interaction Detection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

CLUSTSEG: Clustering for Universal Segmentation.
Proceedings of the International Conference on Machine Learning, 2023

Visual Recognition with Deep Nearest Centroids.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Large-Scale Person Detection and Localization using Overhead Fisheye Cameras.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Dreamwalker: Mental Planning for Continuous Vision-Language Navigation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Bird's-Eye-View Scene Graph for Vision-Language Navigation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Logic-induced Diagnostic Reasoning for Semi-supervised Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

LogicSeg: Parsing Visual Semantics with Neural Logic Learning and Reasoning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

E<sup>2</sup>VPT: An Effective and Efficient Approach for Visual Prompt Tuning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Clustering based Point Cloud Representation Learning for 3D Analysis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Boosting Video Object Segmentation via Space-Time Correspondence Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

LANA: A Language-Capable Navigator for Instruction Following and Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Cascaded Parsing of Human-Object Interaction Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Monocular 3D Pose Estimation via Pose Grammar and Data Augmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Generative VoxelNet: Learning Energy-Based Models for 3D Shape Synthesis and Analysis.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Hierarchical Human Semantic Parsing With Comprehensive Part-Relation Modeling.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Salient Object Detection in the Deep Learning Era: An In-Depth Survey.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Towards a Weakly Supervised Framework for 3D Point Cloud Object Detection and Annotation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Zero-Shot Video Object Segmentation With Co-Attention Siamese Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Segmenting Objects From Relational Visual Data.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Editorial: Human visual saliency and artificial neural attention in deep learning.
Neurocomputing, 2022

Towards Data-and Knowledge-Driven Artificial Intelligence: A Survey on Neuro-Symbolic Computing.
CoRR, 2022

LSAP: Rethinking Inversion Fidelity, Perception and Editability in GAN Latent Space.
CoRR, 2022

Learning Equivariant Segmentation with Instance-Unique Querying.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Versatile Embodied Navigation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Target-Driven Structured Transformer Planner for Vision-Language Navigation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

ProposalContrast: Unsupervised Pre-training for LiDAR-Based 3D Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Semi-supervised 3D Object Detection with Proficient Teachers.
Proceedings of the Computer Vision - ECCV 2022, 2022

Towards Interpretable Video Super-Resolution via Alternating Optimization.
Proceedings of the Computer Vision - ECCV 2022, 2022

Reference-Based Image Super-Resolution with Deformable Attention Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

Rethinking Semantic Segmentation: A Prototype View.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Visual Abductive Reasoning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Locality-Aware Inter-and Intra-Video Reconstruction for Self-Supervised Correspondence Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Deep Hierarchical Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Revisiting Video Saliency Prediction in the Deep Learning Era.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Paying Attention to Video Object Pattern Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Dynamical Hyperparameter Optimization via Deep Reinforcement Learning in Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

TADA: Taxonomy Adaptive Domain Adaptation.
CoRR, 2021

Collaborative Visual Navigation.
CoRR, 2021

Rethinking Cross-modal Interaction from a Top-down Perspective for Referring Video Object Segmentation.
CoRR, 2021




Exploring Cross-Image Pixel Contrast for Semantic Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Face Forensics in the Wild.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Structured Scene Memory for Vision-Language Navigation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Video Saliency Prediction Using Spatiotemporal Residual Attentive Networks.
IEEE Trans. Image Process., 2020

Motion-Aware Rapid Video Saliency Detection.
IEEE Trans. Circuits Syst. Video Technol., 2020

Inferring Salient Objects from Human Fixations.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

LID 2020: The Learning from Imperfect Data Challenge Results.
CoRR, 2020

Active Visual Information Gathering for Vision-Language Navigation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Mining Cross-Image Semantics for Weakly Supervised Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Weakly Supervised 3D Object Detection from Lidar Point Cloud.
Proceedings of the Computer Vision - ECCV 2020, 2020

Video Object Segmentation with Episodic Graph Memory Networks.
Proceedings of the Computer Vision - ECCV 2020, 2020

Cascaded Human-Object Interaction Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

A Unified Object Motion and Affinity Model for Online Multi-Object Tracking.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Hierarchical Human Parsing With Typed Part-Relation Reasoning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning Video Object Segmentation From Unlabeled Videos.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Comic-guided speech synthesis.
ACM Trans. Graph., 2019

Stereo Video Object Segmentation Using Stereoscopic Foreground Trajectories.
IEEE Trans. Cybern., 2019

Better Dense Trajectories by Motion in Videos.
IEEE Trans. Cybern., 2019

Semi-Supervised Video Object Segmentation with Super-Trajectories.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

A Deep Network Solution for Attention and Aesthetics Aware Photo Cropping.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Teacher-Students Knowledge Distillation for Siamese Trackers.
CoRR, 2019

Human vs Machine Attention in Neural Networks: A Comparative Study.
CoRR, 2019

Salient Object Detection in the Deep Learning Era: An In-Depth Survey.
CoRR, 2019

Optimizing the F-Measure for Threshold-Free Salient Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning Compositional Neural Information Fusion for Human Parsing.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Zero-Shot Video Object Segmentation via Attentive Graph Neural Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Human-Aware Motion Deblurring.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Understanding Human Gaze Communication by Spatio-Temporal Graph Reasoning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Reasoning Visual Dialogs With Structural and Partial Observations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Salient Object Detection With Pyramid Attention and Salient Edges.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning Unsupervised Video Object Segmentation Through Visual Attention.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

An Iterative and Cooperative Top-Down and Bottom-Up Inference Network for Salient Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

See More, Know More: Unsupervised Video Object Segmentation With Co-Attention Siamese Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Shifting More Attention to Video Salient Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Improving Neural Machine Translation by Achieving Knowledge Transfer with Sentence Alignment Learning.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

2018
Video Salient Object Detection via Fully Convolutional Networks.
IEEE Trans. Image Process., 2018

Deep Visual Attention Prediction.
IEEE Trans. Image Process., 2018

Video Saliency Detection Using Object Proposals.
IEEE Trans. Cybern., 2018

Video Co-Saliency Guided Co-Segmentation.
IEEE Trans. Circuits Syst. Video Technol., 2018

Saliency-Aware Video Object Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Pyramid Dilated Deeper ConvLSTM for Video Salient Object Detection.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learning Human-Object Interactions by Graph Parsing Neural Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learning Descriptor Networks for 3D Shape Synthesis and Analysis.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Attentive Fashion Grammar Network for Fashion Landmark Detection and Clothing Category Classification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Revisiting Video Saliency: A Large-Scale Benchmark and a New Model.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Salient Object Detection Driven by Fixation Prediction.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Inferring Shared Attention in Social Scene Videos.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Examining CNN Representations With Respect to Dataset Bias.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Learning Pose Grammar to Encode Human Body Configuration for 3D Pose Estimation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Stereoscopic Thumbnail Creation via Efficient Stereo Saliency Detection.
IEEE Trans. Vis. Comput. Graph., 2017

Occlusion-Aware Real-Time Object Tracking.
IEEE Trans. Multim., 2017

Selective Video Object Cutout.
IEEE Trans. Image Process., 2017

Learning Knowledge-guided Pose Grammar Machine for 3D Human Pose Estimation.
CoRR, 2017

Deep Learning For Video Saliency Detection.
CoRR, 2017

Selective Video Cutout using Global Pyramid Models and Local Uncertainty Propagation.
CoRR, 2017

Super-Trajectory for Video Segmentation.
CoRR, 2017

Super-Trajectory for Video Segmentation.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Deep Cropping via Attention Box Prediction and Aesthetics Assessment.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Higher-Order Image Co-segmentation.
IEEE Trans. Multim., 2016

Correspondence Driven Saliency Transfer.
IEEE Trans. Image Process., 2016

Real-Time Superpixel Segmentation by DBSCAN Clustering Algorithm.
IEEE Trans. Image Process., 2016

2015
Video Object Segmentation Via Dense Trajectories.
IEEE Trans. Multim., 2015

Consistent Video Saliency Using Local Gradient Flow Optimization and Global Refinement.
IEEE Trans. Image Process., 2015

Robust Video Object Cosegmentation.
IEEE Trans. Image Process., 2015

Saliency-aware geodesic video object segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Lazy Random Walks for Superpixel Segmentation.
IEEE Trans. Image Process., 2014


  Loading...