Pichao Wang

Orcid: 0000-0002-1430-0237

According to our database, Pichao Wang authored at least 110 papers between 2013 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2024

SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels.

Int. J. Comput. Vis., 2024

DFN: A deep fusion network for flexible single and multi-modal action recognition.

Expert Syst. Appl., 2024

Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning.

CoRR, 2024

Unraveling Movie Genres through Cross-Attention Fusion of Bi-Modal Synergy of Poster.

CoRR, 2024

One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos.

CoRR, 2024

GQE: Generalized Query Expansion for Enhanced Text-Video Retrieval.

CoRR, 2024

Hallucination of Multimodal Large Language Models: A Survey.

CoRR, 2024

Align2Concept: Language Guided Interpretable Image Recognition by Visual Prototype and Textual Concept Alignment.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Adaptive Query Selection for Camouflaged Instance Segmentation.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

A Unified Multimodal De- and Re-Coupling Framework for RGB-D Motion Recognition.

IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

What Limits the Performance of Local Self-attention?

Int. J. Comput. Vis., October, 2023

Multi-hypothesis representation learning for transformer-based 3D human pose estimation.

Pattern Recognit., September, 2023

Exploiting Temporal Contexts With Strided Transformer for 3D Human Pose Estimation.

IEEE Trans. Multim., 2023

BP-triplet net for unsupervised domain adaptation: A Bayesian perspective.

Pattern Recognit., 2023

FT-HID: a large-scale RGB-D dataset for first- and third-person human interaction analysis.

Neural Comput. Appl., 2023

Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation.

CoRR, 2023

Human Pose-based Estimation, Tracking and Action Recognition with Deep Learning: A Survey.

CoRR, 2023

Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm.

CoRR, 2023

Multi-stage Factorized Spatio-Temporal Representation for RGB-D Action and Gesture Recognition.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Revisiting Vision Transformer from the View of Path Ensemble.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Selective Structured State-Spaces for Long-Form Video Understanding.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DOAD: Decoupled One Stage Action Detection Network.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Making Vision Transformers Efficient from A Token Sparsification View.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Frequency Domain Disentanglement for Arbitrary Neural Style Transfer.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Head-Free Lightweight Semantic Segmentation with Linear Transformer.

Bo Dong, Pichao Wang, Fan Wang

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Class-Aware Feature Aggregation Network for Video Object Detection.

IEEE Trans. Circuits Syst. Video Technol., 2022

Trear: Transformer-Based RGB-D Egocentric Action Recognition.

IEEE Trans. Cogn. Dev. Syst., 2022

Effective Vision Transformer Training: A Data-Centric Perspective.

CoRR, 2022

BP-Triplet Net for Unsupervised Domain Adaptation: A Bayesian Perspective.

Shanshan Wang, Lei Zhang, Pichao Wang

CoRR, 2022

VTC-LFC: Vision Transformer Compression with Low-Frequency Components.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation.

Proceedings of the HCMA@MM 2022: Proceedings of the 3rd International Workshop on Human-Centric Multimedia Analysis, 2022

CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Image-to-Video Re-Identification via Mutual Discriminative Knowledge Transfer.

Pichao Wang, Fan Wang, Hao Li

Proceedings of the IEEE International Conference on Acoustics, 2022

TransFGU: A Top-Down Approach to Fine-Grained Unsupervised Semantic Segmentation.

Proceedings of the Computer Vision - ECCV 2022, 2022

KVT: k-NN Attention for Boosting Vision Transformers.

Proceedings of the Computer Vision, 2022

Decoupling and Recoupling Spatiotemporal Representation for RGB-D-based Motion Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Focal and Global Spatial-Temporal Transformer for Skeleton-Based Action Recognition.

Proceedings of the Computer Vision - ACCV 2022, 2022

Scaled ReLU Matters for Training Vision Transformers.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

BR²Net: Defocus Blur Detection Via a Bidirectional Channel Attention Residual Refining Network.

IEEE Trans. Multim., 2021

Searching Multi-Rate and Multi-Modal Temporal Enhanced Networks for Gesture Recognition.

IEEE Trans. Image Process., 2021

TransRPPG: Remote Photoplethysmography Transformer for 3D Mask Face Presentation Attack Detection.

IEEE Signal Process. Lett., 2021

Transformer guided geometry model for flow-based unsupervised visual odometry.

Neural Comput. Appl., 2021

Context and Structure Mining Network for Video Object Detection.

Int. J. Comput. Vis., 2021

ELSA: Enhanced Local Self-Attention for Vision Transformer.

CoRR, 2021

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification.

CoRR, 2021

KVT: k-NN Attention for Boosting Vision Transformers.

CoRR, 2021

Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation.

CoRR, 2021

Lifting Transformer for 3D Human Pose Estimation in Video.

CoRR, 2021

Zen-NAS: A Zero-Shot NAS for High-Performance Deep Image Recognition.

CoRR, 2021

Zen-NAS: A Zero-Shot NAS for High-Performance Image Recognition.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

TransReID: Transformer-based Object Re-Identification.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities.

Sensors, 2020

A Review of Dynamic Maps for 3D Human Motion Recognition Using ConvNets and Its Improvement.

Neural Process. Lett., 2020

SAR-NAS: Skeleton-based action recognition via neural architecture searching.

J. Vis. Commun. Image Represent., 2020

RobustTAD: Robust Time Series Anomaly Detection via Decomposition and Convolutional Neural Networks.

CoRR, 2020

Exploiting Better Feature Aggregation for Video Object Detection.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

R²MRF: Defocus Blur Detection via Recurrently Refining Multi-Scale Residual Features.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Learning a Joint Affinity Graph for Multiview Subspace Clustering.

IEEE Trans. Multim., 2019

Adaptive Hypergraph Embedded Semi-Supervised Multi-Label Image Annotation.

IEEE Trans. Multim., 2019

Multiview-Based 3-D Action Recognition Using Deep Networks.

IEEE Trans. Hum. Mach. Syst., 2019

Unsupervised feature selection via latent representation learning and manifold regularization.

Neural Networks, 2019

Learning attentive dynamic maps (ADMs) for Understanding Human Actions.

J. Vis. Commun. Image Represent., 2019

DVONet: Unsupervised Monocular Depth Estimation and Visual Odometry.

Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

Light Weight Stereo Matching via Deep Extraction and Integration of Low and High Level Information.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Self-Attention Guided Deep Features for Action Recognition.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Salient Object Detection via Recurrently Aggregating Spatial Attention Weighted Cross-Level Deep Features.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

2018

Depth Pooling Based Large-Scale 3-D Action Recognition With Convolutional Neural Networks.

IEEE Trans. Multim., 2018

Skeleton Optical Spectra-Based Action Recognition Using Convolutional Neural Networks.

IEEE Trans. Circuits Syst. Video Technol., 2018

Action recognition based on joint trajectory maps with convolutional neural networks.

Knowl. Based Syst., 2018

Robust unsupervised feature selection via dual self-representation and manifold regularization.

Knowl. Based Syst., 2018

Consensus learning guided multi-view unsupervised feature selection.

Knowl. Based Syst., 2018

Online human action recognition based on incremental learning of weighted covariance descriptors.

Inf. Sci., 2018

Saliency detection via affinity graph learning and weighted manifold ranking.

Neurocomputing, 2018

Robust graph regularized unsupervised feature selection.

Expert Syst. Appl., 2018

RGB-D-based human motion recognition with deep learning: A survey.

Comput. Vis. Image Underst., 2018

Depth Pooling Based Large-scale 3D Action Recognition with Convolutional Neural Networks.

CoRR, 2018

Spatially and Temporally Structured Global to Local Aggregation of Dynamic Depth Information for Action Recognition.

IEEE Access, 2018

Cooperative Training of Deep Aggregation Networks for RGB-D Action Recognition.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Salient Object Detection via Weighted Low Rank Matrix Recovery.

IEEE Signal Process. Lett., 2017

Joint Distance Maps Based Action Recognition With Convolutional Neural Networks.

IEEE Signal Process. Lett., 2017

An effective edge-preserving smoothing method for image manipulation.

Digit. Signal Process., 2017

Skeleton-based action recognition using LSTM and CNN.

Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Investigation of different skeleton features for CNN-based 3D action recognition.

Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Weakly structured information aggregation for upper-body posture assessment using ConvNets.

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Large-Scale Multimodal Gesture Segmentation and Recognition Based on Convolutional Neural Networks.

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Large-Scale Multimodal Gesture Recognition Using Heterogeneous Networks.

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Structured Images for RGB-D Action Recognition.

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Scene Flow to Action Map: A New Representation for RGB-D Based Action Recognition with Convolutional Neural Networks.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Action Recognition From Depth Maps Using Deep Convolutional Neural Networks.

IEEE Trans. Hum. Mach. Syst., 2016

A Spectral and Spatial Approach of Coarse-to-Fine Blurred Image Region Detection.

IEEE Signal Process. Lett., 2016

RGB-D-based action recognition datasets: A survey.

Pattern Recognit., 2016

Salient object detection using color spatial distribution and minimum spanning tree weight.

Multim. Tools Appl., 2016

Large-scale Continuous Gesture Recognition Using Convolutional Neutral Networks.

CoRR, 2016

Combining ConvNets with Hand-Crafted Features for Action Recognition Based on an HMM-SVM Classifier.

CoRR, 2016

Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks.

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

A Large Scale RGB-D Dataset for Action Recognition.

Proceedings of the Understanding Human Activities Through 3D Sensors, 2016

Large-scale Continuous Gesture Recognition Using Convolutional Neural Networks.

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Large-scale Isolated Gesture Recognition using Convolutional Neural Networks.

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

2015

A novel rate control algorithm for video coding based on fuzzy-PID controller.

Signal Image Video Process., 2015

Deep Convolutional Neural Networks for Action Recognition Using Depth Map Sequences.

CoRR, 2015

Online Action Recognition based on Incremental Learning of Weighted Covariance Descriptors.

CoRR, 2015

ConvNets-Based Action Recognition from Depth Maps through Virtual Cameras and Pseudocoloring.

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

2014

Mining Mid-Level Features for Action Recognition Based on Effective Skeleton Representation.

Proceedings of the 2014 International Conference on Digital Image Computing: Techniques and Applications, 2014

2013

An Improved Direction Finding Algorithm Based on Toeplitz Approximation.

Sensors, 2013
