Yaowei Wang

Orcid: 0000-0003-2197-9038

Affiliations:
  • Peng Cheng Laboratory, Shenzhen, China


According to our database1, Yaowei Wang authored at least 230 papers between 2003 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Improving deep representation learning via auxiliary learnable target coding.
Pattern Recognit., 2025

2024
Distributed Semantic Communications for Multimodal Audio-Visual Parsing Tasks.
IEEE Trans. Green Commun. Netw., December, 2024

Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network With Token Migration.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

TPTE: Text-Guided Patch Token Exploitation for Unsupervised Fine-Grained Representation Learning.
ACM Trans. Multim. Comput. Commun. Appl., November, 2024

Spatial-Temporal Correlation Learning for Traffic Demand Prediction.
IEEE Trans. Intell. Transp. Syst., November, 2024

Knowledge-Based Multiple Relations Modeling for Traffic Forecasting.
IEEE Trans. Intell. Transp. Syst., September, 2024

ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2024

Self-Supervised Tracking via Target-Aware Data Synthesis.
IEEE Trans. Neural Networks Learn. Syst., July, 2024

Multiple-Level Distillation for Video Fine-Grained Accident Detection.
IEEE Trans. Circuits Syst. Video Technol., June, 2024

Weakly Supervised Video Anomaly Detection via Self-Guided Temporal Discriminative Transformer.
IEEE Trans. Cybern., May, 2024

Universal Object Detection with Large Vision Model.
Int. J. Comput. Vis., April, 2024

VisEvent: Reliable Object Tracking via Collaboration of Frame and Event Flows.
IEEE Trans. Cybern., March, 2024

Exploring and Exploiting High-Order Spatial-Temporal Dynamics for Long-Term Frame Prediction.
IEEE Trans. Circuits Syst. Video Technol., March, 2024

Towards Bridged Vision and Language: Learning Cross-Modal Knowledge Representation for Relation Extraction.
IEEE Trans. Circuits Syst. Video Technol., January, 2024

Prompt-Based Learning for Unpaired Image Captioning.
IEEE Trans. Multim., 2024

CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding.
IEEE Trans. Multim., 2024

Muti-Modal Emotion Recognition via Hierarchical Knowledge Distillation.
IEEE Trans. Multim., 2024

Recovering Generalization via Pre-Training-Like Knowledge Distillation for Out-of-Distribution Visual Question Answering.
IEEE Trans. Multim., 2024

SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification.
IEEE Trans. Multim., 2024

CRADA: Cross Domain Object Detection With Cyclic Reconstruction and Decoupling Adaptation.
IEEE Trans. Multim., 2024

Context-Guided Black-Box Attack for Visual Tracking.
IEEE Trans. Multim., 2024

Fine-Grained Accident Detection: Database and Algorithm.
IEEE Trans. Image Process., 2024

Local context attention learning for fine-grained scene graph generation.
Pattern Recognit., 2024

Multi-scale architectures matter: Examining the adversarial robustness of flow-based lossless compression.
Pattern Recognit., 2024

CMCL: Cross-Modal Compressive Learning for Resource-Constrained Intelligent IoT Systems.
IEEE Internet Things J., 2024

OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.
CoRR, 2024

EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment.
CoRR, 2024

LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation.
CoRR, 2024

Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS.
CoRR, 2024

Event Stream based Sign Language Translation: A High-Definition Benchmark Dataset and A New Algorithm.
CoRR, 2024

Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers.
CoRR, 2024

OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion.
CoRR, 2024

Learning Spatial-Semantic Features for Robust Video Object Segmentation.
CoRR, 2024

Retain, Blend, and Exchange: A Quality-aware Spatial-Stereo Fusion Approach for Event Stream Recognition.
CoRR, 2024

PVUW 2024 Challenge on Complex Video Understanding: Methods and Results.
CoRR, 2024

1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation.
CoRR, 2024

vHeat: Building Vision Models upon Heat Conduction.
CoRR, 2024

MambaVC: Learned Visual Compression with Selective State Spaces.
CoRR, 2024

LG-VQ: Language-Guided Codebook Learning.
CoRR, 2024

Driving Referring Video Object Segmentation with Vision-Language Pre-trained Models.
CoRR, 2024

Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition.
CoRR, 2024

State Space Model for New-Generation Network Alternative to Transformers: A Survey.
CoRR, 2024

MB-RACS: Measurement-Bounds-based Rate-Adaptive Image Compressed Sensing Network.
CoRR, 2024

Virtual Classification: Modulating Domain-Specific Knowledge for Multidomain Crowd Counting.
CoRR, 2024

VMamba: Visual State Space Model.
CoRR, 2024

Calibration for Long-tailed Scene Graph Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

HiVG: Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

CoIn: A Lightweight and Effective Framework for Story Visualization and Continuation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

CoTuning: A Large-Small Model Collaborating Distillation Framework for Better Model Generalization.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Motion-aware Latent Diffusion Models for Video Frame Interpolation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

NivTA: Towards a Naturally Interactable Edu-Metaverse Teaching Assistant for CAVE.
Proceedings of the IEEE International Conference on Metaverse Computing, 2024

MLP-DINO: Category Modeling and Query Graphing with Deep MLP for Object Detection.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Multi-Factor Adaptive Vision Selection for Egocentric Video Question Answering.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Towards Robust and Efficient Cloud-Edge Elastic Model Adaptation via Selective Entropy Distillation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Clip-Based Synergistic Knowledge Transfer for text-based Person Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2024

StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion.
Proceedings of the Computer Vision - ECCV 2024, 2024

Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance.
Proceedings of the Computer Vision - ECCV 2024, 2024

Modality-Collaborative Test-Time Adaptation for Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CricaVPR: Cross-Image Correlation-Aware Representation Learning for Visual Place Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RTracker: Recoverable Tracking via PN Tree Structured Memory.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Regressor-Segmenter Mutual Prompt Learning for Crowd Counting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HARDVS: Revisiting Human Activity Recognition with Dynamic Vision Sensors.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Generative Data Free Model Quantization With Knowledge Matching for Classification.
IEEE Trans. Circuits Syst. Video Technol., December, 2023

Self-Supervised Attentive Generative Adversarial Networks for Video Anomaly Detection.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

Multi-proxy feature learning for robust fine-grained visual recognition.
Pattern Recognit., November, 2023

Entity-Graph Enhanced Cross-Modal Pretraining for Instance-Level Product Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Egocentric Early Action Prediction via Multimodal Transformer-Based Dual Action Prediction.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

DCR-ReID: Deep Component Reconstruction for Cloth-Changing Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Conformer: Local Features Coupling Global Representations for Recognition and Detection.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey.
Mach. Intell. Res., August, 2023

DRAKE: Deep Pair-Wise Relation Alignment for Knowledge-Enhanced Multimodal Scene Graph Generation in Social Media Posts.
IEEE Trans. Circuits Syst. Video Technol., July, 2023

WDMNet: Modeling diverse variations of regional wind speed for multi-step predictions.
Neural Networks, May, 2023

Robust and Hierarchical Spatial Relation Analysis for Traffic Forecasting.
IEEE Trans. Intell. Transp. Syst., January, 2023

Unpaired Image Captioning by Image-Level Weakly-Supervised Visual Concept Recognition.
IEEE Trans. Multim., 2023

MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking.
IEEE Trans. Multim., 2023

DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition.
IEEE Trans. Multim., 2023

PolarPose: Single-Stage Multi-Person Pose Estimation in Polar Coordinates.
IEEE Trans. Image Process., 2023

TransWeaver: Weave Image Pairs for Class Agnostic Common Object Detection.
IEEE Trans. Image Process., 2023

Spatial-Temporal Graph Network for Video Crowd Counting.
IEEE Trans. Circuits Syst. Video Technol., 2023

Classification of single-view object point clouds.
Pattern Recognit., 2023

FFCA-Net: Stereo Image Compression via Fast Cascade Alignment of Side Information.
CoRR, 2023

Recognizing Conditional Causal Relationships about Emotions and Their Corresponding Conditions.
CoRR, 2023

Uncovering Hidden Connections: Iterative Tracking and Reasoning for Video-grounded Dialog.
CoRR, 2023

MixBCT: Towards Self-Adapting Backward-Compatible Training.
CoRR, 2023

ShuffleMix: Improving Representations via Channel-Wise Shuffle of Interpolated Hidden States.
CoRR, 2023

CLIP-VG: Self-paced Curriculum Adapting of CLIP via Exploiting Pseudo-Language Labels for Visual Grounding.
CoRR, 2023

Towards Efficient Task-Driven Model Reprogramming with Foundation Models.
CoRR, 2023

Reliability-Hierarchical Memory Network for Scribble-Supervised Video Object Segmentation.
CoRR, 2023

Backdoor for Debias: Mitigating Model Bias with Backdoor Attack-based Artificial Bias.
CoRR, 2023

DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition.
CoRR, 2023

Learning Mask-aware CLIP Representations for Zero-Shot Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Benign Shortcut for Debiasing: Fair Visual Recognition via Intervention with Shortcut Features.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Client-Adaptive Cross-Model Reconstruction Network for Modality-Incomplete Multimodal Federated Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

HumVis: Human-Centric Visual Analysis System.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Manifold-Aware Self-Training for Unsupervised Domain Adaptation on Regressing 6D Object Pose.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Gradual Study Advising with Course Knowledge Graphs.
Proceedings of the Advances in Web-Based Learning - ICWL 2023, 2023

Spikformer: When Spiking Neural Network Meets Transformer.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

The First Visual Object Tracking Segmentation VOTS2023 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CiteTracker: Correlating Image and Text for Visual Tracking.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Strip-MLP: Efficient Token Interaction for Vision MLP.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Recurrent Fine-Grained Self-Attention Network for Video Crowd Counting.
Proceedings of the IEEE International Conference on Acoustics, 2023

Integrally Pre-Trained Transformer Pyramid Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CIGAR: Cross-Modality Graph Reasoning for Domain Adaptive Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

AsyFOD: An Asymmetric Adaptation Paradigm for Few-Shot Domain Adaptive Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unlearnable Clusters: Towards Label-Agnostic Unlearnable Examples.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Adaptive Graph Neural Diffusion for Traffic Demand Forecasting.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Digging out Discrimination Information from Generated Samples for Robust Visual Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Learned Distributed Image Compression with Multi-Scale Patch Matching in Feature Domain.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Optimized separable convolution: Yet another efficient convolution operator.
AI Open, January, 2022

Tracking by Joint Local and Global Search: A Target-Aware Attention-Based Approach.
IEEE Trans. Neural Networks Learn. Syst., 2022

Attribute-Aware Feature Encoding for Object Recognition and Segmentation.
IEEE Trans. Multim., 2022

Bidirectional Posture-Appearance Interaction Network for Driver Behavior Recognition.
IEEE Trans. Intell. Transp. Syst., 2022

Abnormal Event Detection Using Deep Contrastive Learning for Intelligent Video Surveillance System.
IEEE Trans. Ind. Informatics, 2022

Adaptive Spatial Pyramid Constraint for Hyperspectral Image Classification With Limited Training Samples.
IEEE Trans. Geosci. Remote. Sens., 2022

Self-Supervision-Augmented Deep Autoencoder for Unsupervised Visual Anomaly Detection.
IEEE Trans. Cybern., 2022

Multi-attribute object detection benchmark for smart city.
Multim. Syst., 2022

A survey of crowd counting and density estimation based on convolutional neural network.
Neurocomputing, 2022

Million-scale Object Detection with Large Vision Model.
CoRR, 2022

Revisiting Color-Event based Tracking: A Unified Network, Dataset, and Metric.
CoRR, 2022

Learned Distributed Image Compression with Multi-Scale Patch Matching in Feature Domai.
CoRR, 2022

Global-Supervised Contrastive Loss and View-Aware-Based Post-Processing for Vehicle Re-Identification.
CoRR, 2022

Boost Test-Time Performance with Closed-Loop Inference.
CoRR, 2022

Peng Cheng Object Detection Benchmark for Smart City.
CoRR, 2022

Conceptor Learning for Class Activation Mapping.
CoRR, 2022

Identifying the kind behind SMILES - anatomical therapeutic chemical classification using structure-only representations.
Briefings Bioinform., 2022

Asymptotic optimality for active learning processes.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Learning to Share in Networked Multi-Agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Mixed Supervision for Instance Learning in Object Detection with Few-shot Annotation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Span-based Audio-Visual Localization.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Hierarchical Graph Embedded Pose Regularity Learning via Spatio-Temporal Transformer for Abnormal Behavior Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Intelligent Instructional Design via Interactive Knowledge Graph Editing.
Proceedings of the Learning Technologies and Systems, 2022

Downscaling and Overflow-aware Model Compression for Efficient Vision Processors.
Proceedings of the 42nd IEEE International Conference on Distributed Computing Systems, 2022

KCUBE: A Knowledge Graph University Curriculum Framework for Student Advising and Career Planning.
Proceedings of the Blended Learning: Engaging Students in the New Normal Era, 2022

DAS: Densely-Anchored Sampling for Deep Metric Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Fine-Grained Object Classification via Self-Supervised Pose Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Towards End-to-End Image Compression and Analysis with Transformers.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Progressive Feature Enhancement for Person Re-Identification.
IEEE Trans. Image Process., 2021

Dynamic Attention Guided Multi-Trajectory Analysis for Single Object Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2021

Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding.
IEEE Trans. Circuits Syst. Video Technol., 2021

Diverse part attentive network for video-based person re-identification.
Pattern Recognit. Lett., 2021

Towards effective deep transfer via attentive feature alignment.
Neural Networks, 2021

Learning to Share in Multi-Agent Reinforcement Learning.
CoRR, 2021

An Informative Tracking Benchmark.
CoRR, 2021

PanGu-α: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation.
CoRR, 2021

AAformer: Auto-Aligned Transformer for Person Re-Identification.
CoRR, 2021

Anomaly Detection with Prototype-Guided Discriminative Latent Embeddings.
Proceedings of the IEEE International Conference on Data Mining, 2021

Conformer: Local Features Coupling Global Representations for Visual Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Reducing Image Compression Artifacts for Deep Neural Networks.
Proceedings of the 31st Data Compression Conference, 2021

Towards More Flexible and Accurate Object Tracking With Natural Language: Algorithms and Benchmark.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Contrastive Neural Architecture Search With Neural Architecture Comparators.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Hierarchically and Cooperatively Learning Traffic Signal Control.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Adaptation-Oriented Feature Projection for One-Shot Action Recognition.
IEEE Trans. Multim., 2020

Compositional Few-Shot Recognition with Primitive Discovery and Enhancing.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Hybrid Dynamic-static Context-aware Attention Network for Action Assessment in Long Videos.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Anonymous Model Pruning for Compressing Deep Neural Networks.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Learning Compact Networks via Similarity-Aware Channel Pruning.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Prune it Yourself: Automated Pruning by Multiple Level Sensitivity.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

End-Edge-Cloud Collaborative System: A Video Big Data Processing and Analysis Architecture.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

R-SiamNet: ROI-Align Pooling Baesd Siamese Network for Object Tracking.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Large Batch Optimization for Object Detection: Training COCO in 12 minutes.
Proceedings of the Computer Vision - ECCV 2020, 2020

An Asymmetric Modeling for Action Assessment.
Proceedings of the Computer Vision - ECCV 2020, 2020

Modular Graph Attention Network for Complex Visual Relational Reasoning.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Towards Accurate Low Bit-Width Quantization with Multiple Phase Adaptations.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Learning from Multi-annotator Data: A Noise-aware Classification Framework.
ACM Trans. Inf. Syst., 2019

Can Categories and Attributes Be Learned in a Multi-Task Way?
IEEE Trans. Multim., 2019

P-ODN: Prototype based Open Deep Network for Open Set Recognition.
CoRR, 2019

EAN: Event Attention Network for Stock Price Trend Prediction based on Sentimental Embedding.
Proceedings of the 11th ACM Conference on Web Science, 2019

Bi-directional Re-ranking for Person Re-identification.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Transductive Episodic-Wise Adaptive Metric for Few-Shot Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Efficient and Fast Coefficient Sign Inference for Video Coding.
Proceedings of the Data Compression Conference, 2019

2018
Joint Semantic and Latent Attribute Modelling for Cross-Class Transfer Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Cross-Domain Adversarial Feature Learning for Sketch Re-identification.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Fast Compressed Domain Copy Detection with Motion Vector Imaging.
Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018

Multi-Pose Learning based Head-Shoulder Re-identification.
Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018

Hierarchical Temporal Memory Enhanced One-Shot Distance Learning for Action Recognition.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Attribute Driven Zero-Shot Classification and Segmentation.
Proceedings of the 2018 IEEE International Conference on Multimedia & Expo Workshops, 2018

SFCM: Learn a Pooling Kernel for Weakly Supervised Object Localization.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

ODN: Opening the Deep Network for Open-Set Action Recognition.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Temporal Attentive Network for Action Recognition.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Toward Efficient Simultaneous Detection and Segmentation.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Deep Transfer Learning for Person Re-Identification.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

2017
Sequential Deep Trajectory Descriptor for Action Recognition With Three-Stream CNN.
IEEE Trans. Multim., 2017

Rate-Performance-Loss Optimization for Inter-Frame Deep Feature Coding From Videos.
IEEE Trans. Image Process., 2017

A fast skip and direction adaptive search algorithm for Sub-Pixel Motion Estimation on HEVC.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Deep hashing with mixed supervised losses for image search.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Deep hashing with multi-task learning for large-scale instance-level vehicle search.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Exploiting Multi-grain Ranking Constraints for Precisely Searching Visually-similar Vehicles.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Learning Long-Term Dependencies for Action Recognition with a Biologically-Inspired Deep Network.
Proceedings of the IEEE International Conference on Computer Vision, 2017

A Network Framework for Noisy Label Aggregation in Social Media.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Fixed-point Gaussian Mixture Model for analysis-friendly surveillance video coding.
Comput. Vis. Image Underst., 2016

shuttleNet: A biologically-inspired RNN with loop connection and parameter sharing.
CoRR, 2016

Joint Network based Attention for Action Recognition.
CoRR, 2016

Deep Transfer Learning for Person Re-identification.
CoRR, 2016

CNN vs. SIFT for Image Retrieval: Alternative or Complementary?
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Joint Learning of Semantic and Latent Attributes.
Proceedings of the Computer Vision - ECCV 2016, 2016

Unsupervised Cross-Dataset Transfer Learning for Person Re-identification.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Deep Relative Distance Learning: Tell the Difference between Similar Vehicles.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

CNUSVM: Hybrid CNN-Uneven SVM Model for Imbalanced Visual Learning.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

High-Efficiency Coding for Shaking Surveillance Videos Based on Global Motion Compensation.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

2015
Robust multiple cameras pedestrian detection with multi-view Bayesian network.
Pattern Recognit., 2015

Quality-progressive coding for high bit-rate background frames on surveillance videos.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Learning Deep Trajectory Descriptor for action recognition in videos using deep neural networks.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

CNN Based Vehicle Counting with Virtual Coil in Traffic Surveillance Video.
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

Detecting Rare Actions and Events from Surveillance Big Data with Bag of Dynamic Trajectories.
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

Swiss-System Based Cascade Ranking for Gait-Based Person Re-Identification.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
A refined object detection method based on HTM.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

Multi-view gait recognition with incomplete training data.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

2013
Selective Eigenbackground for Background Modeling and Subtraction in Crowded Scenes.
IEEE Trans. Circuits Syst. Video Technol., 2013

A coding unit classification based AVC-to-HEVC transcoding with background modeling for surveillance videos.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Wavelet based smoke detection method with RGB Contrast-image and shape constrain.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Pair-wise event detection using cubic features and sequence discriminant learning.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

A system based on sequence learning for event detection in surveillance video.
Proceedings of the IEEE International Conference on Image Processing, 2013

2012
PKU-NEC @TRECVID2012 SED : Uneven-Sequence Based Event Detection in Surveillance Video.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Multi-camera Pedestrian Detection with Multi-view Bayesian Network Model.
Proceedings of the British Machine Vision Conference, 2012

Single and Multiple View Detection, Tracking and Video Analysis in Crowded Environments.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

Automatic Webcam-Based Human Heart Rate Measurements Using Laplacian Eigenmap.
Proceedings of the Computer Vision, 2012

2011
PKU-NEC @TRECVID2011 SED: Sequence-Based Event Detection in Surveillance Video.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

Selective eigenbackgrounds method for background subtraction in crowed scenes.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

2010
PKU@TRECVID2010: Pair-Wise Event Detection in Surveillance Video.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Dynamic multi-cue tracking with detection responses association.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

ESUR: A system for Events detection in SURveillance video.
Proceedings of the International Conference on Image Processing, 2010

2009
PKU@TRECVID2009: Single-Actor and Pair-Activity Event Detection in surveillance Video.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

2007
A Robust Caption Detecting Algorithm on MPEG Compressed Video.
Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

2003
A regularized simultaneous autoregressive model for texture classification.
Proceedings of the 2003 International Symposium on Circuits and Systems, 2003

A new algorithm for remotely sensed image texture classification and segmentation.
Proceedings of the 2003 IEEE International Geoscience and Remote Sensing Symposium, 2003


  Loading...