Wenjun Zeng

Orcid: 0000-0003-2531-3137

Affiliations:
  • Eastern Institute of Technology, Ningbo, China
  • Microsoft Research Asia, Beijing, China
  • University of Missouri-Columbia, Department of Computer Science, MO, USA (2003 - 2016)
  • Packet Video Corporation, San Diego, CA, USA (former)
  • Sharp Laboratories of America, Inc., Camas, WA, USA (former)
  • Princeton University, Department of Electrical Engineering, NJ, USA (PhD 1997)


According to our database1, Wenjun Zeng authored at least 314 papers between 1996 and 2024.

Collaborative distances:

Awards

IEEE Fellow

IEEE Fellow 2012, "For contributions to multimedia communication and security".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Domain Prompt Tuning via Meta Relabeling for Unsupervised Adversarial Adaptation.
IEEE Trans. Multim., 2024

Understanding mobile GUI: From pixel-words to screen-sentences.
Neurocomputing, 2024

Open-World Reinforcement Learning over Long Short-Term Imagination.
CoRR, 2024

Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation.
CoRR, 2024

Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs.
CoRR, 2024

Graph-based Unsupervised Disentangled Representation Learning via Multimodal Large Language Models.
CoRR, 2024

RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning.
CoRR, 2024

Real-Time Dynamic Robot-Assisted Hand-Object Interaction via Motion Primitives.
CoRR, 2024

Closed-Loop Unsupervised Representation Disentanglement with β-VAE Distillation and Diffusion Probabilistic Feedback.
CoRR, 2024

Correlation-Embedded Transformer Tracking: A Single-Branch Framework.
CoRR, 2024

Bridging Stereo Geometry and BEV Representation with Reliable Mutual Interaction for Semantic Scene Completion.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects.
Proceedings of the Computer Vision - ECCV 2024, 2024

Rate-Distortion-Cognition Controllable Versatile Neural Image Compression.
Proceedings of the Computer Vision - ECCV 2024, 2024

Hierarchical Temporal Context Learning for Camera-Based Semantic Scene Completion.
Proceedings of the Computer Vision - ECCV 2024, 2024

ReGenNet: Towards Human Action-Reaction Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Inter-X: Towards Versatile Human-Human Interaction Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene Perception.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Generalizing to Unseen Domains: A Survey on Domain Generalization.
IEEE Trans. Knowl. Data Eng., August, 2023

Skeleton-Based Mutually Assisted Interacted Object Localization and Human Action Recognition.
IEEE Trans. Multim., 2023

RailSeg: Learning Local-Global Feature Aggregation With Contextual Information for Railway Point Cloud Semantic Segmentation.
IEEE Trans. Geosci. Remote. Sens., 2023

Extracting 3-D Structural Lines of Building From ALS Point Clouds Using Graph Neural Network Embedded With Corner Information.
IEEE Trans. Geosci. Remote. Sens., 2023

VoxelTrack: Multi-Person 3D Human Pose Estimation and Tracking in the Wild.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

RLLTE: Long-Term Evolution Project of Reinforcement Learning.
CoRR, 2023

Diffusion Models for Image Restoration and Enhancement - A Comprehensive Survey.
CoRR, 2023

One at A Time: Multi-step Volumetric Probability Distribution Diffusion for Depth Estimation.
CoRR, 2023

Collaborative World Models: An Online-Offline Transfer RL Approach.
CoRR, 2023

Inpaint Anything: Segment Anything Meets Image Inpainting.
CoRR, 2023

[CLS] Token is All You Need for Zero-Shot Semantic Segmentation.
CoRR, 2023

StereoScene: BEV-Assisted Stereo Matching Empowers 3D Semantic Scene Completion.
CoRR, 2023

Composable Image Coding for Machine via Task-oriented Internal Adaptor and External Prior.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

WEDGE: Web-Image Assisted Domain Generalization for Semantic Segmentation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

ActFormer: A GAN-based Transformer towards General Action-Conditioned 3D Human Motion Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

NaviNeRF: NeRF-based 3D Representation Disentanglement by Latent Semantic Navigation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Neighborhood Geometric Structure-Preserving Variational Autoencoder for Smooth and Bounded Data Sources.
IEEE Trans. Neural Networks Learn. Syst., 2022

Beyond Triplet Loss: Meta Prototypical N-Tuple Loss for Person Re-identification.
IEEE Trans. Multim., 2022

Style Normalization and Restitution for Domain Generalization and Adaptation.
IEEE Trans. Multim., 2022

APANet: Auto-Path Aggregation for Future Instance Segmentation Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

FPCR-Net: Feature pyramidal correlation and residual reconstruction for optical flow estimation.
Neurocomputing, 2022

Tackling Visual Control via Multi-View Exploration Maximization.
CoRR, 2022

Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning.
CoRR, 2022

Gaze- and Spacing-flow Unveil Intentions: Hidden Follower Discovery.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Towards Building A Group-based Unsupervised Representation Disentanglement Framework.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Learning Disentangled Representation by Exploiting Pretrained Generative Models: A Contrastive Learning View.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Robust Multi-object Tracking by Marginal Inference.
Proceedings of the Computer Vision - ECCV 2022, 2022

VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data.
Proceedings of the Computer Vision - ECCV 2022, 2022

Correlation-Aware Deep Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

ReSTR: Convolution-free Referring Image Segmentation Using Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Lifelong Unsupervised Domain Adaptive Person Re-identification with Coordinated Anti-forgetting and Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
AttributeNet: Attribute enhanced vehicle re-identification.
Neurocomputing, 2021

CASINet: Content-Adaptive Scale Interaction Networks for scene parsing.
Neurocomputing, 2021

FairMOT: On the Fairness of Detection and Re-identification in Multiple Object Tracking.
Int. J. Comput. Vis., 2021

AdaFuse: Adaptive Multiview Fusion for Accurate Human Pose Estimation in the Wild.
Int. J. Comput. Vis., 2021

SelectAugment: Hierarchical Deterministic Sample Selection for Data Augmentation.
CoRR, 2021

Confounder Identification-free Causal Visual Feature Learning.
CoRR, 2021

Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition.
CoRR, 2021

WEDGE: Web-Image Assisted Domain Generalization for Semantic Segmentation.
CoRR, 2021

A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP.
CoRR, 2021

ToAlign: Task-oriented Alignment for Unsupervised Domain Adaptation.
CoRR, 2021

PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning.
CoRR, 2021

Understanding Mobile GUI: from Pixel-Words to Screen-Sentences.
CoRR, 2021

Disentanglement-based Cross-Domain Feature Augmentation for Effective Unsupervised Domain Adaptive Person Re-identification.
CoRR, 2021

Do Generative Models Know Disentanglement? Contrastive Learning is All You Need.
CoRR, 2021

GroupifyVAE: from Group-based Definition to VAE-based Unsupervised Representation Disentanglement.
CoRR, 2021

General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework.
CoRR, 2021

VAE^2: Preventing Posterior Collapse of Variational Video Predictions in the Wild.
CoRR, 2021

Style Normalization and Restitution for DomainGeneralization and Adaptation.
CoRR, 2021

PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

ToAlign: Task-Oriented Alignment for Unsupervised Domain Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Pose-Guided Feature Learning with Knowledge Distillation for Occluded Person Re-Identification.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Uncertainty-Aware Few-Shot Image Classification.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Learning Tracking Representations via Dual-Branch Fully Transformer Networks.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Rethinking Content and Style: Exploring Bias for Unsupervised Disentanglement.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Self-Supervised Visual Representations Learning by Contrastive Mask Prediction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

An Empirical Study of the Collapsing Problem in Semi-Supervised 2D Human Pose Estimation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Re-energizing Domain Discriminator with Sample Relabeling for Adversarial Domain Adaptation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

MetaAlign: Coordinating Domain Alignment and Classification for Unsupervised Domain Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Unsupervised Visual Representation Learning by Tracking Patches in Video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Target-Tailored Source-Transformation for Scene Graph Generation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

S2R-DepthNet: Learning a Generalizable Depth-Specific Structural Representation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Exploiting Sample Uncertainty for Domain Adaptive Person Re-Identification.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Very Important Person Localization in Unconstrained Conditions: A New Benchmark.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
EleAtt-RNN: Adding Attentiveness to Neurons in Recurrent Neural Networks.
IEEE Trans. Image Process., 2020

View Invariant 3D Human Pose Estimation.
IEEE Trans. Circuits Syst. Video Technol., 2020

Temporal-Spatial Mapping for Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2020

Object Detection in Videos by High Quality Object Linking.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Humble Teacher and Eager Student: Dual Network Learning for Semi-supervised 2D Human Pose Estimation.
CoRR, 2020

Re-identification = Retrieval + Verification: Back to Essence and Forward with a New Metric.
CoRR, 2020

Feature Alignment and Restoration for Domain Generalization and Adaptation.
CoRR, 2020

Rethinking Classification Loss Designs for Person Re-identification with a Unified View.
CoRR, 2020

End-to-End Estimation of Multi-Person 3D Poses from Multiple Cameras.
CoRR, 2020

A Simple Baseline for Multi-Object Tracking.
CoRR, 2020

STC-Flow: Spatio-temporal Context-aware Optical Flow Estimation.
CoRR, 2020

FPCR-Net: Feature Pyramidal Correlation and Residual Reconstruction for Semi-supervised Optical Flow Estimation.
CoRR, 2020

CNSA: a data repository for archiving omics data.
Database J. Biol. Databases Curation, 2020

Multi-Scale Group Transformer for Long Sequence Modeling in Speech Separation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Beyond Intra-modality: A Survey of Heterogeneous Person Re-identification.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Joint Time-Frequency and Time Domain Learning for Speech Enhancement.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

VoxelPose: Towards Multi-camera 3D Human Pose Estimation in Wild Environment.
Proceedings of the Computer Vision - ECCV 2020, 2020

AF2S: An Anchor-Free Two-Stage Tracker Based on a Strong SiamFC Baseline.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Global Distance-Distributions Separation for Unsupervised Person Re-identification.
Proceedings of the Computer Vision - ECCV 2020, 2020

Spatiotemporal Fusion in 3D CNNs: A Probabilistic View.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Fusing Wearable IMUs With Multi-View Images for Human Pose Estimation: A Geometric Approach.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Relation-Aware Global Attention for Person Re-Identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Multi-Granularity Reference-Aided Attentive Feature Aggregation for Video-Based Person Re-Identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Tracking by Instance Detection: A Meta-Learning Approach.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Style Normalization and Restitution for Generalizable Person Re-Identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Posterior-Guided Neural Architecture Search.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

PHASEN: A Phase-and-Harmonics-Aware Speech Enhancement Network.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Semantics-Aligned Representation Learning for Person Re-Identification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Uncertainty-Aware Multi-Shot Knowledge Distillation for Image-Based Object Re-Identification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Learning Attentional Recurrent Neural Network for Visual Tracking.
IEEE Trans. Multim., 2019

Learning to Update for Object Tracking With Recurrent Meta-Learner.
IEEE Trans. Image Process., 2019

Benchmarking Single-Image Dehazing and Beyond.
IEEE Trans. Image Process., 2019

High-Order Statistical Modeling Based on a Decision Tree for Distributed Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2019

Multi-Modality Multi-Task Recurrent Neural Network for Online Action Detection.
IEEE Trans. Circuits Syst. Video Technol., 2019

Skeleton-Based Action Recognition With Gated Convolutional Neural Networks.
IEEE Trans. Circuits Syst. Video Technol., 2019

View Adaptive Neural Networks for High Performance Skeleton-Based Human Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

High-Speed Hyperspectral Video Acquisition By Combining Nyquist and Compressive Sampling.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

One-Shot Neural Architecture Search Through A Posteriori Distribution Guided Sampling.
CoRR, 2019

CaseNet: Content-Adaptive Scale Interaction Networks for Scene Parsing.
CoRR, 2019

Relation-Aware Global Attention.
CoRR, 2019

Exploring the Semantics for Visual Relationship Detection.
CoRR, 2019

Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition.
CoRR, 2019

Predicting Future Instance Segmentation with Contextual Pyramid ConvLSTMs.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Quality-Gated Convolutional Lstm for Enhancing Compressed Video.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

The Seventh Visual Object Tracking VOT2019 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Moving Indoor: Unsupervised Video Depth Learning in Challenging Environments.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Unsupervised High-Resolution Depth Learning From Videos With Dual Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Cross View Fusion for 3D Human Pose Estimation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Content-Aware Personalised Rate Adaptation for Adaptive Streaming via Deep Video Analysis.
Proceedings of the 2019 IEEE International Conference on Communications, 2019

Context-Reinforced Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Densely Semantically Aligned Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

SPM-Tracker: Series-Parallel Matching for Real-Time Visual Object Tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

GLSNet: Global and Local Streams Network for 3D Point Cloud Classification.
Proceedings of the 48th IEEE Applied Imagery Pattern Recognition Workshop, 2019

Learning Basis Representation to Refine 3D Human Pose Estimations.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Detect or Track: Towards Cost-Effective Video Object Detection/Tracking.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Learning to Refine 3D Human Pose Sequences.
Proceedings of the 2019 International Conference on 3D Vision, 2019

2018
Photo Stylistic Brush: Robust Style Transfer via Superpixel-Based Bipartite Graph.
IEEE Trans. Multim., 2018

Hybrid Digital-Analog Video Delivery With Shannon-Kotel'nikov Mapping.
IEEE Trans. Multim., 2018

Optimizing Quality of Experience for Adaptive Bitrate Streaming via Viewer Interest Inference.
IEEE Trans. Multim., 2018

Spatio-Temporal Attention-Based LSTM Networks for 3D Action Recognition and Detection.
IEEE Trans. Image Process., 2018

Simultaneous Depth and Spectral Imaging With a Cross-Modal Stereo System.
IEEE Trans. Circuits Syst. Video Technol., 2018

Superimposed Modulation for Soft Video Delivery With Hidden Resources.
IEEE Trans. Circuits Syst. Video Technol., 2018

Variable Block-Sized Signal-Dependent Transform for Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2018

A Practical Hybrid Digital-Analog Scheme for Wireless Video Transmission.
IEEE Trans. Circuits Syst. Video Technol., 2018

Learning to Update for Object Tracking.
CoRR, 2018

Object Detection in Videos by Short and Long Range Object Linking.
CoRR, 2018

Real-Time Object Tracking with Motion Information.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Fast Discrete Cross-modal Hashing With Regressing From Semantic Labels.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Skeleton-Indexed Deep Multi-Modal Feature Learning for High Performance Human Action Recognition.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Cooperative Hybrid Digital-Analog Video Transmission in D2D Networks.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Adding Attentiveness to the Neurons in Recurrent Neural Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

Online Dictionary Learning for Approximate Archetypal Analysis.
Proceedings of the Computer Vision - ECCV 2018, 2018

The Sixth Visual Object Tracking VOT2018 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Towards a Better Match in Siamese Network Based Visual Object Tracker.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

MiCT: Mixed 3D/2D Convolutional Tube for Human Action Recognition.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

A Twofold Siamese Network for Real-Time Object Tracking.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Progressive Pseudo-analog Transmission for Mobile Video Streaming.
IEEE Trans. Multim., 2017

Guest Editorial Special Issue on Visual Computing in the Cloud: Mobile Computing.
IEEE Trans. Circuits Syst. Video Technol., 2017

Fully Reversible Privacy Region Protection for Cloud Video Surveillance.
IEEE Trans. Cloud Comput., 2017

Computational Depth Sensing : Toward high-performance commodity depth cameras.
IEEE Signal Process. Mag., 2017

Adaptive Nonlocal Sparse Representation for Dual-Camera Compressive Hyperspectral Imaging.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Personalized long-term prediction of cognitive function: Using sequential assessments to improve model performance.
J. Biomed. Informatics, 2017

RESIDE: A Benchmark for Single Image Dehazing.
CoRR, 2017

Impact of Next-Generation Mobile Technologies on IoT-Cloud Convergence.
IEEE Commun. Mag., 2017

On-line fall detection via a boosted cascade of hybrid features.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Adaptive Pooling in Multi-instance Learning for Web Video Annotation.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition from Skeleton Data.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Human Pose Estimation Using Global and Local Normalization.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Dropout Prediction in Home Care Training.
Proceedings of the 10th International Conference on Educational Data Mining, 2017

A CNN-Based Approach for Automatic License Plate Recognition in the Wild.
Proceedings of the British Machine Vision Conference 2017, 2017

An End-to-End Spatio-Temporal Attention Model for Human Action Recognition from Skeleton Data.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Lossless Compression of JPEG Coded Photo Collections.
IEEE Trans. Image Process., 2016

Lossless ROI Privacy Protection of H.264/AVC Compressed Surveillance Videos.
IEEE Trans. Emerg. Top. Comput., 2016

Understanding Humans in Multimedia.
IEEE Multim., 2016

On the Connection of Deep Fusion to Ensembling.
CoRR, 2016

Deeply-Fused Nets.
CoRR, 2016

Compressive hyperspectral imaging with complementary RGB measurements.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

A super-fast online face tracking system for video surveillance.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2016

Pseudo-sequence-based light field image compression.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

Online Human Action Detection Using Joint Classification-Regression Recurrent Neural Networks.
Proceedings of the Computer Vision - ECCV 2016, 2016

Co-Occurrence Feature Learning for Skeleton Based Action Recognition Using Regularized Deep LSTM Networks.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
The Attention Automaton: Sensing Collective User Interests in Social Network Communities.
IEEE Trans. Netw. Sci. Eng., 2015

Structure-Preserving Hybrid Digital-Analog Video Delivery in Wireless Networks.
IEEE Trans. Multim., 2015

Graph-based video fingerprinting using double optimal projection.
J. Vis. Commun. Image Represent., 2015

Progressive pseudo-analog transmission for mobile video live streaming.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Swift: A Hybrid Digital-Analog Scheme for Low-Delay Transmission of Mobile Stereo Video.
Proceedings of the 18th ACM International Conference on Modeling, 2015

Compound image compression using lossless and lossy LZMA in HEVC.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Removing camera fingerprint to disguise photograph source.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

High-speed hyperspectral video acquisition with a dual-camera architecture.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
A Compressive Sensing Based Secure Watermark Detection and Privacy Preserving Storage Framework.
IEEE Trans. Image Process., 2014

Secure and robust image hashing via compressive sensing.
Multim. Tools Appl., 2014

Structural similarity-based video fingerprinting for video copy detection.
IET Image Process., 2014

Forging a Close Relationship with Multimedia Communities.
IEEE Multim., 2014

Context-Adaptive Modeling for Wavelet-Domain Distributed Video Coding.
IEEE Multim., 2014

Compressive sensing based secure multiparty privacy preserving framework for collaborative data-mining and signal processing.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Delta interpolation for upsampling imaging solutions.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Influence of social media on performance of movies.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Improving distributed video coding by exploiting context-adaptive modeling.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Bridging Human-Centered Social Media Content Across Web Domains.
Proceedings of the Human-Centered Social Media Analytics, 2014

2013
Cognitive canonicalization of natural language queries using semantic strata.
ACM Trans. Speech Lang. Process., 2013

Introduction to the special section of best papers of ACM multimedia 2012.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Towards Cross-Domain Learning for Social Video Popularity Prediction.
IEEE Trans. Multim., 2013

Social Multimedia Signals: Sense, Process, and Put Them to Work.
IEEE Multim., 2013

Integrated secure watermark detection and privacy preserving storage in the compressive sensing domain.
Proceedings of the 2013 IEEE International Workshop on Information Forensics and Security, 2013

The Hidden Potential of Movie Genome Communities: Analyzing Fine-Grained Semantic Information in Motion Pictures.
Proceedings of the 2013 IEEE Seventh International Conference on Semantic Computing, 2013

Proactive caching of online video by mining mainstream media.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Geometry based airborne LIDAR data compression.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

A hybrid approach for tree classification in airborne LIDAR data.
Proceedings of the IEEE International Conference on Acoustics, 2013

Mainstream media vs. social media for trending topic prediction - an experimental study.
Proceedings of the 10th IEEE Consumer Communications and Networking Conference, 2013

SDNAN: Software-defined networking in ad hoc networks of smartphones.
Proceedings of the 10th IEEE Consumer Communications and Networking Conference, 2013

2012
Multiple description coded video streaming in peer-to-peer networks.
Signal Process. Image Commun., 2012

Mobile Media in Action: Remote Target Localization and Tracking.
IEEE Multim., 2012

SocialTransfer: cross-domain transfer learning from social streams for media applications.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Kinect-like depth denoising.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

Expert Talk for Time Machine Session: High Order Entropy Coding - From Conventional Video Coding to Distributed Video Coding.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Video Based Real-World Remote Target Tracking on Smartphones.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Empowering Cross-Domain Internet Media with Real-Time Topic Learning from Social Streams.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

AMIGO: accurate mobile image geotagging.
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Scalable Lossy Compression for Pixel-Value Encrypted Images.
Proceedings of the 2012 Data Compression Conference, Snowbird, UT, USA, April 10-12, 2012, 2012

A Computational Cognitive Model for Semantic Sub-Network Extraction from Natural Language Queries.
Proceedings of the COLING 2012, 2012

Nest: Networked smartphones for target localization.
Proceedings of the 2012 IEEE Consumer Communications and Networking Conference (CCNC), 2012

2011
Positionit: an image-based remote target localization system on smartphones.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

A multi-layer key stream based approach for joint encryption and compression of H.264 video.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Network Assisted Media Streaming in Multi-Hop Wireless Networks.
Proceedings of 20th International Conference on Computer Communications and Networks, 2011

2010
Efficient Compression of Encrypted Grayscale Images.
IEEE Trans. Image Process., 2010

Efficient general print-scanning resilient data hiding based on uniform log-polar mapping.
IEEE Trans. Inf. Forensics Secur., 2010

Motion Refinement Based Progressive Side-Information Estimation for Wyner-Ziv Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2010

Overview of the OMA Secure Content IDentification Mechanism.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Design Issues of the prTorrent File Sharing Protocol.
Proceedings of the 7th IEEE Consumer Communications and Networking Conference, 2010

iImage: An Image Based Information Retrieval Application for the iPhone.
Proceedings of the 7th IEEE Consumer Communications and Networking Conference, 2010

Cross-Site Request Forgery: Attack and Defense.
Proceedings of the 7th IEEE Consumer Communications and Networking Conference, 2010

2009
Path-Diversity P2P Overlay Retransmission for Reliable IP-Multicast.
IEEE Trans. Multim., 2009

Non-ambiguity of blind watermarking: a revisit with analytical resolution.
Sci. China Ser. F Inf. Sci., 2009

prTorrent: On Establishment of Piece Rarity in the BitTorrent Unchoking Algorithm.
Proceedings of the Proceedings P2P 2009, 2009

Throughput and Delay Analysis of the IEEE 802.15.3 CSMA/CA Mechanism.
Proceedings of the IEEE 6th International Conference on Mobile Adhoc and Sensor Systems, 2009

Challenges and opportunities in supporting video streaming over infrastructure wireless mesh networks.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Multi-resolution based hybrid spatiotemporal compression of encrypted videos.
Proceedings of the IEEE International Conference on Acoustics, 2009

Estimating side-information for Wyner-Ziv video coding using resolution-progressive decoding and extensive motion exploration.
Proceedings of the IEEE International Conference on Acoustics, 2009

A New Data-Mining Based Approach for Network Intrusion Detection.
Proceedings of the 7th Annual Conference on Communication Networks and Services Research, 2009

High Performance Adaptive Video Services Based on Bitstream Switching for IPTV Systems.
Proceedings of the 6th IEEE Consumer Communications and Networking Conference, 2009

2008
End-to-End Security for Multimedia Adaptation.
Proceedings of the Encyclopedia of Multimedia, 2nd Ed., 2008

Real Time Multimedia.
Proceedings of the Encyclopedia of Multimedia, 2nd Ed., 2008

Improving Robustness of Quantization-Based Image Watermarking via Adaptive Receiver.
IEEE Trans. Multim., 2008

A Multi-band Wavelet Watermarking Scheme.
Int. J. Netw. Secur., 2008

Resolution-progressive compression of encrypted grayscale images.
Proceedings of the International Conference on Image Processing, 2008

An efficient print-scanning resilient data hiding scheme based on a novel LPM.
Proceedings of the International Conference on Image Processing, 2008

Supporting Video Streaming Services in Infrastructure Wireless Mesh Networks: Architecture and Protocols.
Proceedings of IEEE International Conference on Communications, 2008

Power-efficient rate allocation for Slepian-Wolf coding over wireless sensor networks.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Joint Design of Source Rate Control and QoS-Aware Congestion Control for Video Streaming Over the Internet.
IEEE Trans. Multim., 2007

Fast Bitstream Switching Algorithms for Real-Time Adaptive Video Multicasting.
IEEE Trans. Multim., 2007

Optimum Detection for Spread-Spectrum Watermarking That Employs Self-Masking.
IEEE Trans. Inf. Forensics Secur., 2007

Cross-Layer Design of Source Rate Control and Congestion Control for Wireless Video Streaming.
Adv. Multim., 2007

Fast and automatic watermark resynchronization based on zernike moments.
Proceedings of the Security, Steganography, and Watermarking of Multimedia Contents IX, 2007

Exploiting Overlay Path-Diversity for Scalable Reliable Multicast.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2006
Real Time Multimedia.
Proceedings of the Encyclopedia of Multimedia, 2006

A sequence-based rate control framework for consistent quality real-time video.
IEEE Trans. Circuits Syst. Video Technol., 2006

Security for Multimedia Adaptation: Architectures and Solutions.
IEEE Multim., 2006

Cross-Layer Design of Source Rate Control and Qos-Aware Congestion Control for Wireless Video Streaming.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Path-Diversity Overlay Retransmission Architecture for Reliable Multicast.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Optimum Detection of Image-Adaptive Watermarking in the DCT Domain.
Proceedings of the International Conference on Image Processing, 2006

Rate-Distortion Optimized Transmission Power Adaptation for Video Streaming over Wireless Channels.
Proceedings of the International Conference on Image Processing, 2006

On Security Architecture and Functionality of Distributed Multimedia.
Proceedings of the 40th Annual Conference on Information Sciences and Systems, 2006

2005
Tile-boundary artifact reduction using odd tile size and the low-pass first convention.
IEEE Trans. Image Process., 2005

Low-pass filtering of rate-distortion functions for quality smoothing in real-time video communication.
IEEE Trans. Circuits Syst. Video Technol., 2005

Adaptive spatial-temporal error concealment with embedded side information.
J. Vis. Commun. Image Represent., 2005

Operational distortion-quantization curve-based bit allocation for smooth video quality.
J. Vis. Commun. Image Represent., 2005

Security architectures and analysis for content adaptation.
Proceedings of the Security, Steganography, and Watermarking of Multimedia Contents VII, 2005

Scalable Non-Binary Distributed Source Coding Using Gray Codes.
Proceedings of the IEEE 7th Workshop on Multimedia Signal Processing, 2005

Multi-band Wavelet Based Digital Watermarking Using Principal Component Analysis.
Proceedings of the Digital Watermarking, 4th International Workshop, 2005

On the rate-distortion performance of dynamic bitstream switching mechanisms.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

A protocol for simultaneous real time playback and full quality storage of streaming media.
Proceedings of IEEE International Conference on Communications, 2005

2004
MPEG-4 IPMP Extension for Interoperable Protection of Multimedia Content.
EURASIP J. Adv. Signal Process., 2004

Rate-distortion optimized dynamic bitstream switching for scalable video streaming.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Single-pass frame-level constant distortion bit allocation for smooth video quality.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Network friendly media security: rationales, solutions, and open issues.
Proceedings of the 2004 International Conference on Image Processing, 2004

An improved rate-quantization model for rate control in real-time video encoding.
Proceedings of the 2004 International Conference on Image Processing, 2004

Two fast bitstream switching algorithms for real-time adaptive multicasting of video.
Proceedings of IEEE International Conference on Communications, 2004

2003
Efficient frequency domain selective scrambling of digital video.
IEEE Trans. Multim., 2003

Spatial-temporal error concealment with side information for standard video codecs.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Source characteristics based fast bitstream switching.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

2002
3G wireless multimedia: technologies and practical issues.
Wirel. Commun. Mob. Comput., 2002

A format-compliant configurable encryption framework for access control of video.
IEEE Trans. Circuits Syst. Video Technol., 2002

An overview of the visual optimization tools in JPEG 2000.
Signal Process. Image Commun., 2002

Format-Compliant Selective Scrambling for Multimedia Access Control.
Proceedings of the 2002 International Symposium on Information Technology (ITCC 2002), 2002

Fast self-synchronous content scrambling by spatially shuffling codewords of compressed bitstreams.
Proceedings of the 2002 International Conference on Image Processing, 2002

Sequence-based rate control for constant quality video.
Proceedings of the 2002 International Conference on Image Processing, 2002

2001
A format-compliant configurable encryption framework for access control of multimedia.
Proceedings of the Fourth IEEE Workshop on Multimedia Signal Processing, 2001

Scalable streaming of JPEG2000 images using hypertext transfer protocol.
Proceedings of the 9th ACM International Conference on Multimedia 2001, Ottawa, Ontario, Canada, September 30, 2001

Preface to Special Sessions on Multimedia Security and Watermarking Applications.
Proceedings of the 2001 International Symposium on Information Technology (ITCC 2001), 2001

HTTP Streaming of JPEG2000 Images.
Proceedings of the 2001 International Symposium on Information Technology (ITCC 2001), 2001

2000
An Efficient Color Re-Indexing Scheme for Palette-Based Compression.
Proceedings of the 2000 International Conference on Image Processing, 2000

Visual Optimization Tools in JPEG 2000.
Proceedings of the 2000 International Conference on Image Processing, 2000

Point-Wise Extended Visual Masking for JPEG-2000 Image Compression.
Proceedings of the 2000 International Conference on Image Processing, 2000

1999
A statistical watermark detection technique without using original images for resolving rightful ownerships of digital images.
IEEE Trans. Image Process., 1999

Geometric-structure-based error concealment with novel applications in block-based low-bit-rate coding.
IEEE Trans. Circuits Syst. Video Technol., 1999

Informing Clientele through Networked Multimedia Information Systems: Introduction to the Special Issues.
Informing Sci. Int. J. an Emerg. Transdiscipl., 1999

Extraction of multiresolution watermark images for resolving rightful ownership.
Proceedings of the Security and Watermarking of Multimedia Contents, 1999

Efficient frequency domain video scrambling for content access control.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

1998
Image-adaptive watermarking using visual models.
IEEE J. Sel. Areas Commun., 1998

Adaptive Wavelet Transforms with Spatially Varying Filters for Scalable Image Coding.
Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

1997
Perceptual watermarking of still images.
Proceedings of the First IEEE Workshop on Multimedia Signal Processing, 1997

Feature-Oriented Rate Shaping of Pre-Compressed Image/Video.
Proceedings of the Proceedings 1997 International Conference on Image Processing, 1997

On Resolving Rightful Ownership's of Digital Images by Invisible Watermarks.
Proceedings of the Proceedings 1997 International Conference on Image Processing, 1997

Digital image watermarking using visual models.
Proceedings of the Human Vision and Electronic Imaging II, 1997

1996
Rate Shaping by Block Dropping for Transmission of MPEG-Precoded Video over Channels of Dynamic Bandwidth.
Proceedings of the Forth ACM International Conference on Multimedia '96, 1996

Integrated Image and Speech Analysis for Content-Based Video Indexing.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1996

Directional spatial interpolation for DCT-based low bit rate coding.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996


  Loading...