2024
In-use calibration: improving domain-specific fine-grained few-shot recognition.
Neural Comput. Appl., May, 2024
End-to-End Human Instance Matting.
IEEE Trans. Circuits Syst. Video Technol., April, 2024
Editorial for Special Issue on Artificial Intelligence for Art.
Mach. Intell. Res., February, 2024
Artistic image synthesis from unsupervised segmentation maps.
Multim. Tools Appl., January, 2024
Artistic image synthesis with tag-guided correlation matching.
Multim. Tools Appl., January, 2024
Hierarchical pose net: spatial hierarchical body tree driven multi-person pose estimation.
Multim. Tools Appl., January, 2024
Limb-Aware Virtual Try-On Network With Progressive Clothing Warping.
IEEE Trans. Multim., 2024
Stereo Image Restoration via Attention-Guided Correspondence Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2024
Uncertainty-aware pseudo-label filtering for source-free unsupervised domain adaptation.
Neurocomputing, 2024
Bridge then Begin Anew: Generating Target-relevant Intermediate Model for Source-free Visual Emotion Adaptation.
CoRR, 2024
FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation.
CoRR, 2024
Multi-source Domain Adaptation for Panoramic Semantic Segmentation.
CoRR, 2024
MonoGaussianAvatar: Monocular Gaussian Point-based Head Avatar.
Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Computer Vision - ECCV 2024, 2024
Dynamic Policy-Driven Adaptive Multi-Instance Learning for Whole Slide Image Classification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Blur-Aware Spatio-Temporal Sparse Transformer for Video Deblurring.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
FakePoI: A Large-Scale Fake Person of Interest Video Detection Benchmark and a Strong Baseline.
IEEE Trans. Circuits Syst. Video Technol., November, 2023
Focus nuance and toward diversity: exploring domain-specific fine-grained few-shot recognition.
Neural Comput. Appl., October, 2023
Center-push loss for joint view-based 3D model classification and retrieval feature learning.
Signal Image Video Process., June, 2023
MIFNet: Multiple instances focused temporal action proposal generation.
Neurocomputing, June, 2023
Learning Enriched Hop-Aware Correlation for Robust 3D Human Pose Estimation.
Int. J. Comput. Vis., June, 2023
Unsupervised Low-Light Video Enhancement With Spatial-Temporal Co-Attention Transformer.
IEEE Trans. Image Process., 2023
HPnet: Hybrid Parallel Network for Human Pose Estimation.
Sensors, 2023
Correction to: Learning Enriched Hop-Aware Correlation for Robust 3D Human Pose Estimation.
Int. J. Comput. Vis., 2023
Graph Convolutional GRU for Music-Oriented Dance Choreography Generation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
BRMR: TAL Based on Boundary Refinement and Multi-scale Regression.
Proceedings of the Image and Graphics - 12th International Conference, 2023
BiCro: Noisy Correspondence Rectification for Multi-modality Data via Bi-directional Cross-modal Similarity Consistency.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders.
CoRR, 2022
Spatio-Temporal Deformable Attention Network for Video Deblurring.
Proceedings of the Computer Vision - ECCV 2022, 2022
Temporal Action Proposal Generation with Background Constraint.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Deep Semantic Parsing of Freehand Sketches With Homogeneous Transformation, Soft-Weighted Loss, and Staged Learning.
IEEE Trans. Multim., 2021
Unsupervised Discrete Hashing With Affinity Similarity.
IEEE Trans. Image Process., 2021
Iterated shape-bias graph cut with application to ellipse segmentation.
J. Intell. Fuzzy Syst., 2021
Sketch-specific data augmentation for freehand sketch recognition.
Neurocomputing, 2021
Toward 3D object reconstruction from stereo images.
Neurocomputing, 2021
Image editing with varying intensities of processing.
Comput. Vis. Image Underst., 2021
Temporal Action Proposal Generation with Transformers.
CoRR, 2021
Visual Chirality Meets Freehand Sketches.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
3D Reconstruction from Single-View Image Using Feature Selection.
Proceedings of the Image and Graphics - 11th International Conference, 2021
Adaptive Spatio-Temporal Convolutional Network for Video Deblurring.
Proceedings of the Image and Graphics - 11th International Conference, 2021
Efficient Regional Memory Network for Video Object Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Asynchronous Teacher Guided Bit-wise Hard Mining for Online Hashing.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Deep Saliency Hashing for Fine-Grained Retrieval.
IEEE Trans. Image Process., 2020
Discrete Probability Distribution Prediction of Image Emotions with Shared Sparse Learning.
IEEE Trans. Affect. Comput., 2020
Conditional GAN based individual and global motion fusion for multiple object tracking in UAV videos.
Pattern Recognit. Lett., 2020
TVENet: Temporal variance embedding network for fine-grained action representation.
Pattern Recognit., 2020
Actionness-pooled Deep-convolutional Descriptor for fine-grained action recognition.
Neurocomputing, 2020
Pix2Vox++: Multi-scale Context-aware 3D Object Reconstruction from Single and Multiple Images.
Int. J. Comput. Vis., 2020
Object contour tracking via adaptive data-driven kernel.
EURASIP J. Adv. Signal Process., 2020
Meta3D: Single-View 3D Object Reconstruction from Shape Priors in Memory.
CoRR, 2020
An Effective Way to Boost Black-Box Adversarial Attack.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020
PRF-Ped: Multi-scale Pedestrian Detector with Prior-based Receptive Field.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
GRNet: Gridding Residual Network for Dense Point Cloud Completion.
Proceedings of the Computer Vision - ECCV 2020, 2020
SSAH: Semi-Supervised Adversarial Deep Hashing with Self-Paced Hard Sample Generation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Discovering Latent Discriminative Patterns for Multi-Mode Event Representation.
IEEE Trans. Multim., 2019
Learning Descriptors With Cube Loss for View-Based 3-D Object Retrieval.
IEEE Trans. Multim., 2019
Nighttime image enhancement based on image decomposition.
Signal Image Video Process., 2019
An efficient way to refine DenseNet.
Signal Image Video Process., 2019
Hedging Deep Features for Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2019
Gradual recovery based occluded digit images recognition.
Multim. Tools Appl., 2019
Action recognition with multi-scale trajectory-pooled 3D convolutional descriptors.
Multim. Tools Appl., 2019
Handling missing labels and class imbalance challenges simultaneously for facial action unit recognition.
Multim. Tools Appl., 2019
Robust visual tracking via scale-and-state-awareness.
Neurocomputing, 2019
Unsupervised semantic deep hashing.
Neurocomputing, 2019
Toward 3D Object Reconstruction from Stereo Images.
CoRR, 2019
Pix2Vox: Context-aware 3D Reconstruction from Single and Multi-view Images.
CoRR, 2019
Focal CTC Loss for Chinese Optical Character Recognition on Unbalanced Datasets.
Complex., 2019
Self-balance Motion and Appearance Model for Multi-object Tracking in UAV.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019
Adaptive Semantic-Visual Tree for Hierarchical Embeddings.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Pix2Vox: Context-Aware 3D Reconstruction From Single and Multi-View Images.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
2018
Predicting Personalized Image Emotion Perceptions in Social Networks.
IEEE Trans. Affect. Comput., 2018
Locally aggregated histogram-based descriptors.
Signal Image Video Process., 2018
Distinctive action sketch for human action recognition.
Signal Process., 2018
Event patches: Mining effective parts for event detection and understanding.
Signal Process., 2018
Off-the-shelf CNN features for 3D object retrieval.
Multim. Tools Appl., 2018
Exploring part-aware segmentation for fine-grained visual categorization.
Multim. Tools Appl., 2018
Rediscover flowers structurally.
Multim. Tools Appl., 2018
Online multiple object tracking via exchanging object context.
Neurocomputing, 2018
Hierarchical semantic image matching using CNN feature pyramid.
Comput. Vis. Image Underst., 2018
Training samples-optimizing based dictionary learning algorithm for MR sparse superresolution reconstruction.
Biomed. Signal Process. Control., 2018
RS-MSSF Frame: Remote Sensing Image Classification Based on Extraction and Fusion of Multiple Spectral-Spatial Features.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018
ASMMC-MMAC 2018: The Joint Workshop of 4th the Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data Workshop.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
Very High Resolution Image Scene Classification with Semantic Fisher Vectors.
Proceedings of the 2018 IEEE International Geoscience and Remote Sensing Symposium, 2018
Add: Actionness-Pooled Deep-Convolutional Descriptor.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018
Local Image Descriptors with Statistical Losses.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018
Cycle-Consistency Based Hierarchical Dense Semantic Correspondence.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018
Illustrate your travel notes: web-based story visualization.
Proceedings of the 10th International Conference on Internet Multimedia Computing and Service, 2018
Weighted voxel: a novel voxel representation for 3D reconstruction.
Proceedings of the 10th International Conference on Internet Multimedia Computing and Service, 2018
2017
A Biologically Inspired Appearance Model for Robust Visual Tracking.
IEEE Trans. Neural Networks Learn. Syst., 2017
Continuous Probability Distribution Prediction of Image Emotions via Multitask Shared Sparse Regression.
IEEE Trans. Multim., 2017
Dancelets Mining for Video Recommendation Based on Dance Styles.
IEEE Trans. Multim., 2017
Deep Feature Fusion for VHR Remote Sensing Scene Classification.
IEEE Trans. Geosci. Remote. Sens., 2017
Guest Editorial Introduction to the Special Issue on Group and Crowd Behavior Analysis for Intelligent Multicamera Video Surveillance.
IEEE Trans. Circuits Syst. Video Technol., 2017
Breaking video into pieces for action recognition.
Multim. Tools Appl., 2017
Anomaly detection based on spatio-temporal sparse representation and visual attention analysis.
Multim. Tools Appl., 2017
Towards more efficient and flexible face image deblurring using robust salient face landmark detection.
Multim. Tools Appl., 2017
Discovering discriminative patches for free-hand sketch analysis.
Multim. Syst., 2017
Exploiting the complementary strengths of multi-layer CNN features for image retrieval.
Neurocomputing, 2017
Actor identification via mining representative actions.
Neurocomputing, 2017
View-based 3D object retrieval with discriminative views.
Neurocomputing, 2017
Text image deblurring via two-tone prior.
Neurocomputing, 2017
How many zero crossings? A method for structure-texture image decomposition.
Comput. Graph., 2017
Sketch Based Model-Like Standing Style Recommendation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017
Shallow and Deep Model Investigation for Distinguishing Corn and Weeds.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017
Object Discovery and Cosegmentation Based on Dense Correspondences.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017
Multi-scale Discriminative Patches for Fined-Grained Visual Categorization.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017
Trajectory-Pooled 3D Convolutional Descriptors for Action Recognition.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017
Style Transfer with Content Preservation from Multiple Images.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017
Multi-level Semantic Representation for Flower Classification.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017
Multi-modal Emotion Recognition Based on Speech and Image.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017
Image Decomposition Based Nighttime Image Enhancement.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017
Part-based fine-grained bird image retrieval respecting species correlation.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017
Gated additive skip context connection for object detection.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017
Dancing like a superstar: Action guidance based on pose estimation and conditional pose alignment.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017
The shortest matching path based on novel cycle consistency.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017
Non-rigid Object Tracking via Deformable Patches Using Shape-Preserved KCF and Level Sets.
Proceedings of the IEEE International Conference on Computer Vision, 2017
Online multi-target tracking via depth range segmentation.
Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017
2016
Facial action unit recognition under incomplete data based on multi-label learning with missing labels.
Pattern Recognit., 2016
Multi-modal microblog classification via multi-task learning.
Multim. Tools Appl., 2016
An Informative Feature Selection Method Based on Sparse PCA for VHR Scene Classification.
IEEE Geosci. Remote. Sens. Lett., 2016
Auto-encoder based dimensionality reduction.
Neurocomputing, 2016
Unsupervised discovery of crowd activities by saliency-based clustering.
Neurocomputing, 2016
Exploring Discriminative Views for 3D Object Retrieval.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016
Predicting Personalized Emotion Perceptions of Social Images.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
From Seed Discovery to Deep Reconstruction: Predicting Saliency in Crowd via Deep Networks.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
A VHR scene classification method integrating sparse PCA and saliency computing.
Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium, 2016
Crowd video retrieval via deep attribute-embedding graph ranking.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016
Tree-guided Group Sparse based Representation for Person Re-identification.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2016
Mining representative actions for actor identification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
User-Centric Affective Computing of Image Emotion Perceptions.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
Affective Computing and Applications of Image Emotion Perceptions.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
Learning Cross Space Mapping via DNN Using Large Scale Click-Through Logs.
IEEE Trans. Multim., 2015
Non-Rigid Object Contour Tracking via a Novel Supervised Level Set Model.
IEEE Trans. Image Process., 2015
Social Attribute-Aware Force Model: Exploiting Richness of Interaction for Abnormal Crowd Detection.
IEEE Trans. Circuits Syst. Video Technol., 2015
View-based 3D object retrieval via multi-modal graph learning.
Signal Process., 2015
Adaptive NormalHedge for robust visual tracking.
Signal Process., 2015
深度学习中的自编码器的表达能力研究 (Representation Ability Research of Auto-encoders in Deep Learning).
计算机科学, 2015
Strategy for dynamic 3D depth data matching towards robust action retrieval.
Neurocomputing, 2015
Strategy for aesthetic photography recommendation via collaborative composition model.
IET Comput. Vis., 2015
Part-Aware Segmentation for Fine-Grained Categorization.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015
Efficient Face Image Deblurring via Robust Face Salient Landmark Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015
Blur Detection Using Multi-method Fusion.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015
Predicting Continuous Probability Distribution of Image Emotions in Valence-Arousal Space.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
"Clustering of Dancelets": Towards Video Recommendation Based on Dance Styles.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Formation Period Matters: Towards Socially Consistent Group Detection via Dense Subgraph Seeking.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015
Distinctive action sketch.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
Predicting discrete probability distribution of image emotions.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
Histograms of locally aggregated oriented gradients.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
Why does the sketch look so vivid?
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015
Dimensionality reduction strategy based on auto-encoder.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015
Dual-mode video stabilization based on adaptive motion clustering.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015
Boost sparse coding based abnormal event detection via explicitly applying temporal continuity constraint.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015
Learning a discriminative dictionary for facial expression recognition.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
2014
Toward Statistical Modeling of Saccadic Eye-Movement and Visual Saliency.
IEEE Trans. Image Process., 2014
Visual tracking via weakly supervised learning from multiple imperfect oracles.
Pattern Recognit., 2014
A refined particle filter based on determined level set model for robust contour tracking.
Mach. Vis. Appl., 2014
Where should I stand? Learning based human position recommendation for mobile photographing.
Multim. Tools Appl., 2014
Preface: Internet multimedia computing and service.
Multim. Tools Appl., 2014
Action recognition based on overcomplete independent components analysis.
Inf. Sci., 2014
Visualizing and Comparing Convolutional Neural Networks.
CoRR, 2014
Using Label Propagation to Get Confidence Map for Segmentation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014
Affective Image Retrieval via Multi-Graph Learning.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Exploring Principles-of-Art Features For Image Emotion Recognition.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Emotion based image musicalization.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014
Exploring covert attention for generic boosting of saliency models.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
Structure-aware multi-object discovery for weakly supervised tracking.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
"Clustering by saliency" - Unsupervised discovery of crowd activities.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
Discriminative Features for Bird Species Classification.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014
DNN Flow: DNN Feature Pyramid based Image Matching.
Proceedings of the British Machine Vision Conference, 2014
2013
Learning to Distribute Vocabulary Indexing for Scalable Visual Search.
IEEE Trans. Multim., 2013
Learning from mobile contexts to minimize the mobile location search latency.
Signal Process. Image Commun., 2013
Weakly supervised codebook learning by iterative label propagation with graph quantization.
Signal Process., 2013
Sparse coding based visual tracking: Review and experimental comparison.
Pattern Recognit., 2013
Bidirectional-isomorphic manifold learning at image semantic understanding & representation.
Multim. Tools Appl., 2013
Visual attention modeling based on short-term environmental adaption.
J. Vis. Commun. Image Represent., 2013
Video classification and recommendation based on affective analysis of viewers.
Neurocomputing, 2013
Robust visual tracking based on online learning sparse representation.
Neurocomputing, 2013
Nested-SIFT for Efficient Image Matching and Retrieval.
IEEE Multim., 2013
Eating activity detection from images acquired by a wearable camera.
Proceedings of the International SenseCam & Pervasive Imaging Conference 2013, 2013
Edge-Respecting Image Smoothing via Extrema Interpolation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013
Flexible Presentation of Videos Based on Affective Content Analysis.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013
Beyond particle flow: Bag of Trajectory Graphs for dense crowd event recognition.
Proceedings of the IEEE International Conference on Image Processing, 2013
The shortest warping path based multiple images alignment.
Proceedings of the IEEE International Conference on Image Processing, 2013
Structured Textons for texture representation.
Proceedings of the IEEE International Conference on Image Processing, 2013
Sparse coding based motion attention for abnormal event detection.
Proceedings of the IEEE International Conference on Image Processing, 2013
Non-rigid object tracking by adaptive data-driven kernel.
Proceedings of the IEEE International Conference on Image Processing, 2013
Real-time visual tracking using ℓ2 norm regularization based collaborative representation.
Proceedings of the IEEE International Conference on Image Processing, 2013
Proceedings of the IEEE International Conference on Image Processing, 2013
Night video enhancement using improved dark channel prior.
Proceedings of the IEEE International Conference on Image Processing, 2013
A spatial-temporal constraint-based action recognition method.
Proceedings of the IEEE International Conference on Image Processing, 2013
Age classification based on back-propagation network.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013
Exploring Implicit Image Statistics for Visual Representativeness Modeling.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013
2012
Robust Visual Tracking Using an Effective Appearance Model Based on Sparse Coding.
ACM Trans. Intell. Syst. Technol., 2012
Context-Aware Semi-Local Feature Detector.
ACM Trans. Intell. Syst. Technol., 2012
Task-Dependent Visual-Codebook Compression.
IEEE Trans. Image Process., 2012
Location Discriminative Vocabulary Coding for Mobile Landmark Search.
Int. J. Comput. Vis., 2012
Action retrieval based on generalized dynamic depth data matching.
Proceedings of the 2012 Visual Communications and Image Processing, 2012
Action Segmentation in Dance Videos.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012
Real-Time Viewfinder Composition Assessment and Recommendation to Mobile Photographing.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012
Memorable basis: towards human-centralized sparse representation.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Aesthetic composition represetation for portrait photographing recommendation.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012
Abnormal crowd behavior detection based on social attribute-aware force model.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012
What are we looking for: Towards statistical modeling of saccadic eye movements and visual saliency.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012
2011
Mining flickr landmarks by modeling reconstruction sparsity.
ACM Trans. Multim. Comput. Commun. Appl., 2011
A Progressive Quality Hiding Strategy Based on Equivalence Partitions of Hiding Units.
Trans. Data Hiding Multim. Secur., 2011
Actor-independent action search using spatiotemporal vocabulary with appearance hashing.
Pattern Recognit., 2011
Vocabulary Hierarchy Optimization and Transfer for Scalable Image Search.
IEEE Multim., 2011
Video indexing and recommendation based on affective analysis of viewers.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Unsupervised fast anomaly detection in crowds.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Learning heterogeneous data for hierarchical web video classification.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Towards low bit rate mobile visual search with multiple-channel coding.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Probe the Potts States in the Minicolumn Dynamics.
Proceedings of the Advances in Neural Networks - ISNN 2011, 2011
Learning Compact Visual Descriptor for Low Bit Rate Mobile Landmark Search.
Proceedings of the IJCAI 2011, 2011
Modular Scale-Free Function Subnetworks in Auditory Areas.
Proceedings of the Neural Information Processing - 18th International Conference, 2011
Stable Fast Rewiring Depends on the Activation of Skeleton Voxels.
Proceedings of the Neural Information Processing - 18th International Conference, 2011
Robust visual tracking via context objects computing.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Contour tracking via on-line discriminative appearance modeling based level sets.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Sparse representation based visual element analysis.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Learning the trip suggestion from landmark photos on the web.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
PKUBench: A context rich mobile visual search benchmark.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Video stabilization based on saliency driven SIFT matching and discriminative RANSAC.
Proceedings of the ICIMCS 2011, 2011
Contextual dictionaries for image super resolution.
Proceedings of the ICIMCS 2011, 2011
A spatiotemporal context phrase description for general dynamic texture.
Proceedings of the ICIMCS 2011, 2011
Robust object tracking via inertial potential based mean shift.
Proceedings of the ICIMCS 2011, 2011
Affective Video Classification Based on Spatio-temporal Feature Fusion.
Proceedings of the Sixth International Conference on Image and Graphics, 2011
When codeword frequency meets geographical location.
Proceedings of the IEEE International Conference on Acoustics, 2011
A lowbit rate vocabulary coding scheme for mobile landmark search.
Proceedings of the IEEE International Conference on Acoustics, 2011
Sorting local descriptors for lowbit rate mobile visual search.
Proceedings of the IEEE International Conference on Acoustics, 2011
A novel supervised level set method for non-rigid object tracking.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011
2010
Adaptive Sign Language Recognition With Exemplar Extraction and MAP/IVFS.
IEEE Signal Process. Lett., 2010
Local Histogram of Figure/Ground Segmentations for Dynamic Background Subtraction.
EURASIP J. Adv. Signal Process., 2010
Partial occlusion robust object tracking using an effective appearance model.
Proceedings of the Visual Communications and Image Processing 2010, 2010
Robust object tracking combining color and scale invariant features.
Proceedings of the Visual Communications and Image Processing 2010, 2010
Robust object tracking based on sparse representation.
Proceedings of the Visual Communications and Image Processing 2010, 2010
A rotation and scale invariant texture description approach.
Proceedings of the Visual Communications and Image Processing 2010, 2010
3D silhouette tracking with occlusion inference.
Proceedings of the Visual Communications and Image Processing 2010, 2010
A refined particle filter method for contour tracking.
Proceedings of the Visual Communications and Image Processing 2010, 2010
Adaptive Particle Filter Based on Energy Field for Robust Object Tracking in Complex Scenes.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010
A Determined Binary Level Set Method Based on Mean Shift for Contour Tracking.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010
Scalable and Credible Video Watermarking towards Scalable Video Coding.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010
Localized Image Matte Evaluation by Gradient Correlation.
Proceedings of the 20th International Conference on Pattern Recognition, 2010
A steganography strategy based on equivalence partitions of hiding units.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010
Saliency detection based on short-term sparse representation.
Proceedings of the International Conference on Image Processing, 2010
Visual saliency as sequential eye fixation probability.
Proceedings of the International Conference on Image Processing, 2010
A robust texture descriptor using multifractal analysis with Gabor filter.
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010
Visual topic model for web image annotation.
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010
An Image Data Hiding Method Using Pixel-Based JND Model.
Proceedings of the Advanced Intelligent Computing Theories and Applications, 2010
Robust background modeling via standard variance feature.
Proceedings of the IEEE International Conference on Acoustics, 2010
Robust visual tracking using feature-based visual attention.
Proceedings of the IEEE International Conference on Acoustics, 2010
Mining actor correlations with hierarchical concurrence parsing.
Proceedings of the IEEE International Conference on Acoustics, 2010
SIGMA: Spatial Integrated Matching Association algorithm for logo detection.
Proceedings of the IEEE International Conference on Acoustics, 2010
Exploring statistical properties for semantic annotation: sparse distributed and convergent assumptions for keywords.
Proceedings of the IEEE International Conference on Acoustics, 2010
Visual tracking via weakly supervised learning from multiple imperfect oracles.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010
Novel observation model for probabilistic object tracking.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010
Towards semantic embedding in visual vocabulary.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010
Who dominates the retinotectal mapping?
Proceedings of the 9th IEEE International Conference on Cognitive Informatics, 2010
2009
Event Tactic Analysis Based on Broadcast Sports Video.
IEEE Trans. Multim., 2009
Contour-motion feature (CMF): A space-time approach for robust pedestrian detection.
Pattern Recognit. Lett., 2009
Synthetic data generation technique in Signer-independent sign language recognition.
Pattern Recognit. Lett., 2009
Visual and textual fusion for semantically supervised region-based retrieval.
Multim. Syst., 2009
Dynamic Background Subtraction Based on Local Dependency Histogram.
Int. J. Pattern Recognit. Artif. Intell., 2009
Geometric and Algebraic Approaches of Planar Structure Recovery Based on Properties of Dual Conic.
Int. J. Pattern Recognit. Artif. Intell., 2009
Photo assessment based on computational visual attention model.
Proceedings of the 17th International Conference on Multimedia 2009, 2009
What is a complete set of keywords for image description & annotation on the web.
Proceedings of the 17th International Conference on Multimedia 2009, 2009
Location sensitive indexing for image-based advertising.
Proceedings of the 17th International Conference on Multimedia 2009, 2009
Mining city landmarks from blogs by graph modeling.
Proceedings of the 17th International Conference on Multimedia 2009, 2009
Spatial-temporal nonparametric background subtraction in dynamic scenes.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009
Neighboring Image Patches Embedding for background modeling.
Proceedings of the International Conference on Image Processing, 2009
Multl-resolution background subtraction for dynamic scenes.
Proceedings of the International Conference on Image Processing, 2009
VisualCor system: search actor correlations in TV series.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009
Vocabulary hierarchy optimization for effective and transferable retrieval.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009
Local Spatial Co-occurrence for Background Subtraction via Adaptive Binned Kernel Estimation.
Proceedings of the Computer Vision, 2009
Universal Steganalysis Based on Statistical Models Using Reorganization of Block-based DCT Coefficients.
Proceedings of the Fifth International Conference on Information Assurance and Security, 2009
2008
DRM: dynamic region matching for image retrieval using probabilistic fuzzy matching and boosting feature selection.
Signal Image Video Process., 2008
Effective and Automatic Calibration Using Concentric Circles.
Int. J. Pattern Recognit. Artif. Intell., 2008
Shape from silhouettes based on a centripetal pentahedron model.
Graph. Model., 2008
Vision-Based Semi-supervised Homecare with Spatial Constraint.
Proceedings of the Advances in Multimedia Information Processing, 2008
Attention-driven action retrieval with DTW-based 3d descriptor matching.
Proceedings of the 16th International Conference on Multimedia 2008, 2008
Place retrieval with graph-based place-view model.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008
Cross-media manifold learning for image retrieval & annotation.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008
DTW/ISODATA Algorithm and Multilayer Architecture in Sign Language Recognition with Large Vocabulary.
Proceedings of the 4th International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2008), 2008
Sign Language Synthesis of Individuation Based on Data Model.
Proceedings of the 4th International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2008), 2008
A Real-Time Video Watermarking Using Adjacent Luminance Blocks Correlation Based on Compressed Domain.
Proceedings of the 4th International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2008), 2008
Hierarchical background subtraction using local pixel clustering.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008
A covariance-based method for dynamic background subtraction.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008
Mahalanobis distance based Polynomial Segment Model for Chinese Sign Language Recogniton.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Clustering-based subspace SVM ensemble for relevance feedback learning.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Directional correlation analysis of local Haar binary pattern for text detection.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Vocabulary tree incremental indexing for scalable location recognition.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Dynamic background modeling and subtraction using spatio-temporal local binary patterns.
Proceedings of the International Conference on Image Processing, 2008
Text Particles Multi-band Fusion for Robust Text Detection.
Proceedings of the Image Analysis and Recognition, 5th International Conference, 2008
2007
Human Behavior Analysis for Highlight Ranking in Broadcast Racket Sports Video.
IEEE Trans. Multim., 2007
Nonparametric background generation.
J. Vis. Commun. Image Represent., 2007
Elliptic Curve Cryptography Based Wireless Authentication Protocol.
Int. J. Netw. Secur., 2007
Novel Secure Communication Protocol for Conditional Access System.
Int. J. Netw. Secur., 2007
Shape from silhouette outlines using an adaptive dandelion model.
Comput. Vis. Image Underst., 2007
An image fragile watermark scheme based on chaotic image pattern and pixel-pairs.
Appl. Math. Comput., 2007
Trajectory based event tactics analysis in broadcast sports video.
Proceedings of the 15th International Conference on Multimedia 2007, 2007
Visual & textual fusion for region retrieval: from both fuzzy matching and bayesian reasoning aspects.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007
An Image Classification Method Based on a SK Sub-Vector Multi-Hierarchy Clustering Algorithm.
Proceedings of the Proceeding of the Second International Multi-Symposium of Computer and Computational Sciences (IMSCCS 2007), 2007
Mean-Shift Blob Tracking with Adaptive Feature Selection and Scale Adaptation.
Proceedings of the International Conference on Image Processing, 2007
Minimizing the Distortion Spatial Data Hiding Based on Equivalence Class.
Proceedings of the Advanced Intelligent Computing Theories and Applications. With Aspects of Theoretical and Methodological Issues, 2007
Using Visual Dictionary to Associate Semantic Objects in Region-Based Image Retrieval.
Proceedings of the Image Analysis and Recognition, 4th International Conference, 2007
A Novel Retrieval Refinement and Interaction Pattern by Exploring Result Correlations for Image Retrieval.
Proceedings of the Adaptive Multimedial Retrieval: Retrieval, 2007
Combining Global and Local Classifiers for Lipreading.
Proceedings of the Affective Computing and Intelligent Interaction, 2007
2006
Security on Aydos et al's Elliptic Curve Cryptography Based Wireless Authentication Protocol.
J. Comput. Res. Dev., 2006
Online Selection of Discriminative Features Using Bayes Error Rate for Visual Tracking.
Proceedings of the Advances in Multimedia Information Processing, 2006
A New Steganalysis Method for Adaptive Spread Spectrum Steganography.
Proceedings of the Second International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2006), 2006
Genetic Algorithm Based Optimal Block Mapping Method for LSB Substitution.
Proceedings of the Second International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2006), 2006
A PCA Based Visual DCT Feature Extraction Method for Lip-Reading.
Proceedings of the Second International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2006), 2006
Real Time Large Vocabulary Continuous Sign Language Recognition Based on OP/Viterbi Algorithm.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006
A Novel Volumetric Shape from Silhouette Algorithm Based on a Centripetal Pentahedron Model.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006
Visual Hull Embossment by Graph Cuts.
Proceedings of the International Conference on Image Processing, 2006
2005
Illumination Invariant Feature Selection for Face Recognition.
Proceedings of the Advances in Multimedia Information Processing, 2005
Watermark Detection Schemes with High Security.
Proceedings of the International Symposium on Information Technology: Coding and Computing (ITCC 2005), 2005
Spatial Video Watermarking Based on Stability of DC Coefficients.
Proceedings of the Advances in Machine Learning and Cybernetics, 2005
An active volumetric model for 3D reconstruction.
Proceedings of the 2005 International Conference on Image Processing, 2005
Image Evaluation Factors.
Proceedings of the Image Analysis and Recognition, Second International Conference, 2005
Visual Sign Language Recognition Based on HMMs and Auto-regressive HMMs.
Proceedings of the Gesture in Human-Computer Interaction and Simulation, 2005
Static Gesture Quantization and DCT Based Sign Language Generation.
Proceedings of the Affective Computing and Intelligent Interaction, 2005
An Information Acquiring Channel - Lip Movement.
Proceedings of the Affective Computing and Intelligent Interaction, 2005
The Bunch-Active Shape Model.
Proceedings of the Affective Computing and Intelligent Interaction, 2005
An Adaptive Dandelion Model for Reconstructing Spherical Terrain-Like Visual Hull Surfaces.
Proceedings of the Fifth International Conference on 3D Digital Imaging and Modeling (3DIM 2005), 2005
2004
Mapping Energy Video Watermarking Algorithm Based on Compressed Domain.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004
Online Learning Objectionable Image Filter Based on SVM.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004
A Fragile Watermarking Based on Knapsack Problem.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004
A Fragile Image Watermarking Based on Mass and Centroid.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004
Steganalysis of Data Hiding Techniques in Wavelet Domain.
Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC'04), 2004
A texture-based tamper detection scheme by fragile watermark.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004
A robust watermarking method based on wavelet and Zernike transform.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004
Multilayer architecture in sign language recognition system.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004
Based on HMM and SVM multilayer architecture classifier for Chinese sign language recognition with large vocabulary.
Proceedings of the Third International Conference on Image and Graphics, 2004
Inter-frame Differential Energy Video Watermarking Algorithm Based on Compressed Domain.
Proceedings of the Image Analysis and Recognition: International Conference, 2004
Fast Moving Region Detection Scheme in Ad Hoc Sensor Network.
Proceedings of the Image Analysis and Recognition: International Conference, 2004
2003
Steganalysis of Images Created in Wavelet Domain Using Quantization Modulation.
Proceedings of the Wavelet Analysis and Its Applications, 2003
Neural network based steganalysis in still images.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003
Visual Features Extracting & Selecting for Lipreading.
Proceedings of the Audio-and Video-Based Biometrie Person Authentication, 2003
2002
Secure Watermark Verification Scheme.
Proceedings of the Advances in Multimedia Information Processing, 2002
A High Capacity Data Hiding Scheme Based on DFT.
Proceedings of the Advances in Multimedia Information Processing, 2002
System Identification for Nonlinear Control Using Lifted Wavelet.
Proceedings of the 6th Joint Conference on Information Science, 2002
Mouth-Shape Classification and Recognition for Lipreading.
Proceedings of the 6th Joint Conference on Information Science, 2002
Recognition of Strong and Weak Connection Models in Continuous Sign Language.
Proceedings of the 16th International Conference on Pattern Recognition, 2002
Robust Frontal Face Detection in Complex Environment.
Proceedings of the 16th International Conference on Pattern Recognition, 2002
Blind watermarking method based on DWT middle frequency pair.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002
2001
Face detection and location based on skin chrominance and lip chrominance transformation from color images.
Pattern Recognit., 2001
Fusion of Biometrics Based on D-S Theory.
Proceedings of the Advances in Multimedia Information Processing, 2001
2000
Towards robust lipreading.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000