Shuicheng Yan

Orcid: 0000-0001-8906-3777

Affiliations:
  • National University of Singapore, Department of Electrical and Computer Engineering
  • University of Illinois, Department of Electrical and Computer Engineering


According to our database1, Shuicheng Yan authored at least 846 papers between 2002 and 2024.

Collaborative distances:

Awards

ACM Fellow

ACM Fellow 2020, "For contributions to visual content understanding techniques and application".

IEEE Fellow

IEEE Fellow 2017, "For contributions to subspace learning and visual classification".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Enhancing Video-Language Representations With Structural Spatio-Temporal Alignment.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Instant3D: Instant Text-to-3D Generation.
Int. J. Comput. Vis., October, 2024

Towards Understanding Convergence and Generalization of AdamW.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

Arbitrary Virtual Try-on Network: Characteristics Preservation and Tradeoff between Body and Clothing.
ACM Trans. Multim. Comput. Commun. Appl., May, 2024

SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Enhancing Visual Grounding in Vision-Language Pre-Training With Position-Guided Text Prompts.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Learning to Synthesize Compatible Fashion Items Using Semantic Alignment and Collocation Classification: An Outfit Generation Framework.
IEEE Trans. Neural Networks Learn. Syst., April, 2024

On the Equivalence of Linear Discriminant Analysis and Least Squares Regression.
IEEE Trans. Neural Networks Learn. Syst., April, 2024

FRC-Net: A Simple Yet Effective Architecture for Low-Light Image Enhancement.
IEEE Trans. Consumer Electron., February, 2024

MetaFormer Baselines for Vision.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation.
IEEE Trans. Multim., 2024

Rethinking the Person Localization for Single-Stage Multi-Person Pose Estimation.
IEEE Trans. Multim., 2024

UniParser: Multi-Human Parsing With Unified Correlation Representation Learning.
IEEE Trans. Image Process., 2024

Win: Weight-Decay-Integrated Nesterov Acceleration for Faster Network Training.
J. Mach. Learn. Res., 2024

UniVST: A Unified Framework for Training-free Localized Video Style Transfer.
CoRR, 2024

Two are better than one: Context window extension with multi-grained self-injection.
CoRR, 2024

Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs.
CoRR, 2024

MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes.
CoRR, 2024

MoH: Multi-Head Attention as Mixture-of-Head Attention.
CoRR, 2024

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights.
CoRR, 2024

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis.
CoRR, 2024

Poison-splat: Computation Cost Attack on 3D Gaussian Splatting.
CoRR, 2024

MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts.
CoRR, 2024

Optimization Hyper-parameter Laws for Large Language Models.
CoRR, 2024

PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model.
CoRR, 2024

EasyInv: Toward Fast and Better DDIM Inversion.
CoRR, 2024

Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models - The Story Goes On.
CoRR, 2024

LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement.
CoRR, 2024

OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding.
CoRR, 2024

Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model.
CoRR, 2024

UIO-LLMs: Unbiased Incremental Optimization for Long-Context LLMs.
CoRR, 2024

Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning.
CoRR, 2024

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models.
CoRR, 2024

MVGamba: Unify 3D Content Generation as State Space Sequence Modeling.
CoRR, 2024

Towards Semantic Equivalence of Tokenization in Multimodal LLM.
CoRR, 2024

LongSkywork: A Training Recipe for Efficiently Extending Context Length in Large Language Models.
CoRR, 2024

Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model.
CoRR, 2024

EditWorld: Simulating World Dynamics for Instruction-Following Image Editing.
CoRR, 2024

DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries.
CoRR, 2024

Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction.
CoRR, 2024

AgentStudio: A Toolkit for Building General Virtual Agents.
CoRR, 2024

Explore In-Context Segmentation via Latent Diffusion Models.
CoRR, 2024

Data Augmentation in Human-Centric Vision.
CoRR, 2024

Point Cloud Mamba: Point Cloud Learning via State Space Model.
CoRR, 2024

Have Seen Me Before? Automating Dataset Updates Towards Reliable and Timely Evaluation.
CoRR, 2024

See the Unseen: Better Context-Consistent Knowledge-Editing by Noises.
CoRR, 2024

Spikformer V2: Join the High Accuracy Club on ImageNet with an SNN Ticket.
CoRR, 2024

Learning to Optimize for Reinforcement Learning.
RLJ, 2024

DGMamba: Domain Generalization via Generalized State Space Model.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Generative AI in Multimedia: Challenges and Opportunities for Academic and Industrial Impact.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning and Beyond.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Reinforcement Learning from Diverse Human Preferences.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Auto-Encoding Morph-Tokens for Multimodal LLM.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Non-confusing Generation of Customized Concepts in Diffusion Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Improving Video Segmentation via Dynamic Anchor Queries.
Proceedings of the Computer Vision - ECCV 2024, 2024

Region-Native Visual Tokenization.
Proceedings of the Computer Vision - ECCV 2024, 2024

BAFFLE: A Baseline of Backpropagation-Free Federated Learning.
Proceedings of the Computer Vision - ECCV 2024, 2024

InceptionNeXt: When Inception Meets ConvNeXt.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Towards Garment Sewing Pattern Reconstruction from a Single Image.
ACM Trans. Graph., December, 2023

Data-Driven single image deraining: A Comprehensive review and new perspectives.
Pattern Recognit., November, 2023

Contrastive Video Question Answering via Video Graph Transformer.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

TransZero++: Cross Attribute-Guided Transformer for Zero-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Deep Long-Tailed Learning: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

VOLO: Vision Outlooker for Visual Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Toward Intelligent Design: An AI-Based Fashion Designer Using Generative Adversarial Networks Aided by Sketch and Rendering Generators.
IEEE Trans. Multim., 2023

Vision Permutator: A Permutable MLP-Like Architecture for Visual Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Clarity ChatGPT: An Interactive and Adaptive Processing System for Image Restoration and Enhancement.
CoRR, 2023

Skywork: A More Open Bilingual Foundation Model.
CoRR, 2023

Heterogenous Memory Augmented Neural Networks.
CoRR, 2023

SimTeG: A Frustratingly Simple Approach Improves Textual Graph Learning.
CoRR, 2023

Offline Prioritized Experience Replay.
CoRR, 2023

Efficient Multi-Grained Knowledge Reuse for Class Incremental Segmentation.
CoRR, 2023

Improving and Benchmarking Offline Reinforcement Learning Algorithms.
CoRR, 2023

Generative Table Pre-training Empowers Models for Tabular Prediction.
CoRR, 2023

A Review of Deep Learning for Video Captioning.
CoRR, 2023

CoSDA: Continual Source-Free Domain Adaptation.
CoRR, 2023

Does Federated Learning Really Need Backpropagation?
CoRR, 2023

On Calibrating Diffusion Probabilistic Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Efficient Diffusion Policies For Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Gaussian Mixture Solvers for Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Mutual Information Regularized Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Decoupled Cross-Scale Cross-View Interaction for Stereo Image Enhancement in the Dark.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

EditAnything: Empowering Unparalleled Flexibility in Image Editing and Generation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

RRAM-PoolFormer: A Resistive Memristor-based PoolFormer Modeling and Training Framework for Edge-AI Applications.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2023

Bag of Tricks for Training Data Extraction from Language Models.
Proceedings of the International Conference on Machine Learning, 2023

Better Diffusion Models Further Improve Adversarial Training.
Proceedings of the International Conference on Machine Learning, 2023

Nonparametric Generative Modeling with Conditional Sliced-Wasserstein Flows.
Proceedings of the International Conference on Machine Learning, 2023

Spikformer: When Spiking Neural Network Meets Transformer.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Win: Weight-Decay-Integrated Nesterov Acceleration for Adaptive Gradient Algorithms.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Distributional Meta-Gradient Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Revisiting Intrinsic Reward for Exploration in Procedurally Generated Environments.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Towards Understanding Why Mask Reconstruction Pretraining Helps in Downstream Tasks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Arbitrary Virtual Try-on Network: Characteristics Representation and Trade-off between Body and Clothing.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Efficient Offline Policy Optimization with a Learned Model.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Visual Imitation Learning with Patch Rewards.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

D4FT: A Deep Learning Approach to Kohn-Sham Density Functional Theory.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

LPT: Long-tailed Prompt Tuning for Image Classification.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Bag of Tricks for Unsupervised Text-to-Speech.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

RPM: Generalizable Multi-Agent Policies for Multi-Agent Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Masked Diffusion Transformer is a Strong Image Synthesizer.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Generative Table Pre-training Empowers Models for Tabular Prediction.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Exploring Incompatible Knowledge Transfer in Few-shot Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Position-Guided Text Prompt for Vision-Language Pre-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
DRNet: Double Recalibration Network for Few-Shot Semantic Segmentation.
IEEE Trans. Image Process., 2022

Towards Age-Invariant Face Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Human-Centric Relation Segmentation: Dataset and Solution.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Fine-Grained Human-Centric Tracklet Segmentation with Single Frame Supervision.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

PSGAN++: Robust Detail-Preserving Makeup Transfer and Removal.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Position-guided Text Prompt for Vision-Language Pre-training.
CoRR, 2022

NoiSER: Noise is All You Need for Low-Light Image Enhancement.
CoRR, 2022

Towards Sustainable Self-supervised Learning.
CoRR, 2022

RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning.
CoRR, 2022

Boosting Offline Reinforcement Learning via Data Rebalancing.
CoRR, 2022

AdaptivePose++: A Powerful Single-Stage Network for Multi-Person Pose Regression.
CoRR, 2022

Seeing Through The Noisy Dark: Toward Real-world Low-Light Image Enhancement and Denoising.
CoRR, 2022

O(N<sup>2</sup>) Universal Antisymmetry in Fermionic Neural Networks.
CoRR, 2022

ClusterQ: Semantic Feature Distribution Alignment for Data-Free Quantization.
CoRR, 2022

Mugs: A Multi-Granular Self-Supervised Learning Framework.
CoRR, 2022

Modern Augmented Reality: Applications, Trends, and Future Directions.
CoRR, 2022

Towards Class Interpretable Vision Transformer with Multi-Class-Tokens.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Inception Transformer.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Deep Multi-Resolution Mutual Learning for Image Inpainting.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

CRNet: Unsupervised Color Retention Network for Blind Motion Deblurring.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Robust Attention Deraining Network for Synchronous Rain Streaks and Raindrops Removal.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Robustness and Accuracy Could Be Reconcilable by (Proper) Definition.
Proceedings of the International Conference on Machine Learning, 2022

Towards Feature Distribution Alignment and Diversity Enhancement for Data-Free Quantization.
Proceedings of the IEEE International Conference on Data Mining, 2022

Video Graph Transformer for Video Question Answering.
Proceedings of the Computer Vision - ECCV 2022, 2022

DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Self-Promoted Supervision for Few-Shot Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering.
Proceedings of the Computer Vision - ECCV 2022, 2022

Improving Vision Transformers by Revisiting High-Frequency Components.
Proceedings of the Computer Vision, 2022

Deep Color Consistent Network for Low-Light Image Enhancement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MetaFormer is Actually What You Need for Vision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
DLRF-Net: A Progressive Deep Latent Low-Rank Fusion Network for Hierarchical Subspace Discovery.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Multi-human Parsing with a Graph-based Generative Adversarial Model.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Scene Graph Generation With Hierarchical Context.
IEEE Trans. Neural Networks Learn. Syst., 2021

Learning Target-Domain-Specific Classifier for Partial Domain Adaptation.
IEEE Trans. Neural Networks Learn. Syst., 2021

Flexible Auto-Weighted Local-Coordinate Concept Factorization: A Robust Framework for Unsupervised Clustering.
IEEE Trans. Knowl. Data Eng., 2021

DerainCycleGAN: Rain Attentive CycleGAN for Single Image Deraining and Rainmaking.
IEEE Trans. Image Process., 2021

Heterogeneous Domain Adaptation via Covariance Structured Feature Translators.
IEEE Trans. Cybern., 2021

Faster First-Order Methods for Stochastic Non-Convex Optimization on Riemannian Manifolds.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Tensor Low-Rank Representation for Data Recovery and Clustering.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Editorial: IMAVIS special issue on deep cross-media neural model for generating image descriptions.
Image Vis. Comput., 2021

A Survey on Concept Factorization: From Shallow to Deep Representation Learning.
Inf. Process. Manag., 2021

Dual-Constrained Deep Semi-Supervised Coupled Factorization Network with Enriched Prior.
Int. J. Comput. Vis., 2021

TransZero++: Cross Attribute-Guided Transformer for Zero-Shot Learning.
CoRR, 2021

Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering.
CoRR, 2021

Arbitrary Virtual Try-On Network: Characteristics Preservation and Trade-off between Body and Clothing.
CoRR, 2021

Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet.
CoRR, 2021

Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Direct Multi-view Multi-person 3D Pose Estimation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

How Should Pre-Trained Language Models Be Fine-Tuned Towards Adversarial Robustness?
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Video Background Music Generation with Controllable Music Transformer.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Triplet Deep Subspace Clustering via Self-Supervised Data Augmentation.
Proceedings of the IEEE International Conference on Data Mining, 2021

PnP-DETR: Towards Efficient Visual Analysis with Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Partial-Label and Structure-constrained Deep Coupled Factorization Network.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Discriminative Local Sparse Representation by Robust Adaptive Dictionary Pair Learning.
IEEE Trans. Neural Networks Learn. Syst., 2020

Deep Subspace Clustering.
IEEE Trans. Neural Networks Learn. Syst., 2020

Collocating Clothes With Generative Adversarial Networks Cosupervised by Categories and Attributes: A Multidiscriminator Framework.
IEEE Trans. Neural Networks Learn. Syst., 2020

Learning Semisupervised Multilabel Fully Convolutional Network for Hierarchical Object Parsing.
IEEE Trans. Neural Networks Learn. Syst., 2020

Dual Adversarial Autoencoders for Clustering.
IEEE Trans. Neural Networks Learn. Syst., 2020

Joint Label Prediction Based Semi-Supervised Adaptive Concept Factorization for Robust Data Representation.
IEEE Trans. Knowl. Data Eng., 2020

Learning Hybrid Representation by Robust Dictionary Learning in Factorized Compressed Space.
IEEE Trans. Image Process., 2020

ORDNet: Capturing Omni-Range Dependencies for Scene Parsing.
IEEE Trans. Image Process., 2020

Joint Subspace Recovery and Enhanced Locality Driven Robust Flexible Discriminative Dictionary Learning.
IEEE Trans. Circuits Syst. Video Technol., 2020

Perspective-Adaptive Convolutions for Scene Parsing.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Joint Rain Detection and Removal from a Single Image with Contextualized Deep Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Tensor Robust Principal Component Analysis with a New Tensor Nuclear Norm.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Deep neural networks for emerging multimedia computing and applications.
Neurocomputing, 2020

Recognizing Profile Faces by Imagining Frontal View.
Int. J. Comput. Vis., 2020

Fine-Grained Multi-human Parsing.
Int. J. Comput. Vis., 2020

Special Issue on Generating Realistic Visual Data of Human Behavior.
Int. J. Comput. Vis., 2020

ProxylessKD: Direct Knowledge Distillation with Inherited Classifier for Face Recognition.
CoRR, 2020

Towards Accurate Human Pose Estimation in Videos of Crowded Scenes.
CoRR, 2020

A Simple Baseline for Pose Tracking in Videos of Crowded Scenes.
CoRR, 2020

Toward Accurate Person-level Action Recognition in Videos of Crowded Scenes.
CoRR, 2020

Dual Adversarial Auto-Encoders for Clustering.
CoRR, 2020

A Survey on Concept Factorization: From Shallow to Deep Representation Learning.
CoRR, 2020

Recapture as You Want.
CoRR, 2020

PANDA: Prototypical Unsupervised Domain Adaptation.
CoRR, 2020

Fast Dense Residual Network: Enhancing Global Dense Feature Flow for Text Recognition.
CoRR, 2020

ConvBERT: Improving BERT with Span-based Dynamic Convolution.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Toward Accurate Person-level Action Recognition in Videos of Crowed Scenes.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Neural Network Design for Multimedia: Bio-inspired and Hardware-friendly.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Beautify As You Like.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

InteractGAN: Learning to Generate Human-Object Interaction.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Towards Accurate Human Pose Estimation in Videos of Crowded Scenes.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

A Simple Baseline for Pose Tracking in Videos of Crowed Scenes.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Rethinking Bottleneck Structure for Efficient Mobile Network Design.
Proceedings of the Computer Vision - ECCV 2020, 2020

Highly Efficient Salient Object Detection with 100K Parameters.
Proceedings of the Computer Vision - ECCV 2020, 2020

Convolutional Dictionary Pair Learning Network for Image Representation Learning.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020

Multilayer Collaborative Low-Rank Coding Network for Robust Deep Subspace Discovery.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020

PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

AdversarialNAS: Adversarial Neural Architecture Search for GANs.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Online Robust Low-Rank Tensor Modeling for Streaming Data Analysis.
IEEE Trans. Neural Networks Learn. Syst., 2019

Unsupervised Nonnegative Adaptive Feature Extraction for Data Representation.
IEEE Trans. Knowl. Data Eng., 2019

Hierarchical Contextual Refinement Networks for Human Pose Estimation.
IEEE Trans. Image Process., 2019

Asymmetric GAN for Unpaired Image-to-Image Translation.
IEEE Trans. Image Process., 2019

Revisiting Jump-Diffusion Process for Visual Tracking: A Reinforcement Learning Approach.
IEEE Trans. Circuits Syst. Video Technol., 2019

Toward a Comprehensive Face Detector in the Wild.
IEEE Trans. Circuits Syst. Video Technol., 2019

Kernel-Induced Label Propagation by Mapping for Semi-Supervised Classification.
IEEE Trans. Big Data, 2019

UP-CNN: Un-pooling augmented convolutional neural network.
Pattern Recognit. Lett., 2019

3D-Aided Dual-Agent GANs for Unconstrained Face Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Recurrent Face Aging with Hierarchical AutoRegressive Memory.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Subspace Clustering by Block Diagonal Representation.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Recurrent Shape Regression.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

RC-DARTS: Resource Constrained Differentiable Architecture Search.
CoRR, 2019

Fast DenseNet: Towards Efficient and Accurate Text Recognition with Fast Dense Networks.
CoRR, 2019

Efficient Differentiable Neural Architecture Search with Meta Kernels.
CoRR, 2019

PSGAN: Pose-Robust Spatial-Aware GAN for Customizable Makeup Transfer.
CoRR, 2019

Task Relation Networks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Efficient Meta Learning via Minibatch Proximal Update.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Finding Images by Dialoguing with Image.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Multi-Prototype Networks for Unconstrained Set-based Face Recognition.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Very Long Natural Scenery Image Prediction by Outpainting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Single-Stage Multi-Person Pose Machines.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Robust Unsupervised Flexible Auto-weighted Local-coordinate Concept Factorization for Image Clustering.
Proceedings of the IEEE International Conference on Acoustics, 2019

Graph-Based Global Reasoning Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Look across Elapse: Disentangled Representation Learning and Photorealistic Cross-Age Face Synthesis for Age-Invariant Face Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Robust Adaptive Embedded Label Propagation With Weight Learning for Inductive Classification.
IEEE Trans. Neural Networks Learn. Syst., 2018

Jointly Learning Structured Analysis Discriminative Dictionary and Analysis Multiclass Classifier.
IEEE Trans. Neural Networks Learn. Syst., 2018

Learning Temporal Information for Brain-Computer Interface Using Convolutional Neural Networks.
IEEE Trans. Neural Networks Learn. Syst., 2018

Deep Salient Object Detection With Dense Connections and Distraction Diagnosis.
IEEE Trans. Multim., 2018

Scale-Aware Fast R-CNN for Pedestrian Detection.
IEEE Trans. Multim., 2018

Multistage Object Detection With Group Recursive Learning.
IEEE Trans. Multim., 2018

Learning Multi-Instance Deep Ranking and Regression Network for Visual House Appraisal.
IEEE Trans. Knowl. Data Eng., 2018

High-Precision Camera Localization in Scenes with Repetitive Patterns.
ACM Trans. Intell. Syst. Technol., 2018

Robust LSTM-Autoencoders for Face De-Occlusion in the Wild.
IEEE Trans. Image Process., 2018

Flexible Manifold Learning With Optimal Graph for Image and Video Representation.
IEEE Trans. Image Process., 2018

Landmark Free Face Attribute Prediction.
IEEE Trans. Image Process., 2018

Implicit Negative Sub-Categorization and Sink Diversion for Object Detection.
IEEE Trans. Image Process., 2018

FatRegion: A Fast Adaptive Tree-Structured Region Extraction Approach.
IEEE Trans. Circuits Syst. Video Technol., 2018

First-Person Daily Activity Recognition With Manipulated Object Proposals and Non-Linear Feature Fusion.
IEEE Trans. Circuits Syst. Video Technol., 2018

Image Classification With Tailored Fine-Grained Dictionaries.
IEEE Trans. Circuits Syst. Video Technol., 2018

Video-Based Person Re-Identification With Accumulative Motion Context.
IEEE Trans. Circuits Syst. Video Technol., 2018

Deep Recurrent Regression for Facial Landmark Detection.
IEEE Trans. Circuits Syst. Video Technol., 2018

Object Proposal Generation With Fully Convolutional Networks.
IEEE Trans. Circuits Syst. Video Technol., 2018

Learning with rethinking: Recurrently improving convolutional neural networks through feedback.
Pattern Recognit., 2018

Towards Robust and Accurate Multi-View and Partially-Occluded Face Alignment.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Personalized Age Progression with Bi-Level Aging Dictionary Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

A Unified Alternating Direction Method of Multipliers by Majorization Minimization.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Proposal-Free Network for Instance-Level Object Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

SRNN: Self-regularized neural network.
Neurocomputing, 2018

Learning supervised descent directions for optic disc segmentation.
Neurocomputing, 2018

Attentive Systems: A Survey.
Int. J. Comput. Vis., 2018

Video super-resolution based on spatial-temporal recurrent residual networks.
Comput. Vis. Image Underst., 2018

A<sup>2</sup>-Nets: Double Attention Networks.
CoRR, 2018

Look Across Elapse: Disentangled Representation Learning and Photorealistic Cross-Age Face Synthesis for Age-Invariant Face Recognition.
CoRR, 2018

Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing.
CoRR, 2018

A^2-Nets: Double Attention Networks.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Style Separation and Synthesis via Generative Adversarial Networks.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Multi-Human Parsing Machines.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

3D-Aided Deep Pose-Invariant Face Recognition.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

High Resolution Feature Recovering for Accelerating Urban Scene Parsing.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Exact Low Tubal Rank Tensor Recovery from Gaussian Measurements.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Sharing Residual Units Through Collective Tensor Factorization To Improve Deep Neural Networks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Robust Locality-Constrained Label Consistent K-SVD by Joint Sparse Embedding.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Robust Discriminative Projective Dictionary Pair Learning by Adaptive Representations.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Robust Projective Low-Rank and Sparse Representation by Robust Dictionary Learning.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

WSNet: Compact and Efficient Networks Through Weight Sampling.
Proceedings of the 35th International Conference on Machine Learning, 2018

Dynamic Conditional Networks for Few-Shot Learning.
Proceedings of the Computer Vision - ECCV 2018, 2018

Mutual Learning to Adapt for Joint Human Parsing and Pose Estimation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Pose Partition Networks for Multi-person Pose Estimation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Multi-fiber Networks for Video Recognition.
Proceedings of the Computer Vision - ECCV 2018, 2018

Towards Pose Invariant Face Recognition in the Wild.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Neural Style Transfer via Meta Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Human Pose Estimation With Parsing Induced Learner.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning and Thinking Strategy for Training Sequence Generation Models.
Proceedings of the British Machine Vision Conference 2018, 2018

Nonconvex Sparse Spectral Clustering by Alternating Direction Method of Multipliers and Its Convergence Analysis.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Diversified Visual Attention Networks for Fine-Grained Object Classification.
IEEE Trans. Multim., 2017

Attentive Contexts for Object Detection.
IEEE Trans. Multim., 2017

Visual Classification of Furniture Styles.
ACM Trans. Intell. Syst. Technol., 2017

Event Classification in Microblogs via Social Tracking.
ACM Trans. Intell. Syst. Technol., 2017

Robust Neighborhood Preserving Projection by Nuclear/L2, 1-Norm Regularization for Image Feature Extraction.
IEEE Trans. Image Process., 2017

Deep Edge Guided Recurrent Residual Learning for Image Super-Resolution.
IEEE Trans. Image Process., 2017

End-to-End Comparative Attention Networks for Person Re-Identification.
IEEE Trans. Image Process., 2017

Facial Age Estimation With Age Difference.
IEEE Trans. Image Process., 2017

Cross-Modal Retrieval With CNN Visual Features: A New Baseline.
IEEE Trans. Cybern., 2017

Cross-Scale Cost Aggregation for Stereo Matching.
IEEE Trans. Circuits Syst. Video Technol., 2017

Hybrid CNN and Dictionary-Based Models for Scene Recognition and Domain Adaptation.
IEEE Trans. Circuits Syst. Video Technol., 2017

Layerwise Class-Aware Convolutional Neural Network.
IEEE Trans. Circuits Syst. Video Technol., 2017

Discriminative sparse flexible manifold embedding with novel graph for robust visual representation and label propagation.
Pattern Recognit., 2017

LG-CNN: From local parts to global discrimination for fine-grained recognition.
Pattern Recognit., 2017

STC: A Simple to Complex Framework for Weakly-Supervised Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Tri-Clustered Tensor Completion for Social-Aware Image Tag Refinement.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Human Parsing with Contextualized Convolutional Neural Network.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Learning to Segment Human by Watching YouTube.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Robust Alternating Low-Rank Representation by joint Lp- and L2, p-norm minimization.
Neural Networks, 2017

SDE: A Novel Selective, Discriminative and Equalizing Feature Representation for Visual Recognition.
Int. J. Comput. Vis., 2017

A survey on deep learning-based fine-grained object classification and semantic segmentation.
Int. J. Autom. Comput., 2017

Editorial- Deep Learning for Computer Vision.
Comput. Vis. Image Underst., 2017

BT-Nets: Simplifying Deep Neural Networks via Block Term Decomposition.
CoRR, 2017

Weaving Multi-scale Context for Single Shot Detector.
CoRR, 2017

Learning Object Detectors from Scratch with Gated Recurrent Feature Pyramids.
CoRR, 2017

WSNet: Compact and Efficient Networks with Weight Sampling.
CoRR, 2017

Personalized and Occupational-aware Age Progression by Generative Adversarial Networks.
CoRR, 2017

HashGAN: Attention-aware Deep Adversarial Hashing for Cross Modal Retrieval.
CoRR, 2017

Deep Sparse Subspace Clustering.
CoRR, 2017

Meta Networks for Neural Style Transfer.
CoRR, 2017

Discriminative Similarity for Clustering and Semi-Supervised Learning.
CoRR, 2017

Generative Partition Networks for Multi-Person Pose Estimation.
CoRR, 2017

Sharing Residual Units Through Collective Tensor Factorization in Deep Neural Networks.
CoRR, 2017

Dual-Agent GANs for Photorealistic and Identity Preserving Profile Face Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Predicting Scene Parsing and Motion Dynamics in the Future.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Dual Path Networks.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Time Traveler: A Real-time Face Aging System.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Face Aging with Contextual Generative Adversarial Nets.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Integrated Face Analytics Networks through Cross-Dataset Hybrid Training.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Global-residual and Local-boundary Refinement Networks for Rectifying Scene Parsing Predictions.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Online Robust Low-Rank Tensor Learning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Training Group Orthogonal Neural Networks with Privileged Information.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Robust Projective Dictionary Learning by Joint Label Embedding and Classification.
Proceedings of the 2017 IEEE International Conference on Data Mining Workshops, 2017

Scale-Adaptive Convolutions for Scene Parsing.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Neural Person Search Machines.
Proceedings of the IEEE International Conference on Computer Vision, 2017

FoveaNet: Perspective-Aware Urban Scene Parsing.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Video Scene Parsing with Predictive Feature Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Self-Supervised Neural Aggregation Networks for Human Parsing.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Deep Joint Rain Detection and Removal from a Single Image.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

3D-Assisted Coarse-to-Fine Extreme-Pose Facial Landmark Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Object Region Mining with Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Semantic Segmentation via Structured Patch Prediction, Context CRF and Guidance CRF.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Interpretable Structure-Evolving LSTM.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Perceptual Generative Adversarial Networks for Small Object Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Estimation of Affective Level in the Wild with Multiple Memory Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

More is Less: A More Complicated Network with Less Inference Complexity.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Marginalized CNN: Learning Deep Invariant Representations.
Proceedings of the British Machine Vision Conference 2017, 2017

Multi-Path Feedback Recurrent Neural Networks for Scene Parsing.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Sparse Codes Auto-Extractor for Classification: A Joint Embedding and Dictionary Learning Framework for Representation.
IEEE Trans. Signal Process., 2016

Dual Low-Rank Pursuit: Learning Salient Features for Saliency Detection.
IEEE Trans. Neural Networks Learn. Syst., 2016

Clothing Cosegmentation for Shopping Images With Cluttered Background.
IEEE Trans. Multim., 2016

Deep Relative Attributes.
IEEE Trans. Multim., 2016

Image Classification by Selective Regularized Subspace Learning.
IEEE Trans. Multim., 2016

Deep Aging Face Verification With Large Gaps.
IEEE Trans. Multim., 2016

Clothes Co-Parsing Via Joint Image Segmentation and Labeling With Application to Clothing Retrieval.
IEEE Trans. Multim., 2016

Modality-Dependent Cross-Media Retrieval.
ACM Trans. Intell. Syst. Technol., 2016

Multitask Low-Rank Affinity Graph for Image Segmentation and Image Annotation.
ACM Trans. Intell. Syst. Technol., 2016

Joint Low-Rank and Sparse Principal Feature Coding for Enhanced Robust Representation and Visual Classification.
IEEE Trans. Image Process., 2016

Learning of Multimodal Representations With Random Walks on the Click Graph.
IEEE Trans. Image Process., 2016

Convex Sparse Spectral Clustering: Single-View to Multi-View.
IEEE Trans. Image Process., 2016

Nonconvex Nonsmooth Low Rank Minimization via Iteratively Reweighted Nuclear Norm.
IEEE Trans. Image Process., 2016

Relative Forest for Visual Attribute Prediction.
IEEE Trans. Image Process., 2016

Instance-Aware Hashing for Multi-Label Image Retrieval.
IEEE Trans. Image Process., 2016

Scale-Aware Pixelwise Object Proposal Networks.
IEEE Trans. Image Process., 2016

Multi-loss Regularized Deep Neural Network.
IEEE Trans. Circuits Syst. Video Technol., 2016

Convolutional Fusion Network for Face Verification in the Wild.
IEEE Trans. Circuits Syst. Video Technol., 2016

Cast2Face: Assigning Character Names Onto Faces in Movie With Actor-Character Correspondence.
IEEE Trans. Circuits Syst. Video Technol., 2016

Learning to segment with image-level annotations.
Pattern Recognit., 2016

Kinship-Guided Age Progression.
Pattern Recognit., 2016

HCP: A Flexible CNN Framework for Multi-Label Image Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

A Deterministic Analysis for LRR.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

NUS-PRO: A New Visual Tracking Challenge.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Parsing Based on Parselets: A Unified Deformable Mixture Model for Human Parsing.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Event-based media processing and analysis: A survey of the literature.
Image Vis. Comput., 2016

Special Issue on Event-based Media Processing and Analysis.
Image Vis. Comput., 2016

Peak-Piloted Deep Network for Facial Expression Recognition.
CoRR, 2016

Joint Rain Detection and Removal via Iterative Region Dependent Multi-Task Learning.
CoRR, 2016

Visual Processing by a Unified Schatten-p Norm and ℓ<sub>q</sub> Norm Regularized Principal Component Pursuit.
CoRR, 2016

Multi-stage Object Detection with Group Recursive Learning.
CoRR, 2016

Training Skinny Deep Neural Networks with Iterative Hard Thresholding Methods.
CoRR, 2016

Multi-Path Feedback Recurrent Neural Network for Scene Parsing.
CoRR, 2016

Scale-aware Pixel-wise Object Proposal Networks.
CoRR, 2016

A Focused Dynamic Attention Model for Visual Question Answering.
CoRR, 2016

Seq-NMS for Video Object Detection.
CoRR, 2016

Keynotes: Deep learning for visual understanding: Effectiveness vs. efficiency.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Tree-Structured Reinforcement Learning for Sequential Object Localization.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Computational Face Reader.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

A Live Face Swapper.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Nuclear-norm regularized neighborhood preserving projection.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Peak-Piloted Deep Network for Facial Expression Recognition.
Proceedings of the Computer Vision - ECCV 2016, 2016

Robust Facial Landmark Detection via Recurrent Attentive-Refinement Networks.
Proceedings of the Computer Vision - ECCV 2016, 2016

Semantic Object Parsing with Graph LSTM.
Proceedings of the Computer Vision - ECCV 2016, 2016

Collaborative Layer-Wise Discriminative Learning in Deep Neural Networks.
Proceedings of the Computer Vision - ECCV 2016, 2016

Recurrent Face Aging.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Reversible Recursive Instance-Level Object Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Semantic Object Parsing with Local-Global Long Short-Term Memory.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Recurrently Target-Attending Tracking.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Projective Unsupervised Flexible Embedding with Optimal Graph.
Proceedings of the British Machine Vision Conference 2016, 2016

Fast Proximal Linearized Alternating Direction Method of Multiplier with Parallel Splitting.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Deep Learning with S-Shaped Rectified Linear Activation Units.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Learning Feature Hierarchies: A Layer-Wise Tag-Embedded Approach.
IEEE Trans. Multim., 2015

Fashion Parsing With Video Context.
IEEE Trans. Multim., 2015

Understanding Blooming Human Groups in Social Networks.
IEEE Trans. Multim., 2015

Disease Inference from Health-Related Questions via Sparse Deep Learning.
IEEE Trans. Knowl. Data Eng., 2015

Visual Understanding with RGB-D Sensors: An Introduction to the Special Issue.
ACM Trans. Intell. Syst. Technol., 2015

Smoothed Low Rank and Sparse Matrix Recovery by Iteratively Reweighted Least Squares Minimization.
IEEE Trans. Image Process., 2015

Horror Image Recognition Based on Context-Aware Multi-Instance Learning.
IEEE Trans. Image Process., 2015

Angular-Similarity-Preserving Binary Signatures for Linear Subspaces.
IEEE Trans. Image Process., 2015

Max-Confidence Boosting With Uncertainty for Visual Tracking.
IEEE Trans. Image Process., 2015

Age Estimation via Grouping and Decision Fusion.
IEEE Trans. Inf. Forensics Secur., 2015

Facilitating Image Search With a Scalable and Compact Semantic Mapping.
IEEE Trans. Cybern., 2015

Data-Driven Affective Filtering for Images and Videos.
IEEE Trans. Cybern., 2015

Discriminative Analysis for Symmetric Positive Definite Matrices on Lie Groups.
IEEE Trans. Circuits Syst. Video Technol., 2015

Facial Analysis With a Lie Group Kernel.
IEEE Trans. Circuits Syst. Video Technol., 2015

Segmentation Over Detection via Optimal Sparse Reconstructions.
IEEE Trans. Circuits Syst. Video Technol., 2015

Background Context Augmented Hypothesis Graph for Object Segmentation.
IEEE Trans. Circuits Syst. Video Technol., 2015

STAP: Spatial-Temporal Attention-Aware Pooling for Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2015

Adaptive Nonparametric Image Parsing.
IEEE Trans. Circuits Syst. Video Technol., 2015

Clothing Attributes Assisted Person Reidentification.
IEEE Trans. Circuits Syst. Video Technol., 2015

Crowded Scene Analysis: A Survey.
IEEE Trans. Circuits Syst. Video Technol., 2015

Looking Inside Category: Subcategory-Aware Object Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2015

Visual data denoising with a unified Schatten-p norm and ℓ<sub>q</sub> norm regularized principal component pursuit.
Pattern Recognit., 2015

On robust image spam filtering via comprehensive visual modeling.
Pattern Recognit., 2015

Order Preserving Sparse Coding.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Dense Subgraph Partition of Positive Hypergraphs.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Deep Human Parsing with Active Template Regression.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Contextualizing Object Detection and Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Bilinear low-rank coding framework and extension for robust image recovery and feature representation.
Knowl. Based Syst., 2015

Weakly-supervised scene parsing with multiple contextual cues.
Inf. Sci., 2015

Visibility-aware part model for robust facial point detection.
Neurocomputing, 2015

Collaborative Linear Coding for Robust Image Classification.
Int. J. Comput. Vis., 2015

Pose Adaptive Motion Feature Pooling for Human Action Analysis.
Int. J. Comput. Vis., 2015

STC: A Simple to Complex Framework for Weakly-supervised Semantic Segmentation.
CoRR, 2015

Group $K$-Means.
CoRR, 2015

Purine: A bi-graph based deep learning framework.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Cross-pose Face Recognition by Canonical Correlation Analysis.
CoRR, 2015

Scale-aware Fast R-CNN for Pedestrian Detection.
CoRR, 2015

What Shall I Look Like after N Years?
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Deep Face Beautification.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Automatic Feature Learning for Glaucoma Detection Based on Deep Learning.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015 - 18th International Conference Munich, Germany, October 5, 2015

Discriminative Feature Selection for Multiple Ocular Diseases Classification by Sparse Induced Graph Regularized Group Lasso.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015, 2015

Facial Landmark Detection via Progressive Initialization.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

Conditional Convolutional Neural Network for Modality-Aware Face Recognition.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Task-Driven Feature Pooling for Image Classification.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Additive Nearest Neighbor Feature Maps.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Personalized Age Progression with Aging Dictionary.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Human Parsing with Contextualized Convolutional Neural Network.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Cross-Domain Image Retrieval with a Dual Attribute-Aware Ranking Network.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Parallel convolutional-linear neural network for motor imagery classification.
Proceedings of the 23rd European Signal Processing Conference, 2015

Structural Sparse Tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Motion Part Regularization: Improving action recognition via trajectory group selection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Matching-CNN meets KNN: Quasi-parametric human parsing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Shape driven kernel adaptation in Convolutional Neural Network for robust facial trait recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

SOLD: Sub-optimal low-rank decomposition for efficient video segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Simultaneous feature learning and hash coding with deep neural networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Deep domain adaptation for describing people based on fine-grained clothing attributes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Generalized Singular Value Thresholding.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Person Re-identification by Attribute-Assisted Clothes Appearance.
Proceedings of the Person Re-Identification, 2014

Attribute-Augmented Semantic Hierarchy: Towards a Unified Framework for Content-Based Image Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2014

"Wow! You Are So Beautiful Today!".
ACM Trans. Multim. Comput. Commun. Appl., 2014

Circle & Search: Attribute-Aware Shoe Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2014

Adaptive Learning for Celebrity Identification With Video Context.
IEEE Trans. Multim., 2014

Touch Saliency: Characteristics and Prediction.
IEEE Trans. Multim., 2014

Fashion Parsing With Weak Color-Category Labels.
IEEE Trans. Multim., 2014

PicWords: Render a Picture by Packing Keywords.
IEEE Trans. Multim., 2014

Snap & Play: Auto-Generated Personalized Find-the-Difference Game.
ACM Trans. Intell. Syst. Technol., 2014

Robust (Semi) Nonnegative Graph Embedding.
IEEE Trans. Image Process., 2014

A General Exponential Framework for Dimensionality Reduction.
IEEE Trans. Image Process., 2014

Unified Structured Learning for Simultaneous Human Pose Estimation and Garment Attribute Classification.
IEEE Trans. Image Process., 2014

Nonnegative Tensor Cofactorization and Its Unified Solution.
IEEE Trans. Image Process., 2014

Autogrouped Sparse Representation for Visual Analysis.
IEEE Trans. Image Process., 2014

Decomposition and Extraction: A New Framework for Visual Classification.
IEEE Trans. Image Process., 2014

Robust Face Recognition via Adaptive Sparse Representation.
IEEE Trans. Cybern., 2014

Exposure Fusion Using Boosting Laplacian Pyramid.
IEEE Trans. Cybern., 2014

Geometric Optimum Experimental Design for Collaborative Image Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2014

Body Surface Context: A New Robust Feature for Action Recognition From Depth Videos.
IEEE Trans. Circuits Syst. Video Technol., 2014

Video De-Fencing.
IEEE Trans. Circuits Syst. Video Technol., 2014

Toward Large-Population Face Identification in Unconstrained Videos.
IEEE Trans. Circuits Syst. Video Technol., 2014

Object Tracking With Only Background Cues.
IEEE Trans. Circuits Syst. Video Technol., 2014

Face Authentication With Makeup Changes.
IEEE Trans. Circuits Syst. Video Technol., 2014

Audio Matters in Visual Attention.
IEEE Trans. Circuits Syst. Video Technol., 2014

Batch-Orthogonal Locality-Sensitive Hashingfor Angular Similarity.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Bin Ratio-Based Histogram Distances and Their Application to Image Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Similarity preserving low-rank representation for enhanced data representation and effective subspace learning.
Neural Networks, 2014

Seeing Human Weight from a Single RGB-D Image.
J. Comput. Sci. Technol., 2014

Recognizing human group action by layered model with multiple cues.
Neurocomputing, 2014

Harnessing Lab Knowledge for Real-World Action Recognition.
Int. J. Comput. Vis., 2014

Fashion Analysis: Current Techniques and Future Directions.
IEEE Multim., 2014

Guest Editorial: Special issue on large scale multimedia semantic indexing.
Comput. Vis. Image Underst., 2014

CNN: Single-label to Multi-label.
CoRR, 2014

Image Denoising with a Unified Schattern-p Norm and ℓ<sub>q</sub> Norm Regularization.
CoRR, 2014

Network In Network.
Proceedings of the 2nd International Conference on Learning Representations, 2014

Computational Baby Learning.
CoRR, 2014

Age group classification via structured fusion of uncertainty-driven shape features and selected surface features.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Adaptive Object Learning for Robot Carinet.
Proceedings of the Social Robotics - 6th International Conference, 2014

Convex Optimization Procedure for Clustering: Theoretical Revisit.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

On a Theory of Nonparametric Pairwise Similarity for Clustering: Connecting Clustering to Classification.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Robust Logistic Regression and Classification.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Start from Scratch: Towards Automatically Identifying, Modeling, and Naming Visual Attributes.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Optimized Distances for Binary Code Ranking.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Fashion Parsing with Video Context.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Deep Search with Attribute-aware Deep Network.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Robust bilinear matrix recovery by Tensor Low-Rank Representation.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Cross-media relevance mining for evaluating text-based image search engine.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Pipelining Localized Semantic Features for Fine-Grained Action Recognition.
Proceedings of the Computer Vision - ECCV 2014, 2014

Efficient k-Support Matrix Pursuit.
Proceedings of the Computer Vision - ECCV 2014, 2014

Towards Unified Object Detection and Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2014, 2014

Cross-Scale Cost Aggregation for Stereo Matching.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Towards Multi-view and Partially-Occluded Face Alignment.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

DL-SFA: Deeply-Learned Slow Feature Analysis for Action Recognition.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Generalized Nonconvex Nonsmooth Low-Rank Minimization.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Robust Subspace Segmentation with Block-Diagonal Prior.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Learning Scalable Discriminative Dictionary with Sample Relatedness.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Towards Unified Human Parsing and Pose Estimation.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Multi-instance Learning Using Information Entropy Theory for Image Retrieval.
Proceedings of the 17th IEEE International Conference on Computational Science and Engineering, 2014

Robust Scene Classification with Cross-Level LLC Coding on CNN Features.
Proceedings of the Computer Vision - ACCV 2014, 2014

Multiple Ocular Diseases Classification with Graph Regularized Probabilistic Multi-label Learning.
Proceedings of the Computer Vision - ACCV 2014, 2014

Supervised Hashing for Image Retrieval via Image Representation Learning.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Proximal Iteratively Reweighted Algorithm with Multiple Splitting for Nonconvex Sparsity Optimization.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Similarity-Preserving Binary Signature for Linear Subspaces.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Scalable Low-Rank Representation.
Proceedings of the Low-Rank and Sparse Modeling for Visual Analysis, 2014

Latent Low-Rank Representation.
Proceedings of the Low-Rank and Sparse Modeling for Visual Analysis, 2014

Community Understanding in Location-based Social Networks.
Proceedings of the Human-Centered Social Media Analytics, 2014

2013
GPSView: A scenic driving route planner.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Detecting profilable and overlapping communities with user-generated multimedia contents in LBSNs.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Towards optimizing human labeling for interactive image tagging.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Towards decrypting attractiveness via multi-modality cues.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Image retrieval with query-adaptive hashing.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Robust image annotation via simultaneous feature and sample outlier pursuit.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Large-scale multilabel propagation based on efficient sparse graph construction.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Learning to Photograph: A Compositional Perspective.
IEEE Trans. Multim., 2013

Image Re-Attentionizing.
IEEE Trans. Multim., 2013

VideoPuzzle: Descriptive One-Shot Video Composition.
IEEE Trans. Multim., 2013

Discovering Discriminative Graphlets for Aerial Image Categories Recognition.
IEEE Trans. Image Process., 2013

Image Classification via Object-Aware Holistic Superpixel Selection.
IEEE Trans. Image Process., 2013

Linear Distance Coding for Image Classification.
IEEE Trans. Image Process., 2013

High-Order Local Spatial Context Modeling by Spatialized Random Forest.
IEEE Trans. Image Process., 2013

Robust Image Analysis With Sparse Representation on Quantized Visual Features.
IEEE Trans. Image Process., 2013

General Subspace Learning With Corrupted Training Data Via Graph Embedding.
IEEE Trans. Image Process., 2013

Pairwise Sparsity Preserving Embedding for Unsupervised Subspace Learning and Classification.
IEEE Trans. Image Process., 2013

Multilevel Depth and Image Fusion for Human Activity Detection.
IEEE Trans. Cybern., 2013

Improving Bottom-up Saliency Detection by Looking into Neighbors.
IEEE Trans. Circuits Syst. Video Technol., 2013

Label-specific training set construction from web resource for image annotation.
Signal Process., 2013

Forward Basis Selection for Pursuing Sparse Representations over a Dictionary.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Robust Recovery of Subspace Structures by Low-Rank Representation.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Fast Detection of Dense Subgraphs with Iterative Shrinking and Expansion.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Sparse representations for image and video analysis.
J. Vis. Commun. Image Represent., 2013

Revealing Cluster Structure of Graph by Path Following Replicator Dynamic
CoRR, 2013

A novel image tag saliency ranking algorithm based on sparse representation.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Multimedia recommendation: technology and techniques.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Online Robust PCA via Stochastic Optimization.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Online PCA for Contaminated Data.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval.
Proceedings of the ACM Multimedia Conference, 2013

Cross-media semantic representation via bi-directional learning to rank.
Proceedings of the ACM Multimedia Conference, 2013

Static saliency vs. dynamic saliency: a comparative study.
Proceedings of the ACM Multimedia Conference, 2013

Scale based region growing for scene text detection.
Proceedings of the ACM Multimedia Conference, 2013

eHeritage of shadow puppetry: creation and manipulation.
Proceedings of the ACM Multimedia Conference, 2013

Towards efficient sparse coding for scalable image annotation.
Proceedings of the ACM Multimedia Conference, 2013

Spatio-temporal fisher vector coding for surveillance event detection.
Proceedings of the ACM Multimedia Conference, 2013

Robust image representation and decomposition by Laplacian regularized latent low-rank representation.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Min-Max Hash for Jaccard Similarity.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

How Related Exemplars Help Complex Event Detection in Web Videos?
Proceedings of the IEEE International Conference on Computer Vision, 2013

Robust Object Tracking with Online Multi-lifespan Dictionary Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Hierarchical Part Matching for Fine-Grained Visual Categorization.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Semantic Segmentation without Annotating Segments.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Correntropy Induced L2 Graph for Robust Subspace Clustering.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Correlation Adaptive Subspace Segmentation by Trace Lasso.
Proceedings of the IEEE International Conference on Computer Vision, 2013

A Deformable Mixture Parsing Model with Parselets.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Incorporating Structural Alternatives and Sharing into Hierarchy for Multiclass Object Recognition and Detection.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

A Divide-and-Conquer Method for Scalable Low-Rank Latent Matrix Pursuit.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Complex Event Detection via Multi-source Video Attributes.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Compressed Hashing.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Subcategory-Aware Object Classification.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Efficient Maximum Appearance Search for Large-Scale Object Detection.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Perception Preserving Projections.
Proceedings of the British Machine Vision Conference, 2013

Magic Mirror: An Intelligent Fashion Recommendation System.
Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

Rank Aggregation via Low-Rank and Structured-Sparse Decomposition.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Robust Multiperson Detection and Tracking for Mobile Service and Social Robots.
IEEE Trans. Syst. Man Cybern. Part B, 2012

Label-to-region with continuity-biased bi-layer sparsity priors.
ACM Trans. Multim. Comput. Commun. Appl., 2012

Image label completion by pursuing contextual decomposability.
ACM Trans. Multim. Comput. Commun. Appl., 2012

Special Section on Object and Event Classification in Large-Scale Video Collections.
IEEE Trans. Multim., 2012

Movie2Comics: Towards a Lively Video Content Presentation.
IEEE Trans. Multim., 2012

Event Driven Web Video Summarization by Tag Localization and Key-Shot Identification.
IEEE Trans. Multim., 2012

Weakly Supervised Graph Propagation Towards Collective Image Parsing.
IEEE Trans. Multim., 2012

Purposive Hidden-Object-Game: Embedding Human Computation in Popular Game.
IEEE Trans. Multim., 2012

Hidden-Concept Driven Multilabel Image Annotation and Label Ranking.
IEEE Trans. Multim., 2012

Learning a Propagable Graph for Semisupervised Learning: Classification and Regression.
IEEE Trans. Knowl. Data Eng., 2012

Visual Classification With Multitask Joint Sparse Representation.
IEEE Trans. Image Process., 2012

Saliency Detection by Multitask Sparsity Pursuit.
IEEE Trans. Image Process., 2012

Camera Constraint-Free View-Based 3-D Object Retrieval.
IEEE Trans. Image Process., 2012

Histogram Contextualization.
IEEE Trans. Image Process., 2012

Inductive Robust Principal Component Analysis.
IEEE Trans. Image Process., 2012

Nondegenerate Piecewise Linear Systems: A Finite Newton Algorithm and Applications in Machine Learning.
Neural Comput., 2012

Active Subspace: Toward Scalable Low-Rank Learning.
Neural Comput., 2012

Forward Basis Selection for Sparse Approximation over Dictionary.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Exact Subspace Segmentation and Outlier Detection by Low-Rank Representation.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

A unified supervised codebook learning framework for classification.
Neurocomputing, 2012

Multimedia semantics-aware query-adaptive hashing with bits reconfigurability.
Int. J. Multim. Inf. Retr., 2012

Dense Neighborhoods on Affinity Graph.
Int. J. Comput. Vis., 2012

Epitome for Automatic Image Colorization
CoRR, 2012


Modeling concept dynamics for large scale music search.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

Automatic User Preference Elicitation for Music Recommendation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Super-Bit Locality-Sensitive Hashing.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Attribute feedback.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Touch saliency.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Annotating web images using NOVA: NOn-conVex group spArsity.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Don't ask me what i'm like, just watch and listen.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Multimedia recommendation.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

MoViMash: online mobile video mashup.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Harvesting visual concepts for image search with complex queries.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

3DME: 3D media express from RGB-D images.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Sense beauty via face, dressing, and/or voice.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Hybrid social media network.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Street-to-shop: cross-scenario clothing retrieval via parts alignment and auxiliary set.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Hi, magic closet, tell me what to wear!
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Hi, magic closet, tell me what to wear!
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Robust PCA in High-dimension: A Deterministic Approach.
Proceedings of the 29th International Conference on Machine Learning, 2012

Image Super-Resolution via Low-Pass Filter Based Multi-scale Image Decomposition.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Recognizing emotions of characters in movies.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Segmentation over Detection by Coupled Global and Local Sparse Representations.
Proceedings of the Computer Vision - ECCV 2012, 2012

Order-Preserving Sparse Coding for Sequence Classification.
Proceedings of the Computer Vision - ECCV 2012, 2012

Robust and Efficient Subspace Segmentation via Least Squares Regression.
Proceedings of the Computer Vision - ECCV 2012, 2012

Object-Layout-Aware Image Retrieval for Personal Album Management.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Depth Matters: Influence of Depth Cues on Visual Saliency.
Proceedings of the Computer Vision - ECCV 2012, 2012

Auto-Grouped Sparse Representation for Visual Analysis.
Proceedings of the Computer Vision - ECCV 2012, 2012

Generalizing Wiberg algorithm for rigid and nonrigid factorizations with missing components and metric constraints.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Practical low-rank matrix approximation under robust L1-norm.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Robust Non-negative Graph Embedding: Towards noisy data, unreliable graphs, and noisy labels.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Omni-range spatial contexts for visual classification.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Efficient structure detection via random consensus graph.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Street-to-shop: Cross-scenario clothing retrieval via parts alignment and auxiliary set.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Hierarchical matching with side information for image classification.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Near-duplicate keyframe retrieval by semi-supervised learning and nonrigid image matching.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Video accessibility enhancement for hearing-impaired users.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Beyond search: Event-driven summarization for web videos.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Web Image and Video Mining Towards Universal and Robust Age Estimator.
IEEE Trans. Multim., 2011

Image Retagging Using Collaborative Tag Propagation.
IEEE Trans. Multim., 2011

Automated Assembly of Shredded Pieces From Multiple Photos.
IEEE Trans. Multim., 2011

Trace-Oriented Feature Analysis for Large-Scale Text Data Dimension Reduction.
IEEE Trans. Knowl. Data Eng., 2011

Recognizing pair-activities by causality analysis.
ACM Trans. Intell. Syst. Technol., 2011

Image annotation by <i>k</i>NN-sparse graph-based label propagation over noisily tagged web images.
ACM Trans. Intell. Syst. Technol., 2011

Assemble New Object Detector With Few Examples.
IEEE Trans. Image Process., 2011

Transferring Boosted Detectors Towards Viewpoint and Scene Adaptiveness.
IEEE Trans. Image Process., 2011

Image Decomposition With Multilabel Context: Algorithms and Applications.
IEEE Trans. Image Process., 2011

Integrating Spatio-Temporal Context With Multiview Representation for Object Recognition in Visual Surveillance.
IEEE Trans. Circuits Syst. Video Technol., 2011

Adaptive Object Tracking by Learning Hybrid Template Online.
IEEE Trans. Circuits Syst. Video Technol., 2011

Correntropy based feature selection using binary projection.
Pattern Recognit., 2011

Active learning with adaptive regularization.
Pattern Recognit., 2011

Efficient region-aware large graph construction towards scalable multi-label propagation.
Pattern Recognit., 2011

A Finite Newton Algorithm for Non-degenerate Piecewise Linear Systems.
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011

Special issue on feature-oriented image and video computing for extracting contexts and semantics.
Comput. Vis. Image Underst., 2011

Label-Specific Training Set Construction from Web Resource for Image Annotation
CoRR, 2011

Sparse hidden-dynamics conditional random fields for user intent understanding.
Proceedings of the 20th International Conference on World Wide Web, 2011

Generative Group Activity Analysis with Quaternion Descriptor.
Proceedings of the Advances in Multimedia Modeling, 2011

Multi-actor Emotion Recognition in Movies Using a Bimodal Approach.
Proceedings of the Advances in Multimedia Modeling, 2011

Multimedia tagging: past, present and future.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Purposive hidden-object-game: embedding human computation in popular game.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Next photo please: towards visually consistent sequential photo browsing.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Snap & play: auto-generate personalized find-the-difference mobile game.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Tag-based social image search with visual-text joint hypergraph learning.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Purposive hidden-object game (P-HOG) towards imperceptible human computation.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Human group activity analysis with fusion of motion and appearance information.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards multi-semantic image annotation with graph regularized exclusive group lasso.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Learning reconfigurable hashing for diverse semantics.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

Probabilistic indexing of media sequences.
Proceedings of the ICIMCS 2011, 2011

Towards Optimal Discriminating Order for Multiclass Classification.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Cross Domain Random Walk for Query Intent Pattern Mining from Search Engine Log.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Predicting occupation via human clothing and contexts.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Learning universal multi-view age estimator using video context.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Multi-class semi-supervised SVMs with Positiveness Exclusive Regularization.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Latent Low-Rank Representation for subspace segmentation and feature extraction.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Multi-task low-rank affinity pursuit for image segmentation.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Multi-label visual classification with label exclusive context.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Accumulated motion images for facial expression recognition in videos.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

Contextualizing object detection and classification.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Accelerated low-rank visual recovery by random projection.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Segment an image by looking into an image corpus.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Geometric ℓp-norm feature pooling for image classification.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Learning to rank audience for behavioral targeting in display ads.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Efficient Subspace Segmentation via Quadratic Programming.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Size Adaptive Selection of Most Informative Features.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Information-Theoretic Analysis of Input Strokes in Visual Object Cutout.
IEEE Trans. Multim., 2010

Image Clustering Using Local Discriminant Models and Global Integration.
IEEE Trans. Image Process., 2010

Misalignment-Robust Face Recognition.
IEEE Trans. Image Process., 2010

Projective Nonnegative Graph Embedding.
IEEE Trans. Image Process., 2010

Learning With ℓ<sup>1</sup>-Graph for Image Analysis.
IEEE Trans. Image Process., 2010

Near Duplicate Identification With Spatially Aligned Pyramid Matching.
IEEE Trans. Circuits Syst. Video Technol., 2010

Sparse Representation for Computer Vision and Pattern Recognition.
Proc. IEEE, 2010

Closed-Form Solutions to A Category of Nuclear Norm Minimization Problems
CoRR, 2010

Selective Image Super-Resolution
CoRR, 2010

TRECVID 2010 Known-item Search by NUS.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

A co-learning framework for learning user search intents from rule-generated training data.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Effective music tagging through advanced statistical modeling.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Robust Clustering as Ensembles of Affinity Relations.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Learning Cooking Techniques from YouTube.
Proceedings of the Advances in Multimedia Modeling, 2010

Image tag refinement towards low-rank, content-tag prior and error sparsity.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Cast2Face: character identification in movie with actor-character correspondence.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

One person labels one million images.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Unified tag analysis with multi-edge graph.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Image segmentation with patch-pair density priors.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Movie2Comics: a feast of multimedia artwork.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Dynamic captioning: video accessibility enhancement for hearing impairment.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

iComics: automatic conversion of movie into comics.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Towards a universal detector by mining concepts with small semantic gaps.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Learning to photograph.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Efficient large-scale image annotation by probabilistic collaborative multi-label propagation.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Epitomized Summarization of Wireless Capsule Endoscopic Videos for Efficient Visualization.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention, 2010

Robust Graph Mode Seeking by Graph Shift.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Activity recognition using dense long-duration trajectories.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Auto-generation of professional background music for home-made videos.
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

A Novel Contrast Co-learning Framework for Generating High Quality Training Data.
Proceedings of the ICDM 2010, 2010

Robust Low-Rank Subspace Segmentation with Semidefinite Guarantees.
Proceedings of the ICDMW 2010, 2010

Feature selection for facial expression recognition using deformation modeling.
Proceedings of the Second International Conference on Digital Image Processing, 2010

Multi-View Object Detection by Classifier Interpolation.
Proceedings of the IEEE International Conference on Acoustics, 2010

Randomized Locality Sensitive Vocabularies for Bag-of-Features Model.
Proceedings of the Computer Vision, 2010

Visual classification with multi-task joint sparse representation.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Weakly-supervised hashing in kernel space.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Nonparametric Label-to-Region by search.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Common visual pattern discovery via spatially coherent correspondences.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Sparse representation using nonnegative curds and whey.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Spatialized epitome and its applications.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Factorization towards a classifier.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Learning to rank tags.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Non-Metric Locality-Sensitive Hashing.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009
An online blog reading system by topic clustering and personalized ranking.
ACM Trans. Internet Techn., 2009

Ubiquitously Supervised Subspace Learning.
IEEE Trans. Image Process., 2009

Mode-kn Factor Analysis for Image Ensembles.
IEEE Trans. Image Process., 2009

Synchronized Submanifold Embedding for Person-Independent Pose Estimation and Beyond.
IEEE Trans. Image Process., 2009

Semi-Supervised Bilinear Subspace Learning.
IEEE Trans. Image Process., 2009

Correspondence Propagation with Weak Priors.
IEEE Trans. Image Process., 2009

Introduction to the special issue on Video-based Object and Event Analysis.
Pattern Recognit. Lett., 2009

Enhancing Bilinear Subspace Learning by Element Rearrangement.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Non-Negative Semi-Supervised Learning.
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

ML-fusion based multi-model human detection and tracking for robust human-robot interfaces.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2009), 2009

Temporal query substitution for ad search.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Straightforward Feature Selection for Scalable Latent Semantic Indexing.
Proceedings of the SIAM International Conference on Data Mining, 2009

Semi-supervised Learning by Sparse Representation.
Proceedings of the SIAM International Conference on Data Mining, 2009

Inferring semantic concepts from community-contributed images and noisy tags.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Web image mining towards universal age estimator.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Label to region by bi-layer sparsity priors.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Event driven summarization for web videos.
Proceedings of the first SIGMM workshop on Social media, 2009

Probabilistic latent semantic user segmentation for behavioral targeted advertising.
Proceedings of the 3rd ACM SIGKDD Workshop on Data Mining and Audience Intelligence for Advertising, 2009

Semi-Supervised Classification on Evolutionary Data.
Proceedings of the IJCAI 2009, 2009

Local-driven semi-supervised learning with multi-label.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Hidden-concept driven image decomposition towards semi-supervised multi-label image annotation.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Synthesizing Novel Dimension Reduction Algorithms in Matrix Trace Oriented Optimization Framework.
Proceedings of the ICDM 2009, 2009

Unified Solution to Nonnegative Data Factorization Problems.
Proceedings of the ICDM 2009, 2009

An HOG-LBP human detector with partial occlusion handling.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Action detection in complex scenes with spatial and temporal ambiguities.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Large scale natural image classification by sparsity exploration.
Proceedings of the IEEE International Conference on Acoustics, 2009

Directed Markov Stationary Features for visual classification.
Proceedings of the IEEE International Conference on Acoustics, 2009

Multi-label sparse coding for automatic image annotation.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Multiplicative nonnegative graph embedding.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Hierarchical spatio-temporal context modeling for action recognition.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Contextualizing histogram.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Recognizing human group activities with localized causalities.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Contextual decomposition of multi-label images.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
Contextual motion field-based distance for video analysis.
Vis. Comput., 2008

Gait Components and Their Application to Gender Recognition.
IEEE Trans. Syst. Man Cybern. Part C, 2008

Discriminant Locally Linear Embedding With High-Order Tensor Data.
IEEE Trans. Syst. Man Cybern. Part B, 2008

Matrix-Variate Factor Analysis and Its Applications.
IEEE Trans. Neural Networks, 2008

Face Recognition Using Spatially Constrained Earth Mover's Distance.
IEEE Trans. Image Process., 2008

Regression From Uncertain Labels and Its Applications to Soft Biometrics.
IEEE Trans. Inf. Forensics Secur., 2008

Classification and Feature Extraction by Simplexization.
IEEE Trans. Inf. Forensics Secur., 2008

Reconstruction and Recognition of Tensor-Based Objects With Concurrent Subspaces Analysis.
IEEE Trans. Circuits Syst. Video Technol., 2008

Convergent 2-D Subspace Learning With Null Space Analysis.
IEEE Trans. Circuits Syst. Video Technol., 2008

Correlation Metric for Generalized Feature Extraction.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Near-duplicate keyframe retrieval by nonrigid image matching.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

SIFT-Bag kernel for video event analysis.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Categorizing bi-object video activities using bag of segments and causality features.
Proceedings of the 1st ACM Workshop on Vision Networks for Behavior Analysis, 2008

Real-time human action recognition by luminance field trajectory analysis.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Learning the Latent Semantic Space for Ranking in Text Retrieval.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Learning by Propagability.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Web Query Prediction by Unifying Model.
Proceedings of the Workshops Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Extracting age information from local spatially flexible patches.
Proceedings of the IEEE International Conference on Acoustics, 2008

Flexible X-Y patches for face recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

Discriminant simplex analysis.
Proceedings of the IEEE International Conference on Acoustics, 2008

Pair-activity classification by bi-trajectories analysis.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Non-negative graph embedding.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Regression from patch-kernel.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Near duplicate image identification with patially Aligned Pyramid Matching.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Discriminative local binary patterns for human detection in personal album.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Trace Ratio Criterion for Feature Selection.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007
Rank-One Projections With Adaptive Margins for Face Recognition.
IEEE Trans. Syst. Man Cybern. Part B, 2007

Bilinear Analysis for Kernel Selection and Nonlinear Feature Extraction.
IEEE Trans. Neural Networks, 2007

Multilinear Discriminant Analysis for Face Recognition.
IEEE Trans. Image Process., 2007

Face Verification With Balanced Thresholds.
IEEE Trans. Image Process., 2007

Formulating Face Verification With Semidefinite Programming.
IEEE Trans. Image Process., 2007

Marginal Fisher Analysis and Its Variants for Human Gait Recognition and Content- Based Image Retrieval.
IEEE Trans. Image Process., 2007

A Parameter-Free Framework for General Supervised Subspace Learning.
IEEE Trans. Inf. Forensics Secur., 2007

Nonlinear Discriminant Analysis on Embedded Manifold.
IEEE Trans. Circuits Syst. Video Technol., 2007

Graph Embedding and Extensions: A General Framework for Dimensionality Reduction.
IEEE Trans. Pattern Anal. Mach. Intell., 2007

Face Recognition - a Generalized Marginal Fisher Analysis Approach.
Int. J. Image Graph., 2007

Ranking with uncertain labels and its applications.
Frontiers Comput. Sci. China, 2007

Causal relation of queries from temporal logs.
Proceedings of the 16th International Conference on World Wide Web, 2007

A Convengent Solution to Tensor Subspace Learning.
Proceedings of the IJCAI 2007, 2007

Transductive regression piloted by inter-manifold relations.
Proceedings of the Machine Learning, 2007

Detecting Anomaly in Videos from Trajectory Similarity Analysis.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Ranking with Uncertain Labels.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Local Word Bag Model for Text Categorization.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

Learning Auto-Structured Regressor from Uncertain Nonnegative Labels.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Exploring Feature Descritors for Face Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007

Element Rearrangement for Tensor-Based Subspace Learning.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Trace Ratio vs. Ratio Trace for Dimensionality Reduction.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Learning a Person-Independent Representation for Precise 3D Pose Estimation.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006
Effective and Efficient Dimensionality Reduction for Large-Scale and Streaming Data Preprocessing.
IEEE Trans. Knowl. Data Eng., 2006

Human Gait Recognition With Matrix Representation.
IEEE Trans. Circuits Syst. Video Technol., 2006

A scalable supervised algorithm for dimensionality reduction on streaming data.
Inf. Sci., 2006

Maximum unfolded embedding: formulation, solution, and application for image clustering.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Perspective Symmetry Invariant and Its Applications.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Dimensionality Reduction with Adaptive Kernels.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Locally adaptive classification piloted by uncertainty.
Proceedings of the Machine Learning, 2006

A Novel Scalable Algorithm for Supervised Subspace Learning.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

Trace Quotient Problems Revisited.
Proceedings of the Computer Vision, 2006

Learning Semantic Patterns with Discriminant Localized Binary Projections.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Pursuing Informative Projection on Grassmann Manifold.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2005
Efficient 3D reconstruction for face recognition.
Pattern Recognit., 2005

Face Recognition Using Laplacianfaces.
IEEE Trans. Pattern Anal. Mach. Intell., 2005

Robust Non-Frontal Face Alignment with Edge Based Texture.
J. Comput. Sci. Technol., 2005

Learning quantifiable associations via principal sparse non-negative matrix factorization.
Intell. Data Anal., 2005

OCFS: optimal orthogonal centroid feature selection for text categorization.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Parallel Image Matrix Compression for Face Recognition.
Proceedings of the 11th International Conference on Multi Media Modeling (MMM 2005), 2005

Realistic 3D Face Modeling by Fusing Multiple 2D Images.
Proceedings of the 11th International Conference on Multi Media Modeling (MMM 2005), 2005

Largest-eigenvalue-theory for incremental principal component analysis.
Proceedings of the 2005 International Conference on Image Processing, 2005

Comparative study: face recognition on unspecific persons using linear subspace methods.
Proceedings of the 2005 International Conference on Image Processing, 2005

Feedback-based dynamic generalized LDA for face recognition.
Proceedings of the 2005 International Conference on Image Processing, 2005

Tensor-based factor decomposition for relighting.
Proceedings of the 2005 International Conference on Image Processing, 2005

Neighborhood Preserving Embedding.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Fisher+Kernel Criterion for Discriminant Analysis.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Coupled Kernel-Based Subspace Learning.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Graph Embedding: A General Framework for Dimensionality Reduction.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Discriminant Analysis with Tensor Representation.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Concurrent Subspaces Analysis.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004
Bayesian shape localization for face recognition using global and local textures.
IEEE Trans. Circuits Syst. Video Technol., 2004

Link fusion: a unified link analysis framework for multi-type interrelated data objects.
Proceedings of the 13th international conference on World Wide Web, 2004

Automatic, Effective, and Efficient 3D Face Reconstruction from Arbitrary View Image.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

IMMC: incremental maximum margin criterion.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

Mining Ratio Rules Via Principal Sparse Non-Negative Matrix Factorization.
Proceedings of the 4th IEEE International Conference on Data Mining (ICDM 2004), 2004

Online Supervised Learning for Digital Library.
Proceedings of the Digital Libraries: International Collaboration and Cross-Fertilization, 2004

Automatic 3D Reconstruction for Face Recognition.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004

Discriminant Analysis on Embedded Manifold.
Proceedings of the Computer Vision, 2004

Learning similarity measures in non-orthogonal space.
Proceedings of the 2004 ACM CIKM International Conference on Information and Knowledge Management, 2004

Efficient PageRank with Same Out-Link Groups.
Proceedings of the Information Retrieval Technology, Asia Information Retrieval Symposium, 2004

2003
Face alignment using texture-constrained active shape models.
Image Vis. Comput., 2003

Face alignment using view-based direct appearance models.
Int. J. Imaging Syst. Technol., 2003

Ranking Prior Likelihood Distributions for Bayesian Shape Localization Framework.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Learning a Locality Preserving Subspace for Visual Recognition.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

2002
Multi-Class SVM Classifier Based on Pairwise Coupling.
Proceedings of the Pattern Recognition with Support Vector Machines, 2002

Multi-View Face Alignment Using Direct Appearance Models.
Proceedings of the 5th IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2002), 2002


  Loading...