Jiashi Feng

Orcid: 0000-0001-6843-0064

According to our database, Jiashi Feng authored at least 467 papers between 2010 and 2024.

Bibliography

2024
ManiCLIP: Multi-attribute Face Manipulation from Text.
Int. J. Comput. Vis., October, 2024

Contrastive Masked Autoencoders are Stronger Vision Learners.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

MetaFormer Baselines for Vision.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

Loong: Generating Minute-level Long Videos with Autoregressive Language Models.
CoRR, 2024

High Quality Human Image Animation using Regional Supervision and Motion Blur Condition.
CoRR, 2024

Hierarchical Memory for Long Video QA.
CoRR, 2024

Depth Anything V2.
CoRR, 2024

Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams.
CoRR, 2024

Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations.
CoRR, 2024

DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention.
CoRR, 2024

InstaDrag: Lightning Fast and Accurate Drag-based Image Editing Emerging from Videos.
CoRR, 2024

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator.
CoRR, 2024

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation.
CoRR, 2024

PLLaVA: Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning.
CoRR, 2024

Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion.
CoRR, 2024

Magic-Me: Identity-Specific Video Customized Diffusion.
CoRR, 2024

MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation.
CoRR, 2024

AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

COSA: Concatenated Sample Pretrained Vision-Language Foundation Model.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

EPIM: Efficient Processing-In-Memory Accelerators based on Epitome.
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PixelLM: Pixel Reasoning with Large Multimodal Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Vista-LLaMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Video Recognition in Portrait Mode.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Token Selection is a Simple Booster for Vision Transformers.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Deep Long-Tailed Learning: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

Learning to Augment Poses for 3D Human Pose Estimation in Images and Videos.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

VOLO: Vision Outlooker for Visual Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Vision Permutator: A Permutable MLP-Like Architecture for Visual Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Harnessing Diffusion Models for Visual Perception with Meta Prompts.
CoRR, 2023

DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation.
CoRR, 2023

Towards Accurate Guided Diffusion Sampling through Symplectic Adjoint Method.
CoRR, 2023

Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens.
CoRR, 2023

AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text.
CoRR, 2023

ChatAnything: Facetime Chat with LLM-Enhanced Personas.
CoRR, 2023

Low-Resolution Self-Attention for Semantic Segmentation.
CoRR, 2023

MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask.
CoRR, 2023

MagicProp: Diffusion-based Video Editing via Motion-aware Appearance Propagation.
CoRR, 2023

MagicEdit: High-Fidelity and Temporally Coherent Video Editing.
CoRR, 2023

MagicAvatar: Multimodal Avatar Generation and Animation.
CoRR, 2023

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs.
CoRR, 2023

Delving Deeper into Data Scaling in Masked Image Modeling.
CoRR, 2023

VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending.
CoRR, 2023

Associating Spatially-Consistent Grouping with Text-supervised Semantic Segmentation.
CoRR, 2023

AgileGAN3D: Few-Shot 3D Portrait Stylization by Augmented Transfer Learning.
CoRR, 2023

Multimodal Video Adapter for Parameter Efficient Video Text Retrieval.
CoRR, 2023

Temporal Perceiving Video-Language Pre-training.
CoRR, 2023

CMAE-V: Contrastive Masked Autoencoders for Video Action Recognition.
CoRR, 2023

Expanding Small-Scale Datasets with Guided Imagination.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

XAGen: 3D Expressive Human Avatars Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient.
Proceedings of the International Conference on Machine Learning, 2023

Reachability-Aware Laplacian Representation in Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

Divide to Adapt: Mitigating Confirmation Bias for Domain Adaptation of Black-Box Predictors.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

PV3D: A 3D Generative Model for Portrait Video Generation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Revisiting Intrinsic Reward for Exploration in Procedurally Generated Environments.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Dataset Quantization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

GETAvatar: Generative Textured Meshes for Animatable Human Avatars.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Global Knowledge Calibration for Fast Open-Vocabulary Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Diffusion Probabilistic Model Made Slim.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

OmniAvatar: Geometry-Guided Controllable 3D Head Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Revisiting Temporal Modeling for CLIP-Based Image-to-Video Knowledge Transferring.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Clover: Towards A Unified Video-Language Alignment and Fusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DOAD: Decoupled One Stage Action Detection Network.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Contrastive Attention for Video Anomaly Detection.
IEEE Trans. Multim., 2022

DHA: End-to-End Joint Optimization of Data Augmentation Policy, Hyper-parameter and Architecture.
Trans. Mach. Learn. Res., 2022

SODAR: Exploring Locally Aggregated Learning of Mask Representations for Instance Segmentation.
IEEE Trans. Image Process., 2022

Robust Video-Based Person Re-Identification by Hierarchical Mining.
IEEE Trans. Circuits Syst. Video Technol., 2022

Image-to-Video Generation via 3D Facial Dynamics.
IEEE Trans. Circuits Syst. Video Technol., 2022

Joint Face Image Restoration and Frontalization for Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2022

Dense Attentive Feature Enhancement for Salient Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022

Velocity-to-velocity human motion forecasting.
Pattern Recognit., 2022

Towards Age-Invariant Face Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Recurrent Multi-Frame Deraining: Combining Physics Guidance and Adversarial Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

PSGAN++: Robust Detail-Preserving Makeup Transfer and Removal.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Source Data-Absent Unsupervised Domain Adaptation Through Hypothesis Transfer and Labeling Transfer.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Class Prototype-based Cleaner for Label Noise Learning.
CoRR, 2022

Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition.
CoRR, 2022

MagicVideo: Efficient Video Generation With Latent Diffusion Models.
CoRR, 2022

MagicMix: Semantic Mixing with Diffusion Models.
CoRR, 2022

ManiCLIP: Multi-Attribute Face Manipulation from Text.
CoRR, 2022

Clover: Towards A Unified Video-Language Alignment and Fusion Model.
CoRR, 2022

Tyger: Task-Type-Generic Active Learning for Molecular Property Prediction.
CoRR, 2022

SODAR: Segmenting Objects by Dynamically Aggregating Neighboring Mask Representations.
CoRR, 2022

Self-Supervised Aggregation of Diverse Experts for Test-Agnostic Long-Tailed Recognition.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sharpness-Aware Training for Free.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Jointly Modelling Uncertainty and Diversity for Active Molecular Property Prediction.
Proceedings of the Learning on Graphs Conference, 2022

Towards Adversarially Robust Deep Image Denoising.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Understanding The Robustness in Vision Transformers.
Proceedings of the International Conference on Machine Learning, 2022

The Geometry of Robust Value Functions.
Proceedings of the International Conference on Machine Learning, 2022

How Well Does Self-Supervised Pre-Training Perform with Streaming Data?
Proceedings of the Tenth International Conference on Learning Representations, 2022

Generalizing Few-Shot NAS with Gradient Matching.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Efficient Sharpness-aware Minimization for Improved Training of Neural Networks.
Proceedings of the Tenth International Conference on Learning Representations, 2022

AvatarGen: A 3D Generative Model for Animatable Human Avatars.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Slim Scissors: Segmenting Thin Object from Synthetic Background.
Proceedings of the Computer Vision - ECCV 2022, 2022

Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering.
Proceedings of the Computer Vision - ECCV 2022, 2022

MetaFormer is Actually What You Need for Vision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Shunted Self-Attention via Multi-Scale Token Aggregation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DINE: Domain Adaptation from Single and Multiple Black-box Predictors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Multi-human Parsing with a Graph-based Generative Adversarial Model.
ACM Trans. Multim. Comput. Commun. Appl., 2021

3D Face Reconstruction From A Single Image Assisted by 2D Face Images in the Wild.
IEEE Trans. Multim., 2021

PVRED: A Position-Velocity Recurrent Encoder-Decoder for Human Motion Prediction.
IEEE Trans. Image Process., 2021

Spatial-Aware Texture Transformer for High-Fidelity Garment Transfer.
IEEE Trans. Image Process., 2021

Cross-Layer Feature Pyramid Network for Salient Object Detection.
IEEE Trans. Image Process., 2021

Detail Preserving Coarse-to-Fine Matching for Stereo Matching and Optical Flow.
IEEE Trans. Image Process., 2021

Heterogeneous Domain Adaptation via Covariance Structured Feature Translators.
IEEE Trans. Cybern., 2021

Faster First-Order Methods for Stochastic Non-Convex Optimization on Riemannian Manifolds.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Tensor Low-Rank Representation for Data Recovery and Clustering.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Anytime Recognition with Routing Convolutional Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Deep spatio-frequency saliency detection.
Neurocomputing, 2021

UMAD: Universal Model Adaptation under Domain and Category Shift.
CoRR, 2021

Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering.
CoRR, 2021

Triplet Contrastive Learning for Brain Tumor Classification.
CoRR, 2021

Test-Agnostic Long-Tailed Recognition by Test-Time Aggregating Diverse Experts with Self-Supervision.
CoRR, 2021

Refiner: Refining Self-attention for Vision Transformers.
CoRR, 2021

How Well Self-Supervised Pre-Training Performs with Streaming Data?
CoRR, 2021

Token Labeling: Training a 85.4% Top-1 Accuracy Vision Transformer with 56M Parameters on ImageNet.
CoRR, 2021

Distill and Fine-tune: Effective Adaptation from a Black-box Source Model.
CoRR, 2021

Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation.
CoRR, 2021

DeepViT: Towards Deeper Vision Transformer.
CoRR, 2021

Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet.
CoRR, 2021

Visual Relationship Detection With Visual-Linguistic Knowledge From Multimodal Representations.
IEEE Access, 2021

DANCE: A Deep Attentive Contour Model for Efficient Instance Segmentation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Deep Interactive Thin Object Selection.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Task similarity aware meta learning: theory-inspired improvement on MAML.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Direct Multi-view Multi-person 3D Pose Estimation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

No Fear of Heterogeneity: Classifier Calibration for Federated Learning with Non-IID Data.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

All Tokens Matter: Token Labeling for Training Better Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Recovering the Unbiased Scene Graphs from the Biased Ones.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

ModularNAS: Towards Modularized and Reusable Neural Architecture Search.
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

ST-HOI: A Spatial-Temporal Baseline for Human-Object Interaction Detection in Videos.
Proceedings of the ICDAR@ICMR 2021: Proceedings of the 2021 Workshop on Intelligent Cross-Data Analysis and Retrieval, 2021

CIFS: Improving Adversarial Robustness of CNNs via Channel-wise Importance-based Feature Selection.
Proceedings of the 38th International Conference on Machine Learning, 2021

Towards Better Laplacian Representation in Reinforcement Learning with Generalized Graph Drawing.
Proceedings of the 38th International Conference on Machine Learning, 2021

Exploring Balanced Feature Spaces for Representation Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

AutoSpace: Neural Architecture Search with Less Human Interference.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Voxel Transformer for 3D Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

PnP-DETR: Towards Efficient Visual Analysis with Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Body Meshes as Points.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Continual Learning via Bit-Level Information Preserving.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Domain Adaptation With Auxiliary Target Domain-Oriented Classifier.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Coordinate Attention for Efficient Mobile Network Design.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Understanding and Resolving Performance Degradation in Deep Graph Convolutional Networks.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

LV-BERT: Exploiting Layer Variety for BERT.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Deep Clustering With Sample-Assignment Invariance Prior.
IEEE Trans. Neural Networks Learn. Syst., 2020

Deep Subspace Clustering.
IEEE Trans. Neural Networks Learn. Syst., 2020

Dual Adversarial Autoencoders for Clustering.
IEEE Trans. Neural Networks Learn. Syst., 2020

Unsupervised Video Summarization With Cycle-Consistent Adversarial LSTM Networks.
IEEE Trans. Multim., 2020

Learning Generalizable and Identity-Discriminative Representations for Face Anti-Spoofing.
ACM Trans. Intell. Syst. Technol., 2020

PML-LocNet: Improving Object Localization With Prior-Induced Multi-View Learning Network.
IEEE Trans. Image Process., 2020

ORDNet: Capturing Omni-Range Dependencies for Scene Parsing.
IEEE Trans. Image Process., 2020

Temporally Refined Graph U-Nets for Human Shape and Pose Estimation From Monocular Videos.
IEEE Signal Process. Lett., 2020

Deep multi-person kinship matching and recognition for family photos.
Pattern Recognit., 2020

Adaptive ROI generation for video object segmentation using reinforcement learning.
Pattern Recognit., 2020

Joint Rain Detection and Removal from a Single Image with Contextualized Deep Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Online Meta Adaptation for Fast Video Object Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Tensor Robust Principal Component Analysis with a New Tensor Nuclear Norm.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Predicting Alzheimer's disease progression using deep recurrent neural networks.
NeuroImage, 2020

Deep neural networks and kernel regression achieve comparable accuracies for functional connectivity prediction of behavior and demographics.
NeuroImage, 2020

Recognizing Profile Faces by Imagining Frontal View.
Int. J. Comput. Vis., 2020

Fine-Grained Multi-human Parsing.
Int. J. Comput. Vis., 2020

Adversarial images for the primate brain.
CoRR, 2020

Towards Accurate Human Pose Estimation in Videos of Crowded Scenes.
CoRR, 2020

A Simple Baseline for Pose Tracking in Videos of Crowded Scenes.
CoRR, 2020

Toward Accurate Person-level Action Recognition in Videos of Crowded Scenes.
CoRR, 2020

RVL-BERT: Visual Relationship Detection with Visual-Linguistic Knowledge from Pre-trained Representations.
CoRR, 2020

Dual Adversarial Auto-Encoders for Clustering.
CoRR, 2020

Few-shot Classification via Adaptive Attention.
CoRR, 2020

Combating Domain Shift with Self-Taught Labeling.
CoRR, 2020

Local Grid Rendering Networks for 3D Object Detection in Point Clouds.
CoRR, 2020

Multi-Miner: Object-Adaptive Region Mining for Weakly-Supervised Semantic Segmentation.
CoRR, 2020

Effective Training Strategies for Deep Graph Neural Networks.
CoRR, 2020

RAIN: Robust and Accurate Classification Networks with Randomization and Enhancement.
CoRR, 2020

PANDA: Prototypical Unsupervised Domain Adaptation.
CoRR, 2020

MetaSelector: Meta-Learning for Recommendation with User-Level Adaptive Model Selection.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Towards Theoretically Understanding Why SGD Generalizes Better Than Adam in Deep Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Inference Stage Optimization for Cross-scenario 3D Human Pose Estimation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Improving Generalization in Reinforcement Learning with Mixture Regularization.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Residual Distillation: Towards Portable Deep Neural Networks without Shortcuts.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

ConvBERT: Improving BERT with Span-based Dynamic Convolution.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Toward Accurate Person-level Action Recognition in Videos of Crowed Scenes.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Towards Accurate Human Pose Estimation in Videos of Crowded Scenes.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

A Simple Baseline for Pose Tracking in Videos of Crowed Scenes.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Do We Really Need to Access the Source Data? Source Hypothesis Transfer for Unsupervised Domain Adaptation.
Proceedings of the 37th International Conference on Machine Learning, 2020

Neural Epitome Search for Architecture-Agnostic Network Compression.
Proceedings of the 8th International Conference on Learning Representations, 2020

ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning.
Proceedings of the 8th International Conference on Learning Representations, 2020

On Robustness of Neural Ordinary Differential Equations.
Proceedings of the 8th International Conference on Learning Representations, 2020

Decoupling Representation and Classifier for Long-Tailed Recognition.
Proceedings of the 8th International Conference on Learning Representations, 2020

Query-efficient Meta Attack to Deep Neural Networks.
Proceedings of the 8th International Conference on Learning Representations, 2020

Rethinking Bottleneck Structure for Efficient Mobile Network Design.
Proceedings of the Computer Vision - ECCV 2020, 2020

The Devil Is in Classification: A Simple Framework for Long-Tail Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Adversarial Self-supervised Learning for Semi-supervised 3D Action Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

A Balanced and Uncertainty-Aware Approach for Partial Domain Adaptation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Central Similarity Quantization for Efficient Image and Video Retrieval.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Revisiting Knowledge Distillation via Label Smoothing Regularization.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Improving Convolutional Networks With Self-Calibrated Convolutions.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PPDM: Parallel Point Detection and Matching for Real-Time Human-Object Interaction Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Overcoming Classifier Imbalance for Long-Tail Object Detection With Balanced Group Softmax.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Boosting Few-Shot Learning With Adaptive Margin Loss.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Strip Pooling: Rethinking Spatial Pooling for Scene Parsing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Online Robust Low-Rank Tensor Modeling for Streaming Data Analysis.
IEEE Trans. Neural Networks Learn. Syst., 2019

Co-saliency Detection with Graph Matching.
ACM Trans. Intell. Syst. Technol., 2019

Hierarchical Contextual Refinement Networks for Human Pose Estimation.
IEEE Trans. Image Process., 2019

Compressed-Domain Highway Vehicle Counting by Spatial and Temporal Regression.
IEEE Trans. Circuits Syst. Video Technol., 2019

Toward a Comprehensive Face Detector in the Wild.
IEEE Trans. Circuits Syst. Video Technol., 2019

IAN: The Individual Aggregation Network for Person Search.
Pattern Recognit., 2019

3D-Aided Dual-Agent GANs for Unconstrained Face Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Anticipating Where People will Look Using Adversarial Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Recurrent Face Aging with Hierarchical AutoRegressive Memory.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Subspace Clustering by Block Diagonal Representation.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Accelerated Randomized Mirror Descent Algorithms for Composite Non-strongly Convex Optimization.
J. Optim. Theory Appl., 2019

PPDM: Parallel Point Detection and Matching for Real-time Human-Object Interaction Detection.
CoRR, 2019

RC-DARTS: Resource Constrained Differentiable Architecture Search.
CoRR, 2019

Zoom in to where it matters: a hierarchical graph based model for mammogram analysis.
CoRR, 2019

Efficient Differentiable Neural Architecture Search with Meta Kernels.
CoRR, 2019

Classification Calibration for Long-tail Instance Segmentation.
CoRR, 2019

Revisit Knowledge Distillation: a Teacher-free Framework.
CoRR, 2019

Hierarchic Neighbors Embedding.
CoRR, 2019

PSGAN: Pose-Robust Spatial-Aware GAN for Customizable Makeup Transfer.
CoRR, 2019

Central Similarity Hashing via Hadamard matrix.
CoRR, 2019

Deep Model Compression via Filter Auto-sampling.
CoRR, 2019

PVRED: A Position-Velocity Recurrent Encoder-Decoder for Human Motion Prediction.
CoRR, 2019

Unsupervised Image Noise Modeling with Self-Consistent GAN.
CoRR, 2019

Understanding Adversarial Behavior of DNNs by Disentangling Non-Robust and Robust Components in Performance Metric.
CoRR, 2019

Deep Face Recognition Model Compression via Knowledge Transfer and Distillation.
CoRR, 2019

Panoptic Edge Detection.
CoRR, 2019

Cross-Resolution Face Recognition via Prior-Aided Face Hallucination and Residual Knowledge Distillation.
CoRR, 2019

Prototype Reminding for Continual Learning.
CoRR, 2019

Hierarchical Meta Learning.
CoRR, 2019

Foreground-aware Pyramid Reconstruction for Alignment-free Occluded Person Re-identification.
CoRR, 2019

Joint 3D Face Reconstruction and Dense Face Alignment from A Single Image with 2D-Assisted Self-Supervised Learning.
CoRR, 2019

Lift-the-Flap: Context Reasoning Using Object-Centered Graphs.
CoRR, 2019

Deep Reasoning with Multi-scale Context for Salient Object Detection.
CoRR, 2019

Learning Generalizable and Identity-Discriminative Representations for Face Anti-Spoofing.
CoRR, 2019

Temporal Spiking Recurrent Neural Network for Action Recognition.
IEEE Access, 2019

Task Relation Networks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Efficient Meta Learning via Minibatch Proximal Update.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Generalized Majorization-Minimization for Non-Convex Optimization.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Dynamic Feature Fusion for Semantic Edge Detection.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Multi-Prototype Networks for Unconstrained Set-based Face Recognition.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

PANet: Few-Shot Image Semantic Segmentation With Prototype Alignment.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Dynamic Kernel Distillation for Efficient Pose Estimation in Videos.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Single-Stage Multi-Person Pose Machines.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

MultiSeg: Semantically Meaningful, Scale-Diverse Segmentations From Minimal User Input.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Few-Shot Object Detection via Feature Reweighting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Foreground-Aware Pyramid Reconstruction for Alignment-Free Occluded Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Frame-Consistent Recurrent Video Deraining With Dual-Level Flow.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Few-Shot Adaptive Faster R-CNN.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Distilling Object Detectors With Fine-Grained Feature Imitation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

A Simple Pooling-Based Design for Real-Time Salient Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Partial Order Pruning: For Best Speed/Accuracy Trade-Off in Neural Architecture Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Graph-Based Global Reasoning Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Faster First-Order Methods for Stochastic Non-Convex Optimization on Riemannian Manifolds.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Look across Elapse: Disentangled Representation Learning and Photorealistic Cross-Age Face Synthesis for Age-Invariant Face Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Cycle-SUM: Cycle-Consistent Adversarial LSTM Networks for Unsupervised Video Summarization.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Learning to Localize Objects with Noisy Labeled Instances.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Deep Salient Object Detection With Dense Connections and Distraction Diagnosis.
IEEE Trans. Multim., 2018

Scale-Aware Fast R-CNN for Pedestrian Detection.
IEEE Trans. Multim., 2018

Multistage Object Detection With Group Recursive Learning.
IEEE Trans. Multim., 2018

Robust LSTM-Autoencoders for Face De-Occlusion in the Wild.
IEEE Trans. Image Process., 2018

Structured AutoEncoders for Subspace Clustering.
IEEE Trans. Image Process., 2018

Zero-Shot Learning via Attribute Regression and Class Prototype Rectification.
IEEE Trans. Image Process., 2018

Landmark Free Face Attribute Prediction.
IEEE Trans. Image Process., 2018

Video-Based Person Re-Identification With Accumulative Motion Context.
IEEE Trans. Circuits Syst. Video Technol., 2018

Deep Recurrent Regression for Facial Landmark Detection.
IEEE Trans. Circuits Syst. Video Technol., 2018

Learning with rethinking: Recurrently improving convolutional neural networks through feedback.
Pattern Recognit., 2018

A Unified Alternating Direction Method of Multipliers by Majorization Minimization.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Annotation modification for fine-grained visual recognition.
Neurocomputing, 2018

Stochastic Primal-Dual Proximal ExtraGradient descent for compositely regularized optimization.
Neurocomputing, 2018

Subspace Learning by ℓ^0-Induced Sparsity.
Int. J. Comput. Vis., 2018

Video super-resolution based on spatial-temporal recurrent residual networks.
Comput. Vis. Image Underst., 2018

Similarity R-C3D for Few-shot Temporal Activity Detection.
CoRR, 2018

A^2-Nets: Double Attention Networks.
CoRR, 2018

Look Across Elapse: Disentangled Representation Learning and Photorealistic Cross-Age Face Synthesis for Age-Invariant Face Recognition.
CoRR, 2018

What am I searching for?
CoRR, 2018

Finding any Waldo: zero-shot invariant and efficient visual search.
CoRR, 2018

Object Relation Detection Based on One-shot Learning.
CoRR, 2018

TS2C: Tight Box Mining with Surrounding Segmentation Context for Weakly Supervised Object Detection.
CoRR, 2018

Learning Pixel-wise Labeling from the Internet without Human Interaction.
CoRR, 2018

Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing.
CoRR, 2018

Transferable Meta Learning Across Domains.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

Modeling Alzheimer's disease progression using deep recurrent neural networks.
Proceedings of the 2018 International Workshop on Pattern Recognition in Neuroimaging, 2018

Is deep learning better than kernel regression for functional connectivity prediction of fluid intelligence?
Proceedings of the 2018 International Workshop on Pattern Recognition in Neuroimaging, 2018

Efficient Stochastic Gradient Hard Thresholding.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

New Insight into Hybrid Stochastic Gradient Descent: Beyond With-Replacement Sampling and Convexity.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

A^2-Nets: Double Attention Networks.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Multi-View Image Generation from a Single-View.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Deep Learning for Multimedia: Science or Technology?
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Multi-Human Parsing Machines.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Egocentric Spatial Memory.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

3D-Aided Deep Pose-Invariant Face Recognition.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Exact Low Tubal Rank Tensor Recovery from Gaussian Measurements.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Sharing Residual Units Through Collective Tensor Factorization To Improve Deep Neural Networks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Denser Trajectories of Anchor Points for Action Recognition.
Proceedings of the 12th International Conference on Ubiquitous Information Management and Communication, 2018

Understanding Generalization and Optimization Performance of Deep CNNs.
Proceedings of the 35th International Conference on Machine Learning, 2018

Policy Optimization with Demonstrations.
Proceedings of the 35th International Conference on Machine Learning, 2018

WSNet: Compact and Efficient Networks Through Weight Sampling.
Proceedings of the 35th International Conference on Machine Learning, 2018

Empirical Risk Landscape Analysis for Understanding Deep Neural Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Ensemble Robustness and Generalization of Stochastic Deep Learning Algorithms.
Proceedings of the 6th International Conference on Learning Representations, 2018

Exploiting Spatio-Temporal Correlations with Multiple 3D Convolutional Neural Networks for Citywide Vehicle Flow Prediction.
Proceedings of the IEEE International Conference on Data Mining, 2018

Dynamic Conditional Networks for Few-Shot Learning.
Proceedings of the Computer Vision - ECCV 2018, 2018

ML-LocNet: Improving Object Localization with Multi-view Learning Network.
Proceedings of the Computer Vision - ECCV 2018, 2018

Attention-Aware Deep Adversarial Hashing for Cross-Modal Retrieval.
Proceedings of the Computer Vision - ECCV 2018, 2018

TS^2C: Tight Box Mining with Surrounding Segmentation Context for Weakly Supervised Object Detection.
Proceedings of the Computer Vision - ECCV 2018, 2018

Mutual Learning to Adapt for Joint Human Parsing and Pose Estimation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Pose Partition Networks for Multi-person Pose Estimation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Multi-fiber Networks for Video Recognition.
Proceedings of the Computer Vision - ECCV 2018, 2018

Deep Adversarial Subspace Clustering.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Towards Pose Invariant Face Recognition in the Wild.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Adversarial Complementary Learning for Weakly Supervised Object Localization.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

MoNet: Deep Motion Exploitation for Video Object Segmentation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Human Pose Estimation With Parsing Induced Learner.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Markov Clustering Networks for Scene Text Detection.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Left-Right Comparative Recurrent Model for Stereo Matching.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Zigzag Learning for Weakly Supervised Object Detection.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Weakly Supervised Phrase Localization With Multi-Scale Anchored Transformer Network.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Better Guider Predicts Future Better: Difference Guided Generative Adversarial Networks.
Proceedings of the Computer Vision - ACCV 2018, 2018

Transferable Semi-Supervised Semantic Segmentation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Nonconvex Sparse Spectral Clustering by Alternating Direction Method of Multipliers and Its Convergence Analysis.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Cross-Domain Human Parsing via Adversarial Feature and Label Adaptation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Correlation Alignment for Unsupervised Domain Adaptation.
Proceedings of the Domain Adaptation in Computer Vision Applications, 2017

Diversified Visual Attention Networks for Fine-Grained Object Classification.
IEEE Trans. Multim., 2017

Attentive Contexts for Object Detection.
IEEE Trans. Multim., 2017

Human Facial Age Estimation by Cost-Sensitive Label Ranking and Trace Norm Regularization.
IEEE Trans. Multim., 2017

Deep Edge Guided Recurrent Residual Learning for Image Super-Resolution.
IEEE Trans. Image Process., 2017

End-to-End Comparative Attention Networks for Person Re-Identification.
IEEE Trans. Image Process., 2017

STC: A Simple to Complex Framework for Weakly-Supervised Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

A survey on deep learning-based fine-grained object classification and semantic segmentation.
Int. J. Autom. Comput., 2017

Weaving Multi-scale Context for Single Shot Detector.
CoRR, 2017

WSNet: Compact and Efficient Networks with Weight Sampling.
CoRR, 2017

Personalized and Occupational-aware Age Progression by Generative Adversarial Networks.
CoRR, 2017

HashGAN: Attention-aware Deep Adversarial Hashing for Cross Modal Retrieval.
CoRR, 2017

Deep Sparse Subspace Clustering.
CoRR, 2017

Discriminative Similarity for Clustering and Semi-Supervised Learning.
CoRR, 2017

On the Suboptimality of Proximal Gradient Descent for ℓ^0 Sparse Approximation.
CoRR, 2017

Self-explanatory Deep Salient Object Detection.
CoRR, 2017

A Simple Loss Function for Improving the Convergence and Accuracy of Visual Question Answering Models.
CoRR, 2017

The Landscape of Deep Learning Algorithms.
CoRR, 2017

Multi-View Image Generation from a Single-View.
CoRR, 2017

A Unified Framework for Stochastic Matrix Factorization via Variance Reduction.
CoRR, 2017

A Good Practice Towards Top Performance of Face Recognition: Transferred Deep Feature Fusion.
CoRR, 2017

Generative Partition Networks for Multi-Person Pose Estimation.
CoRR, 2017

Towards Real World Human Parsing: Multiple-Human Parsing in the Wild.
CoRR, 2017

Outlier Robust Online Learning.
CoRR, 2017

On Fundamental Limits of Robust Learning.
CoRR, 2017

Sharing Residual Units Through Collective Tensor Factorization in Deep Neural Networks.
CoRR, 2017

Neighborhood Regularized ℓ^1-Graph.
Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence, 2017

Dual-Agent GANs for Photorealistic and Identity Preserving Profile Face Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Predicting Scene Parsing and Motion Dynamics in the Future.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Multimodal Learning and Reasoning for Visual Question Answering.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Dual Path Networks.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Robust Visual Object Tracking with Top-down Reasoning.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Integrated Face Analytics Networks through Cross-Dataset Hybrid Training.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Deep Attribute-preserving Metric Learning for Natural Language Object Retrieval.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Generative Attention Model with Adversarial Self-learning for Visual Question Answering.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Online compressed robust PCA.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Online Robust Low-Rank Tensor Learning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Training Group Orthogonal Neural Networks with Privileged Information.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

High Performance Large Scale Face Recognition with Multi-cognition Softmax and Feature Retrieval.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Know You at One Glance: A Compact Vector Representation for Low-Shot Learning.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Neural Person Search Machines.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Regional Interactive Image Segmentation Networks.
Proceedings of the IEEE International Conference on Computer Vision, 2017

FoveaNet: Perspective-Aware Urban Scene Parsing.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Video Scene Parsing with Predictive Feature Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

BoxFlow: Unsupervised Face Detector Adaptation from Images to Videos.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

Outlier-Robust Tensor PCA.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Self-Supervised Neural Aggregation Networks for Human Parsing.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Deep Joint Rain Detection and Removal from a Single Image.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

3D-Assisted Coarse-to-Fine Extreme-Pose Facial Landmark Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Object Region Mining with Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Interpretable Structure-Evolving LSTM.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Perceptual Generative Adversarial Networks for Small Object Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Estimation of Affective Level in the Wild with Multiple Memory Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Deep Self-Taught Learning for Weakly Supervised Object Localization.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Learning Detection with Diverse Proposals.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Marginalized CNN: Learning Deep Invariant Representations.
Proceedings of the British Machine Vision Conference 2017, 2017

Cascade Subspace Clustering.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Multi-Path Feedback Recurrent Neural Networks for Scene Parsing.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Efficient Hyperparameter Optimization for Deep Learning Algorithms Using Deterministic RBF Surrogates.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Dual Low-Rank Pursuit: Learning Salient Features for Saliency Detection.
IEEE Trans. Neural Networks Learn. Syst., 2016

Modality-Dependent Cross-Media Retrieval.
ACM Trans. Intell. Syst. Technol., 2016

Beyond Object Proposals: Random Crop Pooling for Multi-Label Image Recognition.
IEEE Trans. Image Process., 2016

Scale-Aware Pixelwise Object Proposal Networks.
IEEE Trans. Image Process., 2016

Joint Rain Detection and Removal via Iterative Region Dependent Multi-Task Learning.
CoRR, 2016

Correlation Alignment for Unsupervised Domain Adaptation.
CoRR, 2016

Multi-stage Object Detection with Group Recursive Learning.
CoRR, 2016

Training Skinny Deep Neural Networks with Iterative Hard Thresholding Methods.
CoRR, 2016

Multi-Path Feedback Recurrent Neural Network for Scene Parsing.
CoRR, 2016

Scale-aware Pixel-wise Object Proposal Networks.
CoRR, 2016

A Focused Dynamic Attention Model for Visual Question Answering.
CoRR, 2016

Hyperparameter Transfer Learning through Surrogate Alignment for Efficient Deep Neural Network Training.
CoRR, 2016

Hyperparameter Optimization of Deep Neural Networks Using Non-Probabilistic RBF Surrogate Model.
CoRR, 2016

Ensemble Robustness of Deep Learning Algorithms.
CoRR, 2016

Auxiliary Image Regularization for Deep CNNs with Noisy Labels.
Proceedings of the 4th International Conference on Learning Representations, 2016

Tree-Structured Reinforcement Learning for Sequential Object Localization.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

A Live Face Swapper.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Robust Face Recognition with Deep Multi-View Representation Learning.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Deep Subspace Clustering with Sparsity Prior.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

DrMAD: Distilling Reverse-Mode Automatic Differentiation for Optimizing Hyperparameters of Deep Neural Networks.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Happiness level prediction with sequential inputs via multiple regressions.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

ℓ^0-Sparse Subspace Clustering.
Proceedings of the Computer Vision - ECCV 2016, 2016

Robust Facial Landmark Detection via Recurrent Attentive-Refinement Networks.
Proceedings of the Computer Vision - ECCV 2016, 2016

Semantic Object Parsing with Graph LSTM.
Proceedings of the Computer Vision - ECCV 2016, 2016

Collaborative Layer-Wise Discriminative Learning in Deep Neural Networks.
Proceedings of the Computer Vision - ECCV 2016, 2016

Recurrent Face Aging.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Highway Vehicle Counting in Compressed Domain.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Reversible Recursive Instance-Level Object Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Semantic Object Parsing with Local-Global Long Short-Term Memory.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Natural Language Object Retrieval.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Recurrently Target-Attending Tracking.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Return of Frustratingly Easy Domain Adaptation.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Deep Learning with S-Shaped Rectified Linear Activation Units.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Looking Inside Category: Subcategory-Aware Object Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2015

Collaborative Linear Coding for Robust Image Classification.
Int. J. Comput. Vis., 2015

Learning ℓ^0-Graph for Data Clustering.
CoRR, 2015

Sense Beyond Expressions: Cuteness.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Learning the Structure of Deep Convolutional Networks.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014
Fashion Parsing With Weak Color-Category Labels.
IEEE Trans. Multim., 2014

Autogrouped Sparse Representation for Visual Analysis.
IEEE Trans. Image Process., 2014

Seeing Human Weight from a Single RGB-D Image.
J. Comput. Sci. Technol., 2014

Distributed Robust Learning.
CoRR, 2014

Robust Logistic Regression and Classification.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Robust Subspace Segmentation with Block-Diagonal Prior.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Learning Scalable Discriminative Dictionary with Sample Relatedness.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Image Classification via Object-Aware Holistic Superpixel Selection.
IEEE Trans. Image Process., 2013

Linear Distance Coding for Image Classification.
IEEE Trans. Image Process., 2013

Improving Bottom-up Saliency Detection by Looking into Neighbors.
IEEE Trans. Circuits Syst. Video Technol., 2013

Multi-class learning from class proportions.
Neurocomputing, 2013

Online Robust PCA via Stochastic Optimization.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Online PCA for Contaminated Data.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Correlation Adaptive Subspace Segmentation by Trace Lasso.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Subcategory-Aware Object Classification.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Perception Preserving Projections.
Proceedings of the British Machine Vision Conference, 2013

2012
Purposive Hidden-Object-Game: Embedding Human Computation in Popular Game.
IEEE Trans. Multim., 2012

Histogram Contextualization.
IEEE Trans. Image Process., 2012

Towards a universal detector by mining concepts with small semantic gaps.
Expert Syst. Appl., 2012

Don't ask me what I'm like, just watch and listen.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Hi, magic closet, tell me what to wear!
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Hi, magic closet, tell me what to wear!
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Robust PCA in High-dimension: A Deterministic Approach.
Proceedings of the 29th International Conference on Machine Learning, 2012

Segmentation over Detection by Coupled Global and Local Sparse Representations.
Proceedings of the Computer Vision - ECCV 2012, 2012

Auto-Grouped Sparse Representation for Visual Analysis.
Proceedings of the Computer Vision - ECCV 2012, 2012

2011
Purposive hidden-object-game: embedding human computation in popular game.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Purposive hidden-object game (P-HOG) towards imperceptible human computation.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Segment an image by looking into an image corpus.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Geometric ℓ_p-norm feature pooling for image classification.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Image segmentation with patch-pair density priors.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Towards a universal detector by mining concepts with small semantic gaps.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Auto-generation of professional background music for home-made videos.
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

Learning to rank tags.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

