Gang Hua

Orcid: 0000-0001-9522-6157

Affiliations:
  • Wormpex AI Research, Bellevue, WA, USA
  • Microsoft Research Asia, Visual Computing Group, Beijing, China (2015 - 2018)
  • Stevens Institute of Technology, Hoboken, NJ, USA (2011 - 2015)
  • IBM Thomas J Watson Research Center, Yorktown Heights, NY, USA (2010 - 2011)
  • Nokia Research Center Hollywood, Santa Monica, CA, USA (2009 - 2010)
  • Microsoft Corp., Redmond, WA, USA (2006 - 2009)
  • Northwestern University, Evanston, IL, USA (PhD 2006)


According to our database1, Gang Hua authored at least 294 papers between 2003 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Towards Unified Robustness Against Both Backdoor and Adversarial Attacks.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Robust Model Watermarking for Image Processing Networks via Structure Consistency.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

Transformer Based Pluralistic Image Completion With Reduced Information Loss.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

Adversarial Attack and Defense in Deep Ranking.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2024

Residual feature learning with hierarchical calibration for gaze estimation.
Mach. Vis. Appl., July, 2024

Deep Image Matting With Sparse User Interactions.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

Transfer easy to hard: Adversarial contrastive feature learning for unsupervised person re-identification.
Pattern Recognit., January, 2024

Abnormal Ratios Guided Multi-Phase Self-Training for Weakly-Supervised Video Anomaly Detection.
IEEE Trans. Multim., 2024

Sparse Pedestrian Character Learning for Trajectory Prediction.
IEEE Trans. Multim., 2024

Disentangled Sample Guidance Learning for Unsupervised Person Re-Identification.
IEEE Trans. Image Process., 2024

PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition.
IEEE Trans. Image Process., 2024

Toward High Quality Multi-Object Tracking and Segmentation Without Mask Supervision.
IEEE Trans. Image Process., 2024

End-to-end pedestrian trajectory prediction via Efficient Multi-modal Predictors.
Comput. Vis. Image Underst., 2024

Jigsaw++: Imagining Complete Shape Priors for Object Reassembly.
CoRR, 2024

Pluralistic Salient Object Detection.
CoRR, 2024

Towards Aligned Data Removal via Twin Machine Unlearning.
CoRR, 2024

Diversifying Query: Region-Guided Transformer for Temporal Sentence Grounding.
CoRR, 2024

Efficient LLM-Jailbreaking by Introducing Visual Modality.
CoRR, 2024

Boosting Semi-Supervised Temporal Action Localization by Learning from Non-Target Classes.
CoRR, 2024

Recurrent Aligned Network for Generalized Pedestrian Trajectory Prediction.
CoRR, 2024

Deployment Prior Injection for Run-time Calibratable Object Detection.
CoRR, 2024

Jailbreaking Attack against Multimodal Large Language Model.
CoRR, 2024

Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Enhancing Implicit Shape Generators Using Topological Regularizations.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Learning Anomalies with Normality Prior for Unsupervised Video Anomaly Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

UGG: Unified Generative Grasping.
Proceedings of the Computer Vision - ECCV 2024, 2024

Stepwise Multi-grained Boundary Detector for Point-Supervised Temporal Action Localization.
Proceedings of the Computer Vision - ECCV 2024, 2024

Towards Generalizable Multi-Object Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Evidential Active Recognition: Intelligent and Prudent Open-World Embodied Perception.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Temporal Correlation Vision Transformer for Video Person Re-Identification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Representing Multimodal Behaviors With Mean Location for Pedestrian Trajectory Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

ContextLoc++: A Unified Context Model for Temporal Action Localization.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Memory-augmented appearance-motion network for video anomaly detection.
Pattern Recognit., June, 2023

Semantic Probability Distribution Modeling for Diverse Semantic Image Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Adaptive Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

Perceptual Hashing of Deep Convolutional Neural Networks for Model Copy Detection.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Exploring Action Centers for Temporal Action Localization.
IEEE Trans. Multim., 2023

Instance Motion Tendency Learning for Video Panoptic Segmentation.
IEEE Trans. Image Process., 2023

Tolerating Annotation Displacement in Dense Object Counting via Point Annotation Probability Map.
IEEE Trans. Image Process., 2023

Egocentric Computer Vision for Hands-Free Robotic Wheelchair Navigation.
J. Intell. Robotic Syst., 2023

HQ-50K: A Large-scale, High-quality Dataset for Image Restoration.
CoRR, 2023

Designing a Better Asymmetric VQGAN for StableDiffusion.
CoRR, 2023

Implicit Autoencoder for Point-Cloud Self-Supervised Representation Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning from Noisy Pseudo Labels for Semi-Supervised Temporal Action Localization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Trajectory Unified Transformer for Pedestrian Trajectory Prediction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Parallel Attention Interaction Network for Few-Shot Skeleton-based Action Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Flexible Visual Recognition by Evidential Modeling of Confusion and Ignorance.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Sparse Instance Conditioned Multimodal Trajectory Prediction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SOAR: Scene-debiasing Open-set Action Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MotionTrack: Learning Robust Short-Term and Long-Term Motions for Multi-Object Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Progressive Backdoor Erasing via connecting Backdoor and Adversarial Attacks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Diversity-Aware Meta Visual Prompting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Regularizing Second-Order Influences for Continual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Boosted Dynamic Neural Networks.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Multi-Stream Representation Learning for Pedestrian Trajectory Prediction.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Weakly-Guided Self-Supervised Pretraining for Temporal Activity Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Action Coherence Network for Weakly-Supervised Temporal Action Localization.
IEEE Trans. Multim., 2022

Poison Ink: Robust and Invisible Backdoor Attack.
IEEE Trans. Image Process., 2022

E2Style: Improve the Efficiency and Effectiveness of StyleGAN Inversion.
IEEE Trans. Image Process., 2022

Local to Global Feature Learning for Salient Object Detection.
Pattern Recognit. Lett., 2022

Dual relation network for temporal action localization.
Pattern Recognit., 2022

Loss functions for pose guided person image generation.
Pattern Recognit., 2022

Deep Model Intellectual Property Protection via Deep Watermarking.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Efficient Semantic Image Synthesis via Class-Adaptive Normalization.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Exploring Discrete Diffusion Models for Image Captioning.
CoRR, 2022

PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition.
CoRR, 2022

E<sup>2</sup>TAD: An Energy-Efficient Tracking-based Action Detector.
CoRR, 2022

Implicit Autoencoder for Point Cloud Self-supervised Representation Learning.
CoRR, 2022

TxVAD: Improved Video Action Detection by Transformers.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Breadcrumbs: Adversarial Class-Balanced Sampling for Long-Tailed Recognition.
Proceedings of the Computer Vision, 2022

Uncertainty-Based Spatial-Temporal Attention for Online Action Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Disentangled Classification and Localization Representations for Temporal Action Localization.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Social Interpretable Tree for Pedestrian Trajectory Prediction.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Complementary Attention Gated Network for Pedestrian Trajectory Prediction.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Object Cosegmentation in Noisy Videos With Multilevel Hypergraph.
IEEE Trans. Multim., 2021

Giant Panda Identification.
IEEE Trans. Image Process., 2021

Usability Studies of an Egocentric Vision-Based Robotic Wheelchair.
ACM Trans. Hum. Robot Interact., 2021

Editorial: Introduction to the Special Section on CVPR2019 Best Papers.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

A General Decoupled Learning Framework for Parameterized Image Operators.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Explicit Filterbank Learning for Neural Image Style Transfer and Image Processing.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Graph-based temporal action co-localization from an untrimmed video.
Neurocomputing, 2021

Poison Ink: Robust and Invisible Backdoor Attack.
CoRR, 2021

Exploring Structure Consistency for Deep Model Watermarking.
CoRR, 2021

Semi-supervised Long-tailed Recognition using Alternate Sampling.
CoRR, 2021

Sparse Pose Trajectory Completion.
CoRR, 2021

A Simple Baseline for StyleGAN Inversion.
CoRR, 2021

Beyond Visual Attractiveness: Physically Plausible Single Image HDR Reconstruction for Spherical Panoramas.
CoRR, 2021

Robust Pose Estimation in Crowded Scenes with Direct Pose-Level Inference.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Enriching Local and Global Contexts for Temporal Action Localization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Practical Relative Order Attack in Deep Ranking.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unlimited Neighborhood Interaction for Heterogeneous Trajectory Prediction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

GistNet: a Geometric Structure Transfer Network for Long-Tailed Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Meta Pairwise Relationship Distillation for Unsupervised Person Re-identification.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Diverse Semantic Image Synthesis via Probability Distribution Modeling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning View Selection for 3D Scenes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

SGCN: Sparse Graph Convolution Network for Pedestrian Trajectory Prediction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Any-Precision Deep Neural Networks.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Improving Person Re-Identification With Iterative Impression Aggregation.
IEEE Trans. Image Process., 2020

Controllable Image Processing via Adaptive FilterBank Pyramid.
IEEE Trans. Image Process., 2020

Semi-online Multi-people Tracking by Re-identification.
Int. J. Comput. Vis., 2020

Semantic Image Synthesis via Efficient Class-Adaptive Normalization.
CoRR, 2020

Calibrated Domain-Invariant Learning for Highly Generalizable Large Scale Re-Identification.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Passport-aware Normalization for Deep Model Protection.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Action Co-localization in an Untrimmed Video by Graph Neural Networks.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Joint Multi-object Detection and Segmentation from an Untrimmed Video.
Proceedings of the Artificial Intelligence Applications and Innovations, 2020

Fine-Grained Giant Panda Identification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Temporal Keypoint Matching and Refinement Network for Pose Estimation and Tracking.
Proceedings of the Computer Vision - ECCV 2020, 2020

Adversarial Ranking Attack and Defense.
Proceedings of the Computer Vision - ECCV 2020, 2020

Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization.
Proceedings of the Computer Vision - ECCV 2020, 2020

LG-GAN: Label Guided Adversarial Network for Flexible Targeted Attack of Point Cloud Based Deep Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Few-Shot Open-Set Recognition Using Meta-Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

SaccadeNet: A Fast and Accurate Object Detector.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

gDLS*: Generalized Pose-and-Scale Estimation Given Scale and Gravity Priors.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Self-Robust 3D Point Recognition via Gather-Vector Guidance.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Loss Functions for Person Image Generation.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

Ladder Loss for Coherent Visual-Semantic Embedding.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Order-Preserving Optimal Transport for Distances between Sequences.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Video Imprint.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Gated Context Aggregation Network for Image Dehazing and Deraining.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Priming Deep Pedestrian Detection with Geometric Context.
Proceedings of the International Conference on Robotics and Automation, 2019

Action Coherence Network for Weakly Supervised Temporal Action Localization.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Capturing Piecewise SVBRDFs with Content Aware Lighting.
Proceedings of the Advances in Computer Graphics, 2019

Object Affordances Graph Network for Action Recognition.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Video Imprint Segmentation for Temporal Action Detection in Untrimmed Videos.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Knowledge-Based Topic Model for Unsupervised Object Discovery and Localization.
IEEE Trans. Image Process., 2018

Joint Video Object Discovery and Segmentation by Coupled Dynamic Markov Networks.
IEEE Trans. Image Process., 2018

Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network.
Sensors, 2018

Segment-Tube: Spatio-Temporal Action Localization in Untrimmed Videos with Per-Frame Segmentation.
Sensors, 2018

Probabilistic Elastic Part Model: A Pose-Invariant Representation for Real-World Face Verification.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Collaborative Active Visual Recognition from Crowds: A Distributed Ensemble Approach.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Recurrent Variational Autoencoders for Learning Nonlinear Generative Models in the Presence of Outliers.
IEEE J. Sel. Top. Signal Process., 2018

Connections with Robust PCA and the Role of Emergent Sparsity in Variational Autoencoder Models.
J. Mach. Learn. Res., 2018

Guest Editorial.
Comput. Vis. Image Underst., 2018

A Compositional Textual Model for Recognition of Imperfect Word Images.
CoRR, 2018

Attention-based Temporal Weighted Convolutional Neural Network for Action Recognition.
CoRR, 2018

Attention-Based Temporal Weighted Convolutional Neural Network for Action Recognition.
Proceedings of the Artificial Intelligence Applications and Innovations, 2018

Video Object Co-Segmentation from Noisy Videos by a Multi-Level Hypergraph Model.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Joint Spatio-Temporal Action Localization in Untrimmed Videos with Per-Frame Segmentation.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

Stacked Cross Attention for Image-Text Matching.
Proceedings of the Computer Vision - ECCV 2018, 2018

Decouple Learning for Parameterized Image Operators.
Proceedings of the Computer Vision - ECCV 2018, 2018

Semi-supervised FusedGAN for Conditional Image Generation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Revisiting Deep Intrinsic Image Decompositions.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Stereoscopic Neural Style Transfer.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Towards Open-Set Identity Preserving Face Synthesis.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Visual attribute transfer through deep image analogy.
ACM Trans. Graph., 2017

Visual Tracking via Joint Discriminative Appearance Learning.
IEEE Trans. Circuits Syst. Video Technol., 2017

Video Object Discovery and Co-Segmentation with Extremely Weak Supervision.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Multi-Timescale Collaborative Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Exemplar-Guided Similarity Learning on Polynomial Kernel Feature Map for Person Re-identification.
Int. J. Comput. Vis., 2017

Understanding and Predicting The Attractiveness of Human Action Shot.
CoRR, 2017

Revisiting Deep Image Smoothing and Intrinsic Image Decomposition.
CoRR, 2017

Veiled Attributes of the Variational Autoencoder.
CoRR, 2017

Green Generative Modeling: Recycling Dirty Data using Recurrent Variational Autoencoders.
Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence, 2017

Hierarchical Multimodal LSTM for Dense Visual-Semantic Embedding.
Proceedings of the IEEE International Conference on Computer Vision, 2017

A Generic Deep Architecture for Single Image Reflection Removal and Image Smoothing.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Coherent Online Video Style Transfer.
Proceedings of the IEEE International Conference on Computer Vision, 2017

CVAE-GAN: Fine-Grained Image Generation through Asymmetric Training.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Fast, Accurate Thin-Structure Obstacle Detection for Autonomous Mobile Robots.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Neural Aggregation Network for Video Face Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Order-Preserving Wasserstein Distance for Sequence Matching.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Correlational Gaussian Processes for Cross-Domain Visual Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Collaborative Deep Reinforcement Learning for Joint Object Search.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

ER3: A Unified Framework for Event Retrieval, Recognition and Recounting.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

StyleBank: An Explicit Representation for Neural Image Style Transfer.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

How to Train a Compact Binary Neural Network with High Accuracy?
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Three-Dimensional Traffic Scenes Simulation From Road Image Sequences.
IEEE Trans. Intell. Transp. Syst., 2016

Introduction of New Associate Editors.
IEEE Trans. Circuits Syst. Video Technol., 2016

A Joint Gaussian Process Model for Active Visual Recognition with Expertise Estimation in Crowdsourcing.
Int. J. Comput. Vis., 2016

Neural Aggregation Network for Video Face Recognition.
CoRR, 2016

An egocentric computer vision based co-robot wheelchair.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Supervised Matrix Factorization for Cross-Modality Hashing.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Supervised Transformer Network for Efficient Face Detection.
Proceedings of the Computer Vision - ECCV 2016, 2016

Ordinal Regression with Multiple Output CNN for Age Estimation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

A Multi-level Contextual Model for Person Recognition in Photo Albums.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Topical Video Object Discovery From Key Frames by Modeling Word Co-Occurrence Prior.
IEEE Trans. Image Process., 2015

The VLSI implementation of a high-resolution depth-sensing SoC based on active structured light.
Mach. Vis. Appl., 2015

Guest editorial: selected papers from ICIMCS 2013.
Multim. Syst., 2015

Auxiliary Training Information Assisted Visual Recognition.
IPSJ Trans. Comput. Vis. Appl., 2015

Multimedia Big Data Computing.
IEEE Multim., 2015

Visual Topic Network: Building better image representations for images in social media.
Comput. Vis. Image Underst., 2015

Multi-View Visual Recognition of Imperfect Testing Data.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

An electronic device to record consensual reflex in human pupil.
Proceedings of the MEDINFO 2015: eHealth-enabled Health, 2015

Modeling Inter- and Intra-Part Deformations for Object Structure Parsing.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

CANNET: Context aware nonlocal convolutional networks for semantic image segmentation.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

A New Approach to Detect Use of Alcohol Through Iris Videos Using Computer Vision.
Proceedings of the Image Analysis and Processing - ICIAP 2015, 2015

Learning Discriminative Reconstructions for Unsupervised Outlier Removal.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Multi-class Multi-annotator Active Learning with Robust Gaussian Process for Visual Recognition.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Report on the FG 2015 Video Person Recognition Evaluation.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

A convolutional neural network cascade for face detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Hierarchical-PEP model for real-world face recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Similarity learning on an explicit polynomial kernel feature map for person re-identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Development of an Effective Method and a Portable Device to Evaluate the Pupillary Reflex.
Proceedings of the 28th IEEE International Symposium on Computer-Based Medical Systems, 2015

2014
Joint Segmentation and Recognition of Categorized Objects From Noisy Web Image Collection.
IEEE Trans. Image Process., 2014

Weakly Supervised Visual Dictionary Learning by Harnessing Image Attributes.
IEEE Trans. Image Process., 2014

Hyperspectral Image Classification Through Bilayer Graph-Based Learning.
IEEE Trans. Image Process., 2014

ObjectPatchNet: Towards scalable and semantic image annotation and retrieval.
Comput. Vis. Image Underst., 2014

Probabilistic Elastic Part Model for Real-World Face Recognition.
Proceedings of the Face and Facial Expression Recognition from Real World Videos, 2014

The IJCB 2014 PaSC video face and person recognition competition.
Proceedings of the IEEE International Joint Conference on Biometrics, Clearwater, 2014

Video Object Discovery and Co-segmentation with Extremely Weak Supervision.
Proceedings of the Computer Vision - ECCV 2014, 2014

Egocentric Object Recognition Leveraging the 3D Shape of the Grasping Hand.
Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

Description-Discrimination Collaborative Tracking.
Proceedings of the Computer Vision - ECCV 2014, 2014

Semi-supervised Relational Topic Model for Weakly Annotated Image Recognition in Social Media.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Hash-SVM: Scalable Kernel Machines for Large-Scale Visual Classification.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Unsupervised One-Class Learning for Automatic Outlier Removal.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Efficient Boosted Exemplar-Based Face Detection.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Can Visual Recognition Benefit from Auxiliary Information in Training?
Proceedings of the Computer Vision - ACCV 2014, 2014

Accurate Object Detection with Location Relaxation and Regionlets Re-localization.
Proceedings of the Computer Vision - ACCV 2014, 2014

Eigen-PEP for Video Face Recognition.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Vision-Based Interaction
Synthesis Lectures on Computer Vision, Morgan & Claypool Publishers, ISBN: 978-3-031-01812-1, 2013

Introduction to the special section of best papers of ACM multimedia 2012.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Automatic salient object extraction with contextual cue and its applications to recognition and alpha matting.
Pattern Recognit., 2013

IBM Research and Columbia University TRECVID-2013 Multimedia Event Detection (MED), Multimedia Event Recounting (MER), Surveillance Event Detection (SED), and Semantic Indexing (SIN) Systems.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

An Integrated Model for Bayesian Learning of Sparse Representation and Classifier Training.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Semi-Supervised Learning with Manifold Fitted Graphs.
Proceedings of the IJCAI 2013, 2013

An egocentric vision based assistive co-robot.
Proceedings of the IEEE 13th International Conference on Rehabilitation Robotics, 2013

Large-scale video event classification using dynamic temporal pyramid matching of visual semantics.
Proceedings of the IEEE International Conference on Image Processing, 2013

Active Visual Recognition with Expertise Estimation in Crowdsourcing.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Probabilistic Elastic Part Model for Unsupervised Face Detector Adaptation.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Collaborative Active Learning of a Kernel Machine Ensemble for Recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Topical Video Object Discovery from Key Frames by Modeling Word Co-occurrence Prior.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Video Demo: An Egocentric Vision Based Assistive Co-robot.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

Probabilistic Elastic Matching for Pose Variant Face Verification.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Introduction to the special section of best papers of ACM multimedia 2011.
ACM Trans. Multim. Comput. Commun. Appl., 2012

Semantic Model Vectors for Complex Video Event Recognition.
IEEE Trans. Multim., 2012

Dynamic hand gesture recognition: An exemplar-based approach from motion divergence fields.
Image Vis. Comput., 2012

Introduction to the Special Issue on Mobile Vision.
Int. J. Comput. Vis., 2012

IBM Research and Columbia University TRECVID-2012 Multimedia Event Detection (MED), Multimedia Event Recounting (MER), and Semantic Indexing (SIN) Systems.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Concurrent segmentation of categorized objects from an image collection.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Video Event Detection Using Temporal Pyramids of Visual Semantics with Kernel Optimization and Model Subspace Boosting.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Multi-scale shared features for cascade object detection.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Scene Aligned Pooling for Complex Video Recognition.
Proceedings of the Computer Vision - ECCV 2012, 2012

Detection by detections: Non-parametric detector adaptation for a video.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Context aware topic model for scene recognition.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications.
IEEE Trans. Image Process., 2011

Introduction to the Special Section on Real-World Face Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Discriminative Learning of Local Image Descriptors.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Modeling spatial and semantic cues for large-scale near-duplicated image retrieval.
Comput. Vis. Image Underst., 2011

IBM Research and Columbia University TRECVID-2011 Multimedia Event Detection (MED) System.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

Automatic salient object extraction with contextual cue.
Proceedings of the IEEE International Conference on Computer Vision, 2011

What characterizes a shadow boundary under the sun and sky?
Proceedings of the IEEE International Conference on Computer Vision, 2011

Motion divergence fields for dynamic hand gesture recognition.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

Spatial-DiscLDA for visual recognition.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Multiple instance boosting with global smoothness regularization.
Proceedings of the 8th International Conference on Information, 2011

Towards large scale land-cover recognition of satellite images.
Proceedings of the 8th International Conference on Information, 2011

2010
A Comprehensive Approach to Image Spam Detection: From Server to Client Solution.
IEEE Trans. Inf. Forensics Secur., 2010

A Hierarchical Visual Model for Video Object Summarization.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

Visual quality assessment for web videos.
J. Vis. Commun. Image Represent., 2010

IBM Research TRECVID-2010 Video Copy Detection and Multimedia Event Detection System.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Building contextual visual vocabulary for large-scale image applications.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

ACM workshop on mobile cloud media computing.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

A nonnegative sparsity induced similarity measure with application to cluster analysis of spam images.
Proceedings of the IEEE International Conference on Acoustics, 2010

Discriminative Tracking by Metric Learning.
Proceedings of the Computer Vision, 2010

Joint People, Event, and Location Recognition in Personal Photo Collections Using Cross-Domain Context.
Proceedings of the Computer Vision, 2010

Interest seam image.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Context-Aware Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Face Relighting from a Single Image under Arbitrary Unknown Lighting Conditions.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Descriptive visual words and visual phrases for image applications.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

What can visual content analysis do for text based image search?
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Which faces to tag: Adding prior constraints into active learning.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

A robust elastic and partial matching metric for face recognition.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Implicit elastic matching with random projections for pose-variant face recognition.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Picking the best DAISY.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Multiple instance fFeature for robust part-based object detection.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Efficient Scale-Space Spatiotemporal Saliency Tracking for Distortion-Free Video Retargeting.
Proceedings of the Computer Vision, 2009

2008
VideoCut: Removing Irrelevant Frames by Discovering the Object of Interest.
Proceedings of the Computer Vision, 2008

Meta-tag propagation by co-training an ensemble classifier for improving image search relevance.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

Integrated feature selection and higher-order spatial feature extraction for object categorization.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
A decentralized probabilistic approach to articulated body tracking.
Comput. Vis. Image Underst., 2007

PEYE: Toward a Visual Motion Based Perceptual Interface for Mobile Devices.
Proceedings of the Human-Computer Interaction, 2007

Discriminant Embedding for Local Image Descriptors.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Face Re-Lighting from a Single Image under Harsh Lighting Conditions.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Face Recognition using Discriminatively Trained Orthogonal Rank One Tensor Projections.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
Iterative Local-Global Energy Minimization for Automatic Extraction of Objects of Interest.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

Sequential mean field variational analysis of structured deformable shapes.
Comput. Vis. Image Underst., 2006

Automatic Business Card Scanning with a Camera.
Proceedings of the International Conference on Image Processing, 2006

Measurement integration under inconsistency for robust tracking.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Efficient Optimal Kernel Placement for Reliable Visual Tracking.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2005
Variational Maximum A Posteriori by Annealed Mean Field Analysis.
IEEE Trans. Pattern Anal. Mach. Intell., 2005

A Statistical Field Model for Pedestrian Detection.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Learning to Estimate Human Pose with Data Driven Belief Propagation.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004
Multi-Scale Visual Tracking by Sequential Belief Propagation.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

2003
Tracking Articulated Body by Dynamic Markov Network.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Tracking Appearances with Occlusions.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

Switching Observation Models for Contour Tracking in Clutter.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003


  Loading...