Antoni B. Chan

Orcid: 0000-0002-2886-2513

Affiliations:
  • City University of Hong Kong, Department of Computer Science
  • Cornell University, Ithaca, NY, USA


According to our database1, Antoni B. Chan authored at least 169 papers between 2005 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Generalized Characteristic Function Loss for Crowd Analysis in the Frequency Domain.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Human attention guided explainable artificial intelligence for computer vision models.
Neural Networks, 2024

Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting.
CoRR, 2024

The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks.
CoRR, 2024

FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models.
CoRR, 2024

Learning Tracking Representations from Single Point Annotations.
CoRR, 2024

Robust Unsupervised Crowd Counting and Localization with Adaptive Resolution SAM.
CoRR, 2024

Affecting Audience Valence and Arousal in 360 Immersive Environments: How Powerful Neural Style Transfer Is?
Proceedings of the Virtual, Augmented and Mixed Reality, 2024

Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

A Fixed-Point Approach to Unified Prompt-Based Counting.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Single-Frame-Based Deep View Synchronization for Unsynchronized Multicamera Surveillance.
IEEE Trans. Neural Networks Learn. Syst., December, 2023

Modeling Noisy Annotations for Point-Wise Supervision.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Variational Nested Dropout.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

A Lightweight and Detector-Free 3D Single Object Tracker on Point Clouds.
IEEE Trans. Intell. Transp. Syst., May, 2023

Clustering Hidden Markov Models With Variational Bayesian Hierarchical EM.
IEEE Trans. Neural Networks Learn. Syst., March, 2023

On Distinctive Image Captioning via Comparing and Reweighting.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Retrieval-Augmented Multiple Instance Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ODAM: Gradient-based Instance-Specific Visual Explanations for Object Detection.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Bayes-MIL: A New Probabilistic Perspective on Attention-based Multiple Instance Learning for Whole Slide Images.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

The First Visual Object Tracking Segmentation VOTS2023 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Scalable Video Object Segmentation with Simplified Framework.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DropMAE: Masked Autoencoders with Spatial-Attention Dropout for Tracking Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

TWINS: A Fine-Tuning Framework for Improved Transferability of Adversarial Robustness and Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Optimal Transport Minimization: Crowd Localization on Density Maps for Semi-Supervised Counting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Human Attention-Guided Explainable AI for Object Detection.
Proceedings of the 45th Annual Meeting of the Cognitive Science Society, 2023

2022
Accelerating Monte Carlo Bayesian Prediction via Approximating Predictive Uncertainty Over the Simplex.
IEEE Trans. Neural Networks Learn. Syst., 2022

Bits-Ensemble: Toward Light-Weight Robust Deep Ensemble by Bits-Sharing.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

On Diversity in Image Captioning: Metrics and Methods.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Kernel-Based Density Map Generation for Dense Object Counting.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

PRIMAL-GMM: PaRametrIc MAnifold Learning of Gaussian Mixture Models.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

RegGeoNet: Learning Regular Representations for Large-Scale 3D Point Clouds.
Int. J. Comput. Vis., 2022

3D Crowd Counting via Geometric Attention-Guided Multi-view Fusion.
Int. J. Comput. Vis., 2022

Wide-Area Crowd Counting: Multi-view Fusion Networks for Counting in Large Scenes.
Int. J. Comput. Vis., 2022

Pareto Optimization for Active Learning under Out-of-Distribution Data Scenarios.
CoRR, 2022

An Empirical Study on Distribution Shift Robustness From the Perspective of Pre-Training and Data Augmentation.
CoRR, 2022

A Comparative Survey of Deep Active Learning.
CoRR, 2022

A Lightweight and Detector-free 3D Single Object Tracker on Point Clouds.
CoRR, 2022

Asymptotic optimality for active learning processes.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Improved Fine-Tuning by Better Leveraging Pre-Training Data.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Calibration-Free Multi-view Crowd Counting.
Proceedings of the Computer Vision - ECCV 2022, 2022

Crowd Counting in the Frequency Domain.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Boosting Adversarial Robustness From The Perspective of Effective Margin Regularization.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Scale-Prior Deformable Convolution for Exemplar-Guided Class-Agnostic Counting.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Angular-Driven Feedback Restoration Networks for Imperfect Sketch Recognition.
IEEE Trans. Image Process., 2021

Fine-Grained Crowd Counting.
IEEE Trans. Image Process., 2021

Tracking-by-Counting: Using Network Flows on Crowd Density Maps for Tracking Multiple Targets.
IEEE Trans. Image Process., 2021

Visual Tracking via Dynamic Memory Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Improved Fine-tuning by Leveraging Pre-training Data: Theory and Practice.
CoRR, 2021

Multiple-criteria Based Active Learning with Fixed-size Determinantal Point Processes.
CoRR, 2021

The Implicit Biases of Stochastic Gradient Descent on Deep Neural Networks with Batch Normalization.
CoRR, 2021

Bayesian Nested Neural Networks for Uncertainty Calibration and Adaptive Compression.
CoRR, 2021

Hierarchical learning of Hidden Markov Models with clustering regularization.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Chinese White Dolphin Detection in the Wild.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Dynamic Momentum Adaptation for Zero-Shot Cross-Domain Crowd Counting.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Group-based Distinctive Image Captioning with Memory Attention.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

A Comparative Survey: Benchmarking for Pool-based Active Learning.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Meta-Graph Adaptation for Visual Object Tracking.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

BEV-Net: Assessing Social Distancing Compliance by Joint People Localization and Geometric Reasoning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Cross-View Cross-Scene Multi-View Crowd Counting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Progressive Unsupervised Learning for Visual Object Tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

A Generalized Loss Function for Crowd Counting and Localization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Bayesian Nested Neural Networks for Uncertainty Calibration and Adaptive Compression.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Eye movement consistency in global-local perceptual processing predicts schizotypy.
Proceedings of the 43rd Annual Meeting of the Cognitive Science Society, 2021

2020
Incorporating Side Information by Adaptive Convolution.
Int. J. Comput. Vis., 2020

Wide-Area Crowd Counting: Multi-View Fusion Networks for Counting in Large Scenes.
CoRR, 2020

ALdataset: a benchmark for pool-based active learning.
CoRR, 2020

Improve Generalization and Robustness of Neural Networks via Weight Scale Shifting Invariant Regularizations.
CoRR, 2020

Single-Frame based Deep View Synchronization for Unsynchronized Multi-Camera Surveillance.
CoRR, 2020

Over-crowdedness Alert! Forecasting the Future Crowd Distribution.
CoRR, 2020

Modeling Noisy Annotations for Crowd Counting.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Fully Nested Neural Network for Adaptive Compression and Quantization.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Compare and Reweight: Distinctive Image Captioning Using Similar Images Sets.
Proceedings of the Computer Vision - ECCV 2020, 2020

ROAM: Recurrently Optimizing Tracking Model.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

The role of eye movement consistency in learning to recognise faces: Computational and experimental examinations.
Proceedings of the 42th Annual Meeting of the Cognitive Science Society, 2020

Neighbours Matter: Image Captioning with Similar Images.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

3D Crowd Counting via Multi-View Fusion with 3D Gaussian Kernels.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Beyond Counting: Comparisons of Density Maps for Crowd Analysis Tasks - Counting, Detection, and Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2019

Density-Preserving Hierarchical EM Algorithm: Simplifying Gaussian Mixture Models for Approximate Inference.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Is that my hand? An egocentric dataset for hand disambiguation.
Image Vis. Comput., 2019

Towards Diverse and Accurate Image Captions via Reinforcing Determinantal Point Process.
CoRR, 2019

Accelerating Monte Carlo Bayesian Inference via Approximating Predictive Uncertainty over Simplex.
CoRR, 2019

A Fully Bayesian Infinite Generative Model for Dynamic Texture Segmentation.
CoRR, 2019

Parametric Manifold Learning of Gaussian Mixture Models.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

ButtonTips: Design Web Buttons with Suggestions.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Hand Detection Using Zoomed Neural Networks.
Proceedings of the Image Analysis and Processing - ICIAP 2019, 2019

The Seventh Visual Object Tracking VOT2019 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Adaptive Density Map Generation for Crowd Counting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Wide-Area Crowd Counting via Ground-Plane Density Maps and Multi-View Fusion CNNs.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Describing Like Humans: On Diversity in Image Captioning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Residual Regression With Semantic Prior for Crowd Counting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Understanding Individual Differences in Eye Movement Pattern During Scene Perception through Co-Clustering of Hidden Markov Models.
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019

EMHMM: Eye Movement Analysis with Hidden Markov Models and Its Applications in Cognitive Research.
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019

2018
Color Orchestra: Ordering Color Palettes for Interpolation and Prediction.
IEEE Trans. Vis. Comput. Graph., 2018

EMHMM Simulation Study.
CoRR, 2018

CNN+CNN: Convolutional Decoders for Image Captioning.
CoRR, 2018

Learning Dynamic Memory Networks for Object Tracking.
Proceedings of the Computer Vision - ECCV 2018, 2018

Hand Detection using Deformable Part Models on an Egocentric Perspective.
Proceedings of the 2018 Digital Image Computing: Techniques and Applications, 2018

Fusing Crowd Density Maps and Visual Object Trackers for People Tracking in Crowd Scenes.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Optimal face recognition performance involves a balance between global and local information processing: Evidence from cultural difference.
Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

Crowd Counting by Adaptively Fusing Predictions from an Image Pyramid.
Proceedings of the British Machine Vision Conference 2018, 2018

Gated Hierarchical Attention for Image Captioning.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Dynamic Manga: Animating Still Manga via Camera Movement.
IEEE Trans. Multim., 2017

Martial Arts, Dancing and Sports dataset: A challenging stereo and multi-view dataset for 3D human pose estimation.
Image Vis. Comput., 2017

Maximum-Margin Structured Learning with Deep Networks for 3D Human Pose Estimation.
Int. J. Comput. Vis., 2017

Efficient tree-structured SfM by RANSAC generalized Procrustes analysis.
Comput. Vis. Image Underst., 2017

Mining probabilistic color palettes for summarizing color use in artwork collections.
Proceedings of the SIGGRAPH ASIA 2017, Bangkok, Thailand, November 27 - 30, 2017, 2017

Recurrent Filter Learning for Visual Tracking.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Video Desnowing and Deraining Based on Matrix Decomposition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Insomniacs Misidentify Angry Faces as Fearful Faces Because of Missing the Eyes: an Eye-Tracking Study.
Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017

Learning word embeddings via context grouping.
Proceedings of the ACM Turing 50th Celebration Conference, 2017

2016
Directing user attention via visual flow on web designs.
ACM Trans. Graph., 2016

Counting People Crossing a Line Using Integer Programming and Local Features.
IEEE Trans. Circuits Syst. Video Technol., 2016

Crowd Counting by Adapting Convolutional Neural Networks with Side Information.
CoRR, 2016

Patternista: learning element style compatibility and spatial composition for ring-based layout decoration.
Proceedings of the 5th Joint Symposium on Computational Aesthetics, 2016

Mind reading: Discovering individual preferences from eye movements using switching hidden Markov models.
Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016

Analytic Eye Movement Patterns in Face Recognition are Associated with Better Performance and more Top-down Control of Visual Attention: an fMRI Study.
Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016

Hidden Markov Modeling of eye movements with image information leads to better discovery of regions of interest.
Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016

2015
Enhanced Figure-Ground Classification With Background Prior Propagation.
IEEE Trans. Image Process., 2015

Leveraging Long-Term Predictions and Online Learning in Agent-Based Multiple Person Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2015

A Scalable and Accurate Descriptor for Dynamic Textures Using Bag of System Trees.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Heterogeneous Multi-task Learning for Human Pose Estimation with Deep Convolutional Neural Network.
Int. J. Comput. Vis., 2015

FlexyFont: Learning Transferring Rules for Flexible Typeface Synthesis.
Comput. Graph. Forum, 2015

An SVD-based Multimodal Clustering method for Social Event Detection.
Proceedings of the 31st IEEE International Conference on Data Engineering Workshops, 2015

Small instance detection by integer programming on object density maps.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Hidden Markov model analysis reveals better eye movement strategies in face recognition.
Proceedings of the 37th Annual Meeting of the Cognitive Science Society, 2015

Eye Movement Pattern in Face Recognition is Associated with Cognitive Decline in the Elderly.
Proceedings of the 37th Annual Meeting of the Cognitive Science Society, 2015

2014
Look over here: attention-directing composition of manga elements.
ACM Trans. Graph., 2014

A Robust Likelihood Function for 3D Human Pose Tracking.
IEEE Trans. Image Process., 2014

Clustering hidden Markov models with variational HEM.
J. Mach. Learn. Res., 2014

A Robust Panel Extraction Method for Manga.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Joint Motion Segmentation and Background Estimation in Dynamic Scenes.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Look Closely: Learning Exemplar Patches for Recognizing Textiles from Product Images.
Proceedings of the Computer Vision - ACCV 2014, 2014

3D Human Pose Estimation from Monocular Images with Deep Convolutional Neural Network.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
A Bag of Systems Representation for Music Auto-Tagging.
IEEE ACM Trans. Audio Speech Lang. Process., 2013

Clustering Dynamic Textures with the Hierarchical EM Algorithm for Modeling Video.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

On Approximate Inference for Generalized Gaussian Process Models.
CoRR, 2013

That was fast! Speeding up NN search of high dimensional distributions.
Proceedings of the 30th International Conference on Machine Learning, 2013

Objective measures of IS usage behavior under conditions of experience and pressure using eye fixation data.
Proceedings of the International Conference on Information Systems, 2013

Crossing the Line: Crowd Counting by Integer Programming with Local Features.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Understanding eye movements in face recognition with hidden Markov model.
Proceedings of the 35th Annual Meeting of the Cognitive Science Society, 2013

Surveillance of Crowded Environments: Modeling the Crowd by Its Global Properties.
Proceedings of the Modeling, Simulation and Visual Analysis of Crowds, 2013

2012
Automatic stylistic manga layout.
ACM Trans. Graph., 2012

Counting People With Low-Level Features and Bayesian Regression.
IEEE Trans. Image Process., 2012

The variational hierarchical EM algorithm for clustering hidden Markov models.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Multivariate Autoregressive Mixture Models for Music Auto-Tagging.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Growing a bag of systems tree for fast and accurate classification.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Adaptive figure-ground classification.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Time Series Models for Semantic Music Annotation.
IEEE Trans. Speech Audio Process., 2011

Generalized Stauffer-Grimson background subtraction for dynamic scenes.
Mach. Vis. Appl., 2011

Tech Report A Variational HEM Algorithm for Clustering Hidden Markov Models
CoRR, 2011

Genre Classification and the Invariance of MFCC Features to Key and Tempo.
Proceedings of the Advances in Multimedia Modeling, 2011

Generalized Gaussian process models.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Modeling Music as a Dynamic Texture.
IEEE Trans. Speech Audio Process., 2010

Automatic Music Tagging With Time Series Models.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Clustering dynamic textures with the hierarchical EM algorithm.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Layered Dynamic Textures.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Bayesian Poisson regression for crowd counting.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Dynamic texture models of music.
Proceedings of the IEEE International Conference on Acoustics, 2009

Variational layered dynamic textures.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
Beyond dynamic textures : a family of stochastic dynamical models for video with applications to computer vision.
PhD thesis, 2008

Modeling, Clustering, and Segmenting Video with Mixtures of Dynamic Textures.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Privacy preserving crowd monitoring: Counting people without people models or tracking.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Supervised Learning of Semantic Classes for Image Annotation and Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2007

Direct convex relaxations of sparse SVM.
Proceedings of the Machine Learning, 2007

Audio Information Retrieval using Semantic Similarity.
Proceedings of the IEEE International Conference on Acoustics, 2007

Classifying Video with Kernel Dynamic Textures.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
On measuring the change in size of pulmonary nodules.
IEEE Trans. Medical Imaging, 2006

2005
Mixtures of Dynamic Textures.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Probabilistic Kernels for the Classification of Auto-Regressive Visual Processes.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005


  Loading...