Yu Kong

Orcid: 0000-0001-6271-4082

Affiliations:
  • Michigan State University, East Lansing, MI, USA
  • Rochester Institute of Technology, NY, USA (former)
  • State University of New York at Buffalo, NY, USA (former)
  • Northeastern University, Boston, MA, USA (former)
  • Beijing Institute of Technology, Institute of Automation, National Laboratory of Pattern Recognition, China (former)


According to our database, Yu Kong authored at least 83 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
SHINE: Saliency-aware HIerarchical NEgative Ranking for Compositional Temporal Grounding.
CoRR, 2024

The Wolf Within: Covert Injection of Malice into MLLM Societies via an MLLM Operative.
CoRR, 2024

A Survey of Multimodal Sarcasm Detection.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Facial Affective Behavior Analysis with Instruction Tuning.
Proceedings of the Computer Vision - ECCV 2024, 2024

Learning to Localize Actions in Instructional Videos with LLM-Based Multi-pathway Text-Video Alignment.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
From Ensemble Clustering to Subspace Clustering: Cluster Structure Encoding.
IEEE Trans. Neural Networks Learn. Syst., May, 2023

Latent Space Energy-based Model for Fine-grained Open Set Recognition.
CoRR, 2023

On Model Explanations with Transferable Neural Pathways.
CoRR, 2023

Prompting Language-Informed Distribution for Compositional Zero-Shot Learning.
CoRR, 2023

Ancestor Search: Generalized Open Set Recognition via Hyperbolic Side Information Learning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

ATM: Action Temporality Modeling for Video Question Answering.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Uncertainty-aware State Space Transformer for Egocentric 3D Hand Trajectory Forecasting.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Catch Missing Details: Image Reconstruction with Frequency Augmented Variational Autoencoder.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Human Action Recognition and Prediction: A Survey.
Int. J. Comput. Vis., 2022

An Eye for an Eye: Defending against Gradient-based Attacks with Gradients.
CoRR, 2022

Universal 3-Dimensional Perturbations for Black-Box Attacks on Video Recognition Systems.
Proceedings of the 43rd IEEE Symposium on Security and Privacy, 2022

Learning of Global Objective for Network Flow in Multi-Object Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

GateHUB: Gated History Unit with Background Suppression for Online Action Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

OpenTAL: Towards Open Set Temporal Action Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

A Dynamic Meta-Learning Model for Time-Sensitive Cold-Start Recommendations.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Accurate and Fast Image Denoising via Attention Guided Scaling.
IEEE Trans. Image Process., 2021

Residual Dense Network for Image Restoration.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Coupling Adversarial Graph Embedding for transductive zero-shot action recognition.
Neurocomputing, 2021

Revealing a history: palimpsest text separation with generative networks.
Int. J. Document Anal. Recognit., 2021

Adversarial Memory Networks for Action Prediction.
CoRR, 2021

Multiple Instance Relational Learning for Video Anomaly Detection.
Proceedings of the International Joint Conference on Neural Networks, 2021

Explainable Video Entailment with Grounded Visual Evidence.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Evidential Deep Learning for Open Set Action Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

DRIVE: Deep Reinforced Accident Anticipation with Visual Explanation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Gradient Frequency Modulation for Visually Explaining Video Understanding Models.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Visual Object Tracking Via Multi-Stream Deep Similarity Learning Networks.
IEEE Trans. Image Process., 2020

Aligned Dynamic-Preserving Embedding for Zero-Shot Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2020

Semi-Supervised Cross-Modality Action Recognition by Latent Tensor Transfer Learning.
IEEE Trans. Circuits Syst. Video Technol., 2020

Adversarial Action Prediction Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Activity-driven Weakly-Supervised Spatio-Temporal Grounding from Untrimmed Videos.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Uncertainty-based Traffic Accident Anticipation with Spatio-Temporal Relational Learning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Object-Aware Centroid Voting for Monocular 3D Object Detection.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Few-shot Human Motion Prediction via Learning Novel Motion Dynamics.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Privacy Attributes-aware Message Passing Neural Network for Visual Privacy Attributes Classification.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Publishing Video Data with Indistinguishable Objects.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

Group Activity Prediction with Sequential Relational Anticipation Model.
Proceedings of the Computer Vision - ECCV 2020, 2020

RIT-18: A Novel Dataset for Compositional Group Activity Understanding.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Deep Geo-Constrained Auto-Encoder for Non-Landmark GPS Estimation.
IEEE Trans. Big Data, 2019

2018
Probabilistic Low-Rank Multitask Learning.
IEEE Trans. Neural Networks Learn. Syst., 2018

Hierarchical and Spatio-Temporal Sparse Representation for Human Action Recognition.
IEEE Trans. Image Process., 2018

Clustered Lifelong Learning Via Representative Task Selection.
Proceedings of the IEEE International Conference on Data Mining, 2018

Residual Dense Network for Image Super-Resolution.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Action Prediction From Videos via Memorizing Hard-to-Predict Samples.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Deeply Learned View-Invariant Features for Cross-View Action Recognition.
IEEE Trans. Image Process., 2017

Max-Margin Heterogeneous Information Machine for RGB-D Action Recognition.
Int. J. Comput. Vis., 2017

Deep Active Learning Through Cognitive Information Parcels.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Multi-Stream Deep Similarity Learning Networks for Visual Tracking.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Deep Sequential Context Networks for Action Prediction.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Sparse Subspace Clustering by Learning Approximation ℓ0 Codes.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Learning Fast Low-Rank Projection for Image Classification.
IEEE Trans. Image Process., 2016

Discriminative Relational Representation Learning for RGB-D Action Recognition.
IEEE Trans. Image Process., 2016

Close Human Interaction Recognition Using Patch-Aware Models.
IEEE Trans. Image Process., 2016

Efficient Image Geotagging Using Large Databases.
IEEE Trans. Big Data, 2016

Max-Margin Action Prediction Machine.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Learning hierarchical 3D kernel descriptors for RGB-D action recognition.
Comput. Vis. Image Underst., 2016

Deep Convolutional Neural Network with Independent Softmax for Large Scale Face Recognition.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

2015
Hierarchical 3D kernel descriptors for action recognition using depth sequences.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Bilinear heterogeneous information machine for RGB-D action recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Interactive Phrases: Semantic Descriptions for Human Interaction Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Recognising human interaction from videos by a discriminative model.
IET Comput. Vis., 2014

Learning a discriminative mid-level feature for action recognition.
Sci. China Inf. Sci., 2014

Latent Tensor Transfer Learning for RGB-D Action Recognition.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

LASOM: Location Aware Self-Organizing Map for discovering similar and unique visual features of geographical locations.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

A Discriminative Model with Multiple Temporal Scales for Action Prediction.
Proceedings of the Computer Vision - ECCV 2014, 2014

Modeling Supporting Regions for Close Human Interaction Recognition.
Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

2013
Activity recognition by learning structural and pairwise mid-level features using random forest.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

2012
Decomposed contour prior for shape recognition.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Action recognition with discriminative mid-level features.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

A Hierarchical Model for Human Interaction Recognition.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Learning Human Interaction by Interactive Phrases.
Proceedings of the Computer Vision - ECCV 2012, 2012

Contour-HOG: A Stub Feature based Level Set Method for Learning Object Contour.
Proceedings of the British Machine Vision Conference, 2012

2011
Adaptive learning codebook for action recognition.
Pattern Recognit. Lett., 2011

Recognizing human interaction by multiple features.
Proceedings of the First Asian Conference on Pattern Recognition, 2011

2010
Compact visual codebook for action recognition.
Proceedings of the International Conference on Image Processing, 2010

A swarm intelligence based searching strategy for articulated 3D human body tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Group Action Recognition Using Space-Time Interest Points.
Proceedings of the Advances in Visual Computing, 5th International Symposium, 2009

Learning Group Activity in Soccer Videos from Local Motion.
Proceedings of the Computer Vision, 2009

2008
Group action recognition in soccer videos.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

