Kai Wang

Orcid: 0000-0002-1154-5175

Affiliations:
  • National University of Singapore, School of Computing, Singapore
  • Alibaba Group, DAMO Acadmey, Hangzhou, China
  • Chinese Academy of Sciences, Shenzhen Institutes of Advanced Technology, China


According to our database1, Kai Wang authored at least 86 papers between 2017 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Region Generation and Assessment Network for Occluded Person Re-Identification.
IEEE Trans. Inf. Forensics Secur., 2024

HARWE: A multi-modal large-scale dataset for context-aware human activity recognition in smart working environments.
Pattern Recognit. Lett., 2024

Dynamic Diffusion Transformer.
CoRR, 2024

Real-Time Video Generation with Pyramid Attention Broadcast.
CoRR, 2024

Prioritize Alignment in Dataset Distillation.
CoRR, 2024

More Than Positive and Negative: Communicating Fine Granularity in Medical Diagnosis.
CoRR, 2024

Conditional LoRA Parameter Generation.
CoRR, 2024

Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning.
CoRR, 2024

Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality.
CoRR, 2024

AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation.
CoRR, 2024

A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training.
CoRR, 2024

Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond.
CoRR, 2024

Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation.
CoRR, 2024

DynST: Dynamic Sparse Training for Resource-Constrained Spatio-Temporal Forecasting.
CoRR, 2024

Neural Network Diffusion.
CoRR, 2024

Two Trades is not Baffled: Condensing Graph via Crafting Rational Gradient Matching.
CoRR, 2024

Must: Maximizing Latent Capacity of Spatial Transcriptomics Data.
CoRR, 2024

The Snowflake Hypothesis: Training and Powering GNN with One Node One Receptive Field.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Navigating Complexity: Toward Lossless Graph Condensation via Expanding Window Matching.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Two Heads Are Better Than One: Boosting Graph Sparse Training via Semantic and Topological Awareness.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data Augmentation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Can We Evaluate Domain Adaptation Models Without Target-Domain Labels?
Proceedings of the Twelfth International Conference on Learning Representations, 2024

NuwaDynamics: Discovering and Updating in Causal Spatio-Temporal Modeling.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MOMA: Mixture-of-Modality-Adaptations for Transferring Knowledge from Image Models Towards Efficient Audio-Visual Action Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024


Towards Efficient Audio-Visual Learners via Empowering Pre-trained Vision Transformers with Cross-Modal Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ATOM: Attention Mixer for Efficient Dataset Distillation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Summarizing Stream Data for Memory-Constrained Online Continual Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
MLLMs-Augmented Visual-Language Representation Learning.
CoRR, 2023

DREAM+: Efficient Dataset Distillation by Bidirectional Representative Matching.
CoRR, 2023

Can pre-trained models assist in dataset distillation?
CoRR, 2023

Boosting Unsupervised Contrastive Learning Using Diffusion-Based Data Augmentation From Scratch.
CoRR, 2023

Color Prompting for Data-Free Continual Unsupervised Domain Adaptive Person Re-Identification.
CoRR, 2023

The Snowflake Hypothesis: Training Deep GNN with One Node One Receptive field.
CoRR, 2023

Evidential Detection and Tracking Collaboration: New Problem, Benchmark and Algorithm for Robust Anti-UAV System.
CoRR, 2023

Summarizing Stream Data for Memory-Restricted Online Continual Learning.
CoRR, 2023

The 3rd Anti-UAV Workshop & Challenge: Methods and Results.
CoRR, 2023

InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning.
CoRR, 2023

DiM: Distilling Dataset into Generative Model.
CoRR, 2023

Expanding Small-Scale Datasets with Guided Imagination.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Does Graph Distillation See Like Vision Dataset Counterpart?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Divide to Adapt: Mitigating Confirmation Bias for Domain Adaptation of Black-Box Predictors.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Dataset Quantization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DREAM: Efficient Dataset Distillation by Representative Matching.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Spatio-Temporal Decomposition Network for Compressed Video Quality Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

BiCro: Noisy Correspondence Rectification for Multi-modality Data via Bi-directional Cross-modal Similarity Consistency.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SEformer: Dual-Path Conformer Neural Network is a Good Speech Denoiser.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Prompt Vision Transformer for Domain Generalization.
CoRR, 2022

Architecture-Agnostic Masked Image Modeling - From ViT back to CNN.
CoRR, 2022

FaceMAE: Privacy-Preserving Face Recognition via Masked Autoencoders.
CoRR, 2022

Reliable Label Correction is a Good Booster When Learning with Extremely Noisy Labels.
CoRR, 2022

Crafting Better Contrastive Views for Siamese Representation Learning.
CoRR, 2022

Dataset Distillation via Factorization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DLME: Deep Local-Flatness Manifold Embedding.
Proceedings of the Computer Vision - ECCV 2022, 2022

Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CAFE: Learning to Condense Dataset by Aligning Features.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

An Efficient Training Approach for Very Large Scale Face Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Crafting Better Contrastive Views for Siamese Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
GarbageNet: A Unified Learning Framework for Robust Garbage Classification.
IEEE Trans. Artif. Intell., 2021

Brain MRI super-resolution using coupled-projection residual network.
Neurocomputing, 2021

Align Yourself: Self-supervised Pre-training for Fine-grained Recognition via Saliency Alignment.
CoRR, 2021

An Efficient Training Approach for Very Large Scale Face Recognition.
CoRR, 2021

Learning to Cluster Faces via Transformer.
CoRR, 2021

Mask Aware Network for Masked Face Recognition in the Wild.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

2020
Advancing Image Understanding in Poor Visibility Environments: A Collective Benchmark Study.
IEEE Trans. Image Process., 2020

Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition.
IEEE Trans. Image Process., 2020

AU-Guided Unsupervised Domain Adaptive Facial Expression Recognition.
CoRR, 2020

Learning Discriminative Representation For Facial Expression Recognition From Uncertainties.
Proceedings of the IEEE International Conference on Image Processing, 2020

Suppressing Mislabeled Data via Grouping and Self-attention.
Proceedings of the Computer Vision - ECCV 2020, 2020

Suppressing Uncertainties for Large-Scale Facial Expression Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Multiple Transfer Learning and Multi-label Balanced Training Strategies for Facial AU Detection In the Wild.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Interactive Dual Generative Adversarial Networks for Image Captioning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Coupled-Projection Residual Network for MRI Super-Resolution.
CoRR, 2019

Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition.
Proceedings of the International Conference on Multimodal Interaction, 2019

Bootstrap Model Ensemble and Rank Loss for Engagement Intensity Regression.
Proceedings of the International Conference on Multimodal Interaction, 2019

Exploring Regularizations with Face, Body and Image Cues for Group Cohesion Prediction.
Proceedings of the International Conference on Multimodal Interaction, 2019

Frame Attention Networks for Facial Expression Recognition in Videos.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Multi-Modal Face Anti-Spoofing Attack Detection Challenge at CVPR2019.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Deep Recurrent Multi-instance Learning with Spatio-temporal Features for Engagement Intensity Prediction.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Cascade Attention Networks For Group Emotion Recognition with Face, Body and Image Cues.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

2017
Group emotion recognition with individual facial emotion CNNs and global image based CNNs.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017


  Loading...