Kele Xu

Orcid: 0000-0001-5997-5169

According to our database1, Kele Xu authored at least 141 papers between 2014 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning.
IEEE Trans. Artif. Intell., November, 2024

SIAT: Document-level Event Extraction via Spatiality-Augmented Interaction Model with Adaptive Thresholding.
ACM Trans. Asian Low Resour. Lang. Inf. Process., October, 2024

Bidirectional Influence and Interaction for Multiagent Reinforcement Learning.
IEEE Trans. Artif. Intell., October, 2024

D-FaST: Cognitive Signal Decoding With Disentangled Frequency-Spatial-Temporal Attention.
IEEE Trans. Cogn. Dev. Syst., August, 2024

Nuclear Norm Maximization-Based Curiosity-Driven Reinforcement Learning.
IEEE Trans. Artif. Intell., May, 2024

Dynamic Memory-Based Curiosity: A Bootstrap Approach for Exploration in Reinforcement Learning.
IEEE Trans. Emerg. Top. Comput. Intell., April, 2024

Enhanced Cross-Modal Transformer Model for Video Semantic Similarity Measurement.
IEEE Trans. Circuits Syst. II Express Briefs, January, 2024

Automated Data Augmentation for Audio Classification.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Self-Supervised Learning-For Underwater Acoustic Signal Classification With Mixup.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

Less confidence, less forgetting: Learning with a humbler teacher in exemplar-free Class-Incremental learning.
Neural Networks, 2024

Learning incremental audio-visual representation for continual multimodal understanding.
Knowl. Based Syst., 2024

Exploring structure diversity in atomic resolution microscopy with graph neural networks.
CoRR, 2024

AutoFeedback: An LLM-based Framework for Efficient and Accurate API Request Generation.
CoRR, 2024

Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models.
CoRR, 2024

D-FaST: Cognitive Signal Decoding with Disentangled Frequency-Spatial-Temporal Attention.
CoRR, 2024

QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge.
CoRR, 2024

Online Self-Preferring Language Models.
CoRR, 2024

NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results.
CoRR, 2024

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report.
CoRR, 2024

NTIRE 2024 Challenge on Image Super-Resolution (⨉4): Methods and Results.
CoRR, 2024

Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles.
CoRR, 2024

Trusted multi-scale classification framework for whole slide image.
Biomed. Signal Process. Control., 2024

MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

MRAC'24 Track 2: 2nd International Workshop on Multimodal and Responsible Affective Computing.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

THE-FD: Task Hierarchical Emotion-aware for Fake Detection.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

Demonstrative Instruction Following in Multimodal LLMs via Integrating Low-Rank Adaptation with Ensemble Learning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Tracing Training Progress: Dynamic Influence Based Selection for Active Learning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Higher-Order Vision-Language Alignment for Social Media Prediction.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Enhancing Unsupervised Visible-Infrared Person Re-Identification with Bidirectional-Consistency Gradual Matching.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Contrastive Learning-based Chaining-Cluster for Multilingual Voice-Face Association.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Iterative Regularized Policy Optimization with Imperfect Demonstrations.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Self-Supervised Learning-Based General Fine-tuning Framework For Audio Classification and Event Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Voice-to-Face Generation: Couple of Self-Supervised Representation Learning with Diffusion Model.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Transformer-Inspired Lightweight Model for Efficient Time Series Forecasting.
Proceedings of the IEEE International Conference on Acoustics, 2024

Temporal Inconsistency-Based Active Learning.
Proceedings of the IEEE International Conference on Acoustics, 2024

Nuclear-Norm Maximization for Low-Rank Updates.
Proceedings of the IEEE International Conference on Acoustics, 2024

Adapter-Based Incremental Learning for Face Forgery Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024

Weight Light, Hear Right: Heart Sound Classification with a Low-Complexity Model.
Proceedings of the 32nd European Signal Processing Conference, 2024

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024






Optimistic Model Rollouts for Pessimistic Offline Policy Optimization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Multi-task Pre-training Language Model for Semantic Network Completion.
ACM Trans. Asian Low Resour. Lang. Inf. Process., November, 2023

Automatic Audio Augmentation for Requests Sub-Challenge.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

IFS-SED: Incremental Few-Shot Sound Event Detection Using Explicit Learning and Calibration.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

VTQAGen: BART-based Generative Model For Visual Text Question Answering.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Temporal Inconsistency-Based Intrinsic Reward for Multi-Agent Reinforcement Learning.
Proceedings of the International Joint Conference on Neural Networks, 2023

DGSNet: Dual Graph Structure Network for Emotion Recognition in Multimodal Conversations.
Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence, 2023

Cheap-Fake Detection with LLM Using Prompt Engineering.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2023

Diversifying Message Aggregation in Multi-Agent Communication Via Normalized Tensor Nuclear Norm Regularization.
Proceedings of the IEEE International Conference on Acoustics, 2023

Raw Ultrasound-Based Phonetic Segments Classification Via Mask Modeling.
Proceedings of the IEEE International Conference on Acoustics, 2023

Progressive Diversifying Policy for Multi-Agent Reinforcement Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Complementary Learning System Based Intrinsic Reward in Reinforcement Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
WRMatch: Improving FixMatch With Weighted Nuclear-Norm Regularization for Few-Shot Remote Sensing Scene Classification.
IEEE Trans. Geosci. Remote. Sens., 2022

Multi-representation knowledge distillation for audio classification.
Multim. Tools Appl., 2022

Focus on Hard Categories and Hard Examples: Remote Sensing Image Scene Classification via Expert Model and Hard Example Mining.
IEEE Geosci. Remote. Sens. Lett., 2022

HEROHE Challenge: Predicting HER2 Status in Breast Cancer from Hematoxylin-Eosin Whole-Slide Imaging.
J. Imaging, 2022

Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning.
CoRR, 2022

Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration.
CoRR, 2022

Wound Segmentation with Dynamic Illumination Correction and Dual-view Semantic Fusion.
CoRR, 2022

Trusted Multi-Scale Classification Framework for Whole Slide Image.
CoRR, 2022

Nuclear Norm Maximization Based Curiosity-Driven Learning.
CoRR, 2022

DFAID: Density-aware and feature-deviated active intrusion detection over network traffic streams.
Comput. Secur., 2022

VSM: A Versatile Semi-supervised Model for Multi-modal Cell Instance Segmentation.
Proceedings of The Cell Segmentation Challenge in Multi-modality High-Resolution Microscopy Images, 2022

Masked Modeling-based Audio Representation for ACM Multimedia 2022 Computational Paralinguistics ChallengE.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Seeing Speech: Magnetic Resonance Imaging-Based Vocal Tract Deformation Visualization Using Cross-Modal Transformer.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Multiple Temporal Fusion based Weakly-supervised Pre-training Techniques for Video Categorization.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Understanding and Predicting Docker Build Duration: An Empirical Study of Containerized Workflow of OSS Projects.
Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering, 2022

Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

An Emotion Evolution Network for Emotion Recognition in Conversation.
Proceedings of the 34th IEEE International Conference on Tools with Artificial Intelligence, 2022

FINT: Field-Aware Interaction Neural Network for Click-Through Rate Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2022

Improving the Classification of Phonetic Segments from Raw Ultrasound Using Self-Supervised Learning and Hard Example Mining.
Proceedings of the IEEE International Conference on Acoustics, 2022



EasySED: Trusted Sound Event Detection with Self-Distillation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
MoNuSAC2020: A Multi-Organ Nuclei Segmentation and Classification Challenge.
IEEE Trans. Medical Imaging, 2021

KnowRU: Knowledge Reuse via Knowledge Distillation in Multi-Agent Reinforcement Learning.
Entropy, 2021

HEROHE Challenge: assessing HER2 status in breast cancer without immunohistochemistry or in situ hybridization.
CoRR, 2021

Multimodal Feature Fusion for Video Advertisements Tagging Via Stacking Ensemble.
CoRR, 2021

FINT: Field-aware INTeraction Neural Network For CTR Prediction.
CoRR, 2021

KnowSR: Knowledge Sharing among Homogeneous Agents in Multi-agent Reinforcement Learning.
CoRR, 2021

NTIRE 2021 Challenge on Perceptual Image Quality Assessment.
CoRR, 2021

KnowRU: Knowledge Reusing via Knowledge Distillation in Multi-agent Reinforcement Learning.
CoRR, 2021

Convolutional Neural Network-Based Age Estimation Using B-Mode Ultrasound Tongue Image.
CoRR, 2021

Two-Stage COVID-19 Lung Segmentation from CT Images by Integrating Rib Outlining and Contour Refinement.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

Squeeze-and-Excitation network-Based Radar Object Detection With Weighted Location Fusion.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Batch Weighted Nuclear-Norm Minimization for Medical Image Sequence Segmentation.
Proceedings of the Bioinformatics Research and Applications - 17th International Symposium, 2021

Multi-Actor-Attention-Critic Reinforcement Learning for Central Place Foraging Swarms.
Proceedings of the International Joint Conference on Neural Networks, 2021

Semi-supervised medical image classification based on CamMix.
Proceedings of the International Joint Conference on Neural Networks, 2021

Improving Ultrasound Tongue Contour Extraction Using U-Net and Shape Consistency-Based Regularizer.
Proceedings of the IEEE International Conference on Acoustics, 2021

Fden: Mining Effective Information of Features in Detecting Network Anomalies.
Proceedings of the IEEE International Conference on Acoustics, 2021

Cross Modality Knowledge Distillation for Multi-Modal Aerial View Object Classification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020
Audio Tagging by Cross Filtering Noisy Labels.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Collaborative deep learning across multiple data centers.
Sci. China Inf. Sci., 2020

NiCad+: Speeding the Detecting Process of NiCad.
Proceedings of the 14th IEEE International Conference on Service Oriented Systems Engineering, 2020

Multimodal Deep Learning for Social Media Popularity Prediction With Attention Mechanism.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multi-Scale Generalized Attention-Based Regional Maximum Activation of Convolutions for Beauty Product Retrieval.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

A Quantitative Comparison of Different Machine Learning Approaches for Human Spermatozoa Quality Prediction Using Multimodal Datasets.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Quantification of Transducer Misalignment in Ultrasound Tongue Imaging.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020


2019
Multi Model-Based Distillation for Sound Event Detection.
IEICE Trans. Inf. Syst., 2019

Learning to Cooperate via an Attention-Based Communication Neural Network in Decentralized Multi-Robot Exploration.
Entropy, 2019

Multistructure-Based Collaborative Online Distillation.
Entropy, 2019

Attention-based Fault-tolerant Approach for Multi-agent Reinforcement Learning Systems.
CoRR, 2019

FoxNet: A Multi-face Alignment Method.
CoRR, 2019

Predicting tongue motion in unlabeled ultrasound videos using convolutional LSTM neural network.
CoRR, 2019

Improving Fast Adaptation for Newcomers in Multi-Robot Reinforcement Learning System.
Proceedings of the 2019 IEEE SmartWorld, 2019

Learning to Communicate Efficiently with Group Division in Decentralized Multi-agent Cooperation.
Proceedings of the 13th IEEE International Conference on Service-Oriented System Engineering, 2019

Mixup Based Privacy Preserving Mixed Collaboration Learning.
Proceedings of the 13th IEEE International Conference on Service-Oriented System Engineering, 2019

Ultrasound-Based Silent Speech Interface using Sequential Convolutional Auto-encoder.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

A Quantitative Analysis Platform for PD-L1 Immunohistochemistry based on Point-level Supervision Model.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

A Mobile Application for Sound Event Detection.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Predicting Tongue Motion in Unlabeled Ultrasound Videos Using Convolutional Lstm Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

Denoising Convolutional Autoencoder Based B-mode Ultrasound Tongue Image Feature Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Learning data augmentation policies using augmented random search.
CoRR, 2018

Weakly supervised CRNN system for sound event detection with large-scale unlabeled in-domain data.
CoRR, 2018

General audio tagging with ensembling convolutional neural network and statistical features.
CoRR, 2018

Collaborative Deep Learning Across Multiple Data Centers.
CoRR, 2018

Environmental Sound Classification Based on Multi-temporal Resolution CNN Network Combining with Multi-level Features.
CoRR, 2018

Sample Dropout for Audio Scene Classification Using Multi-scale Dense Connected Convolutional Neural Network.
Proceedings of the Knowledge Management and Acquisition for Intelligent Systems, 2018

Environmental Sound Classification Based on Multi-temporal Resolution Convolutional Neural Network Combining with Multi-level Features.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Mixup-Based Acoustic Scene Classification Using Multi-channel Convolutional Neural Network.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Large-Scale Whale Call Classification Using Deep Convolutional Neural Network Architectures.
Proceedings of the 2018 IEEE International Conference on Signal Processing, 2018

Multi-scale DenseNet-Based Electricity Theft Detection.
Proceedings of the Intelligent Computing Theories and Application, 2018

Meta learning based audio tagging.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Sample mixed-based data augmentation for domestic audio tagging.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017
Full-reference image quality assessment-based B-mode ultrasound image similarity measure.
CoRR, 2017

2016
3D tongue motion visualization based on the B-mode ultrasound tongue images. (Visualisation tridimensionnelle de la langue basée sur des séquences d'image échographique en mode-B).
PhD thesis, 2016

Tongue contour extraction from ultrasound images based on deep neural network.
CoRR, 2016

An Articulatory-Based Singing Voice Synthesis Using Tongue and Lips Imaging.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Contour-based 3D tongue motion visualization using ultrasound image sequences.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

A Novel Human Interaction Game-like Application to Learn, Perform and Evaluate Modern Contemporary Singing - "Human Beat Box".
Proceedings of the VISAPP 2015, 2015

A Novel Hybrid Mobile Malware Detection System Integrating Anomaly Detection With Misuse Detection.
Proceedings of the 6th International Workshop on Mobile Cloud Computing and Services, 2015

Development of a 3D tongue motion visualization platform based on ultrasound image sequences.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015

Tongue contour extraction from ultrasound images based on deep neural network.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015

2014
3d tongue motion visualization based on ultrasound image sequences.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

An educational platform to capture, visualize and analyze rare singing.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014


  Loading...