2024
Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing.
ACM Trans. Multim. Comput. Commun. Appl., July, 2024

When Attention Sink Emerges in Language Models: An Empirical View.
CoRR, 2024

On Calibration of LLM-based Guard Models for Reliable Content Moderation.
CoRR, 2024

Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Disentangled Adversarial Domain Adaptation for Phonation Mode Detection in Singing and Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

On Memorization in Diffusion Models.
CoRR, 2023

Deep Audio-Visual Singing Voice Transcription based on Self-Supervised Learning Models.
CoRR, 2023

Elucidate Gender Fairness in Singing Voice Transcription.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
Unsupervised Mismatch Localization in Cross-Modal Sequential Data with Application to Mispronunciations Localization.
Trans. Mach. Learn. Res., 2022

Boosting Monocular 3D Human Pose Estimation With Part Aware Attention.
IEEE Trans. Image Process., 2022

Towards Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription.
CoRR, 2022

Unsupervised Mismatch Localization in Cross-Modal Sequential Data.
CoRR, 2022

Extrapolative Continuous-time Bayesian Neural Network for Fast Training-free Test-time Adaptation.
Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MM-ALT: A Multimodal Automatic Lyric Transcription System.
Proceedings of MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

2021
Laser Endoscopic Manipulator Using Spring-Reinforced Multi-DoF Soft Actuator.
IEEE Robotics Autom. Lett., October, 2021

2020
Distilling a Deep Neural Network into a Takagi-Sugeno-Kang Fuzzy Inference System.
CoRR, 2020