Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing.
ACM Trans. Multim. Comput. Commun. Appl., July, 2024
When Attention Sink Emerges in Language Models: An Empirical View.
CoRR, 2024
On Calibration of LLM-based Guard Models for Reliable Content Moderation.
CoRR, 2024
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Disentangled Adversarial Domain Adaptation for Phonation Mode Detection in Singing and Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
On Memorization in Diffusion Models.
CoRR, 2023
Deep Audio-Visual Singing Voice Transcription based on Self-Supervised Learning Models.
CoRR, 2023
Elucidate Gender Fairness in Singing Voice Transcription.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Unsupervised Mismatch Localization in Cross-Modal Sequential Data with Application to Mispronunciations Localization.
Trans. Mach. Learn. Res., 2022
Boosting Monocular 3D Human Pose Estimation With Part Aware Attention.
IEEE Trans. Image Process., 2022
Towards Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription.
CoRR, 2022
Unsupervised Mismatch Localization in Cross-Modal Sequential Data.
CoRR, 2022
Extrapolative Continuous-time Bayesian Neural Network for Fast Training-free Test-time Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
MM-ALT: A Multimodal Automatic Lyric Transcription System.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
Laser Endoscopic Manipulator Using Spring-Reinforced Multi-DoF Soft Actuator.
IEEE Robotics Autom. Lett., October, 2021
Distilling a Deep Neural Network into a Takagi-Sugeno-Kang Fuzzy Inference System.
CoRR, 2020