Xiangming Gu

ORCID: 0000-0003-0637-8664

According to our database, Xiangming Gu authored at least 15 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing.
ACM Trans. Multim. Comput. Commun. Appl., July, 2024

Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Disentangled Adversarial Domain Adaptation for Phonation Mode Detection in Singing and Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

On Memorization in Diffusion Models.
CoRR, 2023

Deep Audio-Visual Singing Voice Transcription based on Self-Supervised Learning Models.
CoRR, 2023

Elucidate Gender Fairness in Singing Voice Transcription.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
Unsupervised Mismatch Localization in Cross-Modal Sequential Data with Application to Mispronunciations Localization.
Trans. Mach. Learn. Res., 2022

Boosting Monocular 3D Human Pose Estimation With Part Aware Attention.
IEEE Trans. Image Process., 2022

Towards Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription.
CoRR, 2022

Unsupervised Mismatch Localization in Cross-Modal Sequential Data.
CoRR, 2022

Extrapolative Continuous-time Bayesian Neural Network for Fast Training-free Test-time Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MM-ALT: A Multimodal Automatic Lyric Transcription System.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

2021
Laser Endoscopic Manipulator Using Spring-Reinforced Multi-DoF Soft Actuator.
IEEE Robotics Autom. Lett., October, 2021

2020
Distilling a Deep Neural Network into a Takagi-Sugeno-Kang Fuzzy Inference System.
CoRR, 2020
