Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching.
CoRR, 2024
Text-to-Song: Towards Controllable Music Generation Incorporating Vocals and Accompaniment.
CoRR, 2024
MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Text-to-Song: Towards Controllable Music Generation Incorporating Vocal and Accompaniment.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Pixel Exclusion: Uncertainty-aware Boundary Discovery for Active Cross-Domain Semantic Segmentation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Learning a Weighted Classifier for Conditional Domain Adaptation.
Knowl. Based Syst., 2021
Domain Adaptive Semantic Segmentation without Source Data.
CoRR, 2021
Test-time Batch Statistics Calibration for Covariate Shift.
CoRR, 2021
Domain Adaptive Semantic Segmentation without Source Data.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Multi-source domain adaptation with graph embedding and adaptive label prediction.
Inf. Process. Manag., 2020