Xiaoxue Gao

Nancy F. Chen

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023

The single- and dual-brain mechanisms underlying the adviser's confidence expression strategy switching during influence management.

[BibT_eX]

[DOI]

NeuroImage, April, 2023

PoLyScriber: Integrated Fine-Tuning of Extractor and Lyrics Transcriber for Polyphonic Music.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Token2vec: A Joint Self-Supervised Pre-Training Framework Using Unpaired Speech and Text.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Self-Transriber: Few-Shot Lyrics Transcription With Self-Training.

[BibT_eX]

[DOI]

Xianghu Yue

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Automatic Lyrics Transcription of Polyphonic Music With Lyrics-Chord Multi-Task Learning.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Note on Path-Connectivity of Complete Bipartite Graphs.

[BibT_eX]

[DOI]

Shasha Li

Yan Zhao

J. Interconnect. Networks, 2022

PoLyScribers: Joint Training of Vocal Extractor and Lyrics Transcriber for Polyphonic Music.

[BibT_eX]

[DOI]

CoRR, 2022

k-Path-Connectivity of Completely Balanced Tripartite Graphs.

[BibT_eX]

[DOI]

Pi Wang

Shasha Li

Axioms, 2022

Genre-Conditioned Acoustic Models for Automatic Lyrics Transcription of Polyphonic Music.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

NHSS: A speech and singing parallel database.

[BibT_eX]

[DOI]

Speech Commun., 2021

The mutuality of social emotions: How the victim's reactive attitude influences the transgressor's emotional responses.

[BibT_eX]

[DOI]

NeuroImage, 2021

2020

Affective evaluation of others' altruistic decisions under risk and ambiguity.

[BibT_eX]

[DOI]

NeuroImage, 2020

Personalized Singing Voice Generation Using WaveRNN.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

2019

NUS Speak-to-Sing: A Web Platform for Personalized Speech-to-Singing Conversion.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speaker-independent Spectral Mapping for Speech-to-Singing Conversion.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Behaviour Pattern When Designers Have Difficulties.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2018

Analysis of Speech and Singing Signals for Temporal Alignment.

[BibT_eX]

[DOI]

Karthika Vijayan