Yuxuan Wang
Affiliations:
- ByteDance AI Lab, Mountain View, CA, USA
- Google, Mountain View, CA, USA
- Ohio State University, Columbus, OH, USA (former, PhD)
According to our database, Yuxuan Wang authored at least 90 papers between 2012 and 2024.
Bibliography
2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Joint Multiscale Cross-Lingual Speaking Style Transfer With Bidirectional Attention Mechanism for Automatic Dubbing.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition.
CoRR, 2024
CoRR, 2024
CoRR, 2024
VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing.
CoRR, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
InstructME: An Instruction Guided Music Edit And Remix Framework with Latent Diffusion Models.
CoRR, 2023
Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition.
CoRR, 2023
Joint Multi-scale Cross-lingual Speaking Style Transfer with Bidirectional Attention Mechanism for Automatic Dubbing.
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Memory Augmented Lookup Dictionary Based Language Modeling for Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Streaming Voice Conversion via Intermediate Bottleneck Features and Non-Streaming Teacher Guidance.
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
Trans. Int. Soc. Music. Inf. Retr., 2022
CoRR, 2022
Inferring Speaking Styles from Multi-modal Conversational Context by Multi-scale Relational Graph Convolutional Networks.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Non-intrusive Speech Quality Assessment with a Multi-Task Learning based Subband Adaptive Attention Temporal Convolutional Neural Network.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
NeuFA: Neural Network Based End-to-End Forced Alignment with Bidirectional Attention Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
High-Resolution Piano Transcription With Pedals by Regressing Onset and Offset Times.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Joint Echo Cancellation and Noise Suppression based on Cascaded Magnitude and Complex Mask Estimation.
CoRR, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021
ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Supervised Chorus Detection for Popular Music Using Convolutional Neural Network and Multi-Task Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Adversarial Feature Learning and Unsupervised Clustering Based Speech Synthesis for Found Data With Acoustic and Textual Noise.
IEEE Signal Process. Lett., 2020
High-resolution Piano Transcription with Pedals by Regressing Onsets and Offsets Times.
CoRR, 2020
Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement.
CoRR, 2020
CoRR, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
A Unified Sequence-to-Sequence Front-End Model for Mandarin Text-to-Speech Synthesis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Source Separation with Weakly Labelled Data: an Approach to Computational Auditory Scene Analysis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020
2019
Proceedings of the 7th International Conference on Learning Representations, 2019
Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization.
Proceedings of the IEEE International Conference on Acoustics, 2019
Semi-supervised Training for Improving Data Efficiency in End-to-end Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2019
Learning Hierarchical Representations for Expressive Speaking Style in End-to-End Speech Synthesis.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis.
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the Latent Variable Analysis and Signal Separation, 2015
2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
A feature study for classification-based speech separation at low signal-to-noise ratios.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
A feature study for classification-based speech separation at very low signal-to-noise ratio.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
IEEE Trans. Speech Audio Process., 2013
IEEE Trans. Speech Audio Process., 2013
A sparse representation approach for perceptual quality improvement of separated speech.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012